14 research outputs found
The complete mitochondrial genome of Yarrowia lipolytica
We here report the complete nucleotide sequence of the 47.9 kb mitochondrial (mt) genome from the obligate aerobic yeast Yarrowia lipolytica. It encodes, all on the same strand, seven subunits of NADH: ubiquinone oxidoreductase (ND1-6, ND4L), apocytochrome b (COB), three subunits of cytochrome oxidase (COX1, 2, 3), three subunits of ATP synthetase (ATP6, 8 and 9), small and large ribosomal RNAs and an incomplete set of tRNAs. The Y. lipolytica mt genome is very similar to the Hansenula wingei mt genome, as judged from blocks of conserved gene order and from sequence homology. The extra DNA in the Y. lipolytica mt genome consists of 17 group 1 introns and stretches of A+Trich sequence, interspersed with potentially transposable GC clusters. The usual mould mt genetic code is used. Interestingly, there is no tRNA able to read CGN (arginine) codons. CGN codons could not be found in exonic open reading frames, whereas they do occur in intronic open reading frames. However, several of the intronic open reading frames have accumulated mutations and must be regarded as pseudogenes. We propose that this may have been triggered by the presence of untranslatable CGN codons. This sequence is available under EMBL Accession No. AJ307410
A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome
SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding
Molecular studies of hemocyanin expression in the Dungeness crab
Typescript.
Includes vita and abstract.
Bibliography: Includes bibliographical references (leaves 151-160).
Description: xiii, 160 leaves : ill. ; 29 cm
Development of a large SNP genotyping array and generation of high-density genetic maps in tomato.
The concurrent development of high-throughput genotyping platforms and next generation sequencing (NGS) has increased the number and density of genetic markers, the efficiency of constructing detailed linkage maps, and our ability to overlay recombination and physical maps of the genome. We developed an array for tomato with 8,784 Single Nucleotide Polymorphisms (SNPs) mainly discovered based on NGS-derived transcriptome sequences. Of the SNPs, 7,720 (88%) passed manufacturing quality control and could be scored in tomato germplasm. The array was used to generate high-density linkage maps for three interspecific F(2) populations: EXPEN 2000 (Solanum lycopersicum LA0925 x S. pennellii LA0716, 79 individuals), EXPEN 2012 (S. lycopersicum Moneymaker x S. pennellii LA0716, 160 individuals), and EXPIM 2012 (S. lycopersicum Moneymaker x S. pimpinellifolium LA0121, 183 individuals). The EXPEN 2000-SNP and EXPEN 2012 maps consisted of 3,503 and 3,687 markers representing 1,076 and 1,229 unique map positions (genetic bins), respectively. The EXPEN 2000-SNP map had an average marker bin interval of 1.6 cM, while the EXPEN 2012 map had an average bin interval of 0.9 cM. The EXPIM 2012 map was constructed with 4,491 markers (1,358 bins) and an average bin interval of 0.8 cM. All three linkage maps revealed an uneven distribution of markers across the genome. The dense EXPEN 2012 and EXPIM 2012 maps showed high levels of colinearity across all 12 chromosomes, and also revealed evidence of small inversions between LA0716 and LA0121. Physical positions of 7,666 SNPs were identified relative to the tomato genome sequence. The genetic and physical positions were mostly consistent. Exceptions were observed for chromosomes 3, 10 and 12. Comparing genetic positions relative to physical positions revealed that genomic regions with high recombination rates were consistent with the known distribution of euchromatin across the 12 chromosomes, while very low recombination rates were observed in the heterochromatic regions
Physical coverage of 7,666 SNP markers.
<p>Flanking sequences of SNPs were used for the automatic batch BLAST against the Tomato WGS chromosome database (v SL2.40; <a href="http://solgenomics.net/organism/Solanum_lycopersicum/genome" target="_blank">http://solgenomics.net/organism/Solanum_lycopersicum/genome</a>). The actual SNP positions relative to the Tomato genome sequence were identified using a custom Python script.</p
Comparative analysis of the EXPEN 2012 and EXPIM 2012 genetic maps relative to the draft assembly (v SL2.40;
<p>
<a href="http://solgenomics.net/organism/Solanum_lycopersicum/genome" target="_blank">http://solgenomics.net/organism/Solanum_lycopersicum/genome</a><b>) of the tomato reference genome sequence.</b></p
Colinearity between common markers for the three linkage maps.
1<p>Colinearity within each chromosome was assessed using common markers. The markers were ranked based on their map positions and the rank order was used for regression analysis, and expressed as R<sup>2</sup>.</p
Regression of marker order between the EXPEN 2012 and EXPIM 2012 linkage maps.
<p>The 2,841 SNP markers common to both maps were ranked based on their map positions within chromosomes for each map and the rank orders were used for regression analysis.</p