41 research outputs found

    Transposable element distribution, abundance and role in genome size variation in the genus Oryza

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The genus <it>Oryza </it>is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop – rice (<it>Oryza sativa </it>[AA]). Genome size variation in the <it>Oryza </it>is more than 3-fold and ranges from 357 Mbp in <it>Oryza glaberrima </it>[AA] to 1283 Mbp in the polyploid <it>Oryza ridleyi </it>[HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative <it>Oryza </it>species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeats Retrotransposons, Long interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in their contributing to genome size variation.</p> <p>Results</p> <p>We identified the elements primarily responsible for the most strikingly genome size variation in <it>Oryza</it>. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat Retrotransposons is substantially conserved and ubiquitous throughout the <it>Oryza </it>and so its origin is ancient and its existence predates the speciation events that originated the genus. Finally we described the peculiar behavior of repeats in the species <it>Oryza coarctata </it>[HHKK] whose placement in the <it>Oryza </it>genus is controversial.</p> <p>Conclusion</p> <p>Long Terminal Repeat retrotransposons are the major component of the <it>Oryza </it>genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the <it>Oryza </it>genus. Two families of Ty3-<it>gypsy </it>elements (<it>RIRE2 </it>and <it>Atlantys</it>) account for a significant portion of the genome size variations present in the <it>Oryza </it>genus.</p

    DNA methylation changes facilitated evolution of genes derived from Mutator-like transposable elements

    Get PDF
    Supplementary file S2. Accession numbers and URLs for genome assembly, transcriptome and methylome data that used in this project. (DOCX 101 kb

    Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends.</p> <p>Results</p> <p>A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the <it>Sal</it>I MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary.</p> <p>Conclusion</p> <p>MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.</p

    Genome-wide association mapping of date palm fruit traits

    Get PDF
    Date palms (Phoenix dactylifera) are an important fruit crop of arid regions of the Middle East and North Africa. Despite its importance, few genomic resources exist for date palms, hampering evolutionary genomic studies of this perennial species. Here we report an improved long-read genome assembly for P. dactylifera that is 772.3 Mb in length, with contig N50 of 897.2 Kb, and use this to perform genome-wide association studies (GWAS) of the sex determining region and 21 fruit traits. We find a fruit color GWAS at the R2R3-MYB transcription factor VIRESCENS gene and identify functional alleles that include a retrotransposon insertion and start codon mutation. We also find a GWAS peak for sugar composition spanning deletion polymorphisms in multiple linked invertase genes. MYB transcription factors and invertase are implicated in fruit color and sugar composition in other crops, demonstrating the importance of parallel evolution in the evolutionary diversification of domesticated species

    A BAC library of the SP80-3280 sugarcane variety (saccharum sp.) and its inferred microsynteny with the sorghum genome

    Get PDF
    BACKGROUND:Sugarcane breeding has significantly progressed in the last 30 years, but achieving additional yield gains has been difficult because of the constraints imposed by the complex ploidy of this crop. Sugarcane cultivars are interspecific hybrids between Saccharum officinarum and Saccharum spontaneum. S. officinarum is an octoploid with 2n=80 chromosomes while S. spontaneum has 2n=40 to 128 chromosomes and ploidy varying from 5 to 16. The hybrid genome is composed of 70-80%S. officinaram and 5-20%S. spontaneum chromosomes and a small proportion of recombinants. Sequencing the genome of this complex crop may help identify useful genes, either per se or through comparative genomics using closely related grasses. The construction and sequencing of a bacterial artificial chromosome (BAC) library of an elite commercial variety of sugarcane could help assembly the sugarcane genome.RESULTS:A BAC library designated SS_SBa was constructed with DNA isolated from the commercial sugarcane variety SP80-3280. The library contains 36,864 clones with an average insert size of 125 Kb, 88% of which has inserts larger than 90 Kb. Based on the estimated genome size of 760-930 Mb, the library exhibits 5-6 times coverage the monoploid sugarcane genome. Bidirectional BAC end sequencing (BESs) from a random sample of 192 BAC clones sampled genes and repetitive elements of the sugarcane genome. Forty-five per cent of the total BES nucleotides represents repetitive elements, 83% of which belonging to LTR retrotransposons. Alignment of BESs corresponding to 42 BACs to the genome sequence of the 10 sorghum chromosomes revealed regions of microsynteny, with expansions and contractions of sorghum genome regions relative to the sugarcane BAC clones. In general, the sampled sorghum genome regions presented an average 29% expansion in relation to the sugarcane syntenic BACs.CONCLUSION:The SS_SBa BAC library represents a new resource for sugarcane genome sequencing. An analysis of insert size, genome coverage and orthologous alignment with the sorghum genome revealed that the library presents whole genome coverage. The comparison of syntenic regions of the sorghum genome to 42 SS_SBa BES pairs revealed that the sorghum genome is expanded in relation to the sugarcane genome.This item is part of the UA Faculty Publications collection. For more information this item or other items in the UA Campus Repository, contact the University of Arizona Libraries at [email protected]

    Transposable element distribution, abundance and role in genome size variation in the genus -6

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "Transposable element distribution, abundance and role in genome size variation in the genus "</p><p>http://www.biomedcentral.com/1471-2148/7/152</p><p>BMC Evolutionary Biology 2007;7():152-152.</p><p>Published online 29 Aug 2007</p><p>PMCID:PMC2041954.</p><p></p>(2006) [9] and normalized to 1 Mbp. The confidence intervals, displayed as error bars, were calculated assuming a Poisson distribution of repeats in the genome. The mean length used in calculation for different repeats were as follows: Ty3-elements: 12 Kbp; Ty1-elements: 5.5 Kbp; others LTR-RTs not classified (LTR-retrotransposons): 8.75 Kbp; LINEs: 3.5 kbp; helitrons (complete autonomous): 12.8 Kbp; CACTA: 15.2 Kbp; hAT: 3.6 Kbp *In this case all the calculations are based on a rough estimate of the genome size of this species: the real value is unknown, we therefore used the value estimated for [HHJJ; 1283 Mbp], which is also an allotetraploid species and shares the HH genome type with

    Transposable element distribution, abundance and role in genome size variation in the genus -1

    No full text
    <p><b>Copyright information:</b></p><p>Taken from "Transposable element distribution, abundance and role in genome size variation in the genus "</p><p>http://www.biomedcentral.com/1471-2148/7/152</p><p>BMC Evolutionary Biology 2007;7():152-152.</p><p>Published online 29 Aug 2007</p><p>PMCID:PMC2041954.</p><p></p>netic tree using the neighbor-joining method. Bootstrap values were calculated for 1000 replicates; only those with values greater than 50 are proposed B) distribution of the domains isolated in different species. Bar colors are the same of those used in the circles marking, on the neighbor-joining tree sequences from different species
    corecore