71 research outputs found

    Phylogenetic Signal Variation in the Genomes of Medicago (Fabaceae)

    Genome-scale data offer the opportunity to clarify phylogenetic relationships that are difficult to resolve with few loci, but they can also identify genomic regions with evolutionary history distinct from that of the species history. We collected whole-genome sequence data from 29 taxa in the legume genus Medicago, then aligned these sequences to the Medicago truncatula reference genome to confidently identify 87 596 variable homologous sites. We used this data set to estimate phylogenetic relationships among Medicago species, to investigate the number of sites needed to provide robust phylogenetic estimates and to identify specific genomic regions supporting topologies in conflict with the genome-wide phylogeny. Our full genomic data set resolves relationships within the genus that were previously intractable. Subsampling the data reveals considerable variation in phylogenetic signal and power in smaller subsets of the data. Even when sampling 5000 sites, no random sample of the data supports a topology identical to that of the genome-wide phylogeny. Phylogenetic relationships estimated from 500-site sliding windows revealed genome regions supporting several alternative species relationships among recently diverged taxa, consistent with the expected effects of deep coalescence or introgression in the recent history of Medicago. [Medicago; phylogenomics; whole-genome resequencing.]

    Methylation-sensitive linking libraries enhance gene-enriched sequencing of complex genomes and map DNA methylation domains

    Background: Many plant genomes are resistant to whole-genome assembly due to an abundance of repetitive sequence, leading to the development of gene-rich sequencing techniques. Two such techniques are hypomethylated partial restriction (HMPR) and methylation spanning linker libraries (MSLL). These libraries differ from other gene-rich datasets in having larger insert sizes, and the MSLL clones are designed to provide reads localized to "epigenetic boundaries" where methylation begins or ends. Results: A large-scale study in maize generated 40,299 HMPR sequences and 80,723 MSLL sequences, including MSLL clones exceeding 100 kb. The paired end reads of MSLL and HMPR clones were shown to be effective in linking existing gene-rich sequences into scaffolds. In addition, it was shown that the MSLL clones can be used for anchoring these scaffolds to a BAC-based physical map. The MSLL end reads effectively identified epigenetic boundaries, as indicated by their preferential alignment to regions upstream and downstream from annotated genes. The ability to precisely map long stretches of fully methylated DNA sequence is a unique outcome of MSLL analysis, and was also shown to provide evidence for errors in gene identification. MSLL clones were observed to be significantly more repeat-rich in their interiors than in their end reads, confirming the correlation between methylation and retroelement content. Both MSLL and HMPR reads were found to be substantially gene-enriched, with the SalI MSLL libraries being the most highly enriched (31% align to an EST contig), while the HMPR clones exhibited exceptional depletion of repetitive DNA (to ~11%). These two techniques were compared with other gene-enrichment methods, and shown to be complementary. Conclusion: MSLL technology provides an unparalleled approach for mapping the epigenetic status of repetitive blocks and for identifying sequences mis-identified as genes. Although the types and natures of epigenetic boundaries are barely understood at this time, MSLL technology flags both approximate boundaries and methylated genes that deserve additional investigation. MSLL and HMPR sequences provide a valuable resource for maize genome annotation, and are a uniquely valuable complement to any plant genome sequencing project. In order to make these results fully accessible to the community, a web display was developed that shows the alignment of MSLL, HMPR, and other gene-rich sequences to the BACs; this display is continually updated with the latest ESTs and BAC sequences.

    Physical and Genetic Structure of the Maize Genome Reflects Its Complex Evolutionary History

    Maize (Zea mays L.) is one of the most important cereal crops and a model for the study of genetics, evolution, and domestication. To better understand maize genome organization and to build a framework for genome sequencing, we constructed a sequence-ready fingerprinted contig-based physical map that covers 93.5% of the genome, of which 86.1% is aligned to the genetic map. The fingerprinted contig map contains 25,908 genic markers that enabled us to align nearly 73% of the anchored maize genome to the rice genome. The distribution pattern of expressed sequence tags correlates to that of recombination. In collinear regions, 1 kb in rice corresponds to an average of 3.2 kb in maize, yet maize has a 6-fold genome size expansion. This can be explained by the fact that most rice regions correspond to two regions in maize as a result of its recent polyploid origin. Inversions account for the majority of chromosome structural variations during subsequent maize diploidization. We also find clear evidence of ancient genome duplication predating the divergence of the progenitors of maize and rice. Reconstructing the paleoethnobotany of the maize genome indicates that the progenitors of modern maize contained ten chromosomes.

    Citation: Wei F, Coe E, Nelson W, Bharti AK, Engler F, et al. (2007) Physical and genetic structure of the maize genome reflects its complex evolutionary history. PLoS Genet 3(7): e123.

    The Expansion of the PRAME Gene Family in Eutheria

    The PRAME gene family belongs to the group of cancer/testis genes whose expression is restricted primarily to the testis and a variety of cancers. The expansion of this gene family as a result of gene duplication has been observed in primates and rodents. We analyzed the PRAME gene family in Eutheria and discovered a novel Y-linked PRAME gene family in bovine, PRAMEY, which underwent amplification after a lineage-specific, autosome-to-Y transposition. Phylogenetic analyses revealed two major evolutionary clades. Clade I, containing the amplified PRAMEYs and the unamplified autosomal homologs in cattle and other eutherians, is under stronger functional constraints, whereas Clade II, containing the amplified autosomal PRAMEs, is under positive selection. Deep-sequencing analysis indicated that eight of the identified 16 PRAMEY loci are active transcriptionally. Compared to the bovine autosomal PRAME that is expressed predominantly in testis, the PRAMEY gene family is expressed exclusively in testis and is up-regulated during testicular maturation. Furthermore, the sense RNA of PRAMEY is expressed specifically whereas the antisense RNA is expressed predominantly in spermatids. This study revealed that the expansion of the PRAME family occurred in both autosomes and sex chromosomes in a lineage-dependent manner. Differential selection forces have shaped the evolution and function of the PRAME family. The positive selection observed on the autosomal PRAMEs (Clade II) may result in their functional diversification in immunity and reproduction. Conversely, selective constraints have operated on the expanded PRAMEYs to preserve their essential function in spermatogenesis.

    ZNF280BY and ZNF280AY: autosome derived Y-chromosome gene families in Bovidae

    Background: Recent progress in exploring the Y-chromosome gene content in humans, mice and cats has suggested that "autosome-to-Y" transposition of the male fertility genes is a recurrent theme during mammalian Y-chromosome evolution. These transpositions are lineage-dependent. The purpose of this study is to investigate the lineage-specific Y-chromosome genes in bovids. Results: We took a direct testis cDNA selection strategy and discovered two novel gene families, ZNF280BY and ZNF280AY, on the bovine (Bos taurus) Y-chromosome (BTAY), which originated from the transposition of a gene block on bovine chromosome 17 (BTA17) and were subsequently amplified. Approximately 130 active ZNF280BY loci (and ~240 pseudogenes) and ~130 pseudogenized ZNF280AY copies are present over the majority of the male-specific region (MSY). Phylogenetic analysis indicated that both gene families fit the "birth-and-death" model of evolution. The active ZNF280BY loci share high sequence similarity and comprise three major genomic structures resulting from insertions/deletions (indels). Assembly of a 1.2 Mb BTAY sequence in the MSY ampliconic region demonstrated that ZNF280BY and ZNF280AY, together with the HSFY and TSPY families, constitute the major elements within the repeat units. The ZNF280BY gene family was found to be expressed in different developmental stages of testis, with sense RNA detected in all cell types of the seminiferous tubules and antisense RNA detected only in the spermatids. Deep sequencing of the selected cDNAs revealed that different loci of ZNF280BY were differentially expressed up to 60-fold. Interestingly, different copies of the ZNF280AY pseudogenes were also found to be differentially expressed up to 10-fold. However, the expression level of the ZNF280AY pseudogenes was almost 6-fold lower than that of the ZNF280BY genes. The ZNF280BY and ZNF280AY gene families are present in bovids, but absent in other mammalian lineages. Conclusions: ZNF280BY and ZNF280AY are lineage-specific, multi-copy Y-gene families specific to Bovidae, and are derived from the transposition of an autosomal gene block. The temporal and spatial expression patterns of ZNF280BYs in testis suggest a role in spermatogenesis. This study offers insights into the genomic organization of the bovine MSY and gene regulation in spermatogenesis, and provides a model for studying evolution of multi-copy gene families in mammals.

    The Marine Microbial Eukaryote Transcriptome Sequencing Project (MMETSP): illuminating the functional diversity of eukaryotic life in the oceans through transcriptome sequencing

    Current sampling of genomic sequence data from eukaryotes is relatively poor, biased, and inadequate to address important questions about their biology, evolution, and ecology; this Community Page describes a resource of 700 transcriptomes from marine microbial eukaryotes to help understand their role in the world's oceans.

    Molecular characterization of transparent testa (tt) mutants of Arabidopsis thaliana (ecotype Estland) impaired in flavonoid biosynthetic pathway

    Detailed analysis of four transparent testa (tt) mutants of Arabidopsis thaliana (ecotype Estland) that lack anthocyanin pigments indicated that three are allelic to known mutants tt3, tt4 and ttg1 (mutants of DFR, CHS and TTG1 genes, respectively) while the fourth represents a new tt mutant (tt17). It is known through 3-D crystal structure analysis of CHS2 in Medicago [Nat. Struct. Biol. 6 (1999) 775] that Cys164 (key active site residue) is activated by His303 (corresponds to His309 in Arabidopsis). The substitution of His309 by Tyr309 in the tt4 (Est) mutant analyzed in this study causes instability of CHS protein, thus providing evidence for the functional significance of this histidine residue. The ttg1 (Est) mutant harbors a change from Ser101 to Phe101 in the region preceding the WD-repeats, indicating a critical role of Ser101 in the function of transcriptional regulator TTG1. In the tt3 (Est) mutant, a 7 bp deletion generates a premature stop codon. The nature and function of TT17 in anthocyanin biosynthesis are yet to be defined. This study also revealed reduced transcript abundance of ACCase in all four tt mutants examined, suggesting it to be a control point for flux of supply products from primary to secondary metabolism.

    Mutants of Arabidopsis as tools to understand the regulation of phenylpropanoid pathway and UVB protection mechanisms

    Plants accumulate certain phenylpropanoid compounds in the vacuoles of their epidermal and subepidermal cell layers thereby protecting the underlying tissue against UVB-induced damage. However, a number of mutants of Arabidopsis thaliana are known that fail to synthesize these protective pigments, thereby allowing harmful UVB radiation to penetrate into their dermal layers. Study of several of these nonlethal mutants, defective in various aspects of flavonoid and lignin biosynthesis, has led to a better understanding of the coordinate regulation and expression of important genes as well as of mechanisms involved in plant defense against UVB radiation. The characteristics of the various phenylpropanoid mutants of Arabidopsis, viz. flavonoid mutants (banyuls [ban], increased chalcone synthase expression 1 [icx1], transparent testa [tt] and ultraviolet sensitive [uvs]) and hydroxycinnamic acid ester mutants (ferulic acid hydroxylase 1 [fah1] and sinapoylglucose accumulator 1 [sng1]) are discussed in detail. We have briefly touched upon, wherever relevant, the unique aspects in other plant species too.