37 research outputs found

    ForestTreeDB: a database dedicated to the mining of tree transcriptomes

    Get PDF
    ForestTreeDB is intended as a resource that centralizes large-scale expressed sequence tag (EST) sequencing results from several tree species (). It currently encompasses 344 878 quality sequences from 68 libraries, from diverse organs of conifer and hybrid poplar trees. It utilizes the Nimbus data model to provide a hosting system for multiple projects, and uses object-relational mapping APIs in Java and Perl for data accesses within an Oracle database designed to be scalable, maintainable and extendable. Transcriptome builds or unigene sets occupy the focal point of the system. Several of the five current species-specific unigenes were used to design microarrays and SNP resources. The ForestTreeDB web application provides the means for multiple combination database queries. It presents the user with a list of discrete queries to retrieve and download large EST datasets or sequences from precompiled unigene assemblies. Functional annotation assignment is not trivial in conifers which are distantly related to angiosperm model plants. Optimal annotations are achieved through database queries that integrate results from several procedures based open-source tools. ForestTreeDB aims to facilitate sequence mining of coherent annotations in multiple species to support comparative genomic approaches. We plan to continuously enrich ForestTreeDB with other resources through collaborations with other genomic projects

    Meiosis-specific gene discovery in plants: RNA-Seq applied to isolated Arabidopsis male meiocytes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Meiosis is a critical process in the reproduction and life cycle of flowering plants in which homologous chromosomes pair, synapse, recombine and segregate. Understanding meiosis will not only advance our knowledge of the mechanisms of genetic recombination, but also has substantial applications in crop improvement. Despite the tremendous progress in the past decade in other model organisms (e.g., <it>Saccharomyces cerevisiae </it>and <it>Drosophila melanogaster</it>), the global identification of meiotic genes in flowering plants has remained a challenge due to the lack of efficient methods to collect pure meiocytes for analyzing the temporal and spatial gene expression patterns during meiosis, and for the sensitive identification and quantitation of novel genes.</p> <p>Results</p> <p>A high-throughput approach to identify meiosis-specific genes by combining isolated meiocytes, RNA-Seq, bioinformatic and statistical analysis pipelines was developed. By analyzing the studied genes that have a meiosis function, a pipeline for identifying meiosis-specific genes has been defined. More than 1,000 genes that are specifically or preferentially expressed in meiocytes have been identified as candidate meiosis-specific genes. A group of 55 genes that have mitochondrial genome origins and a significant number of transposable element (TE) genes (1,036) were also found to have up-regulated expression levels in meiocytes.</p> <p>Conclusion</p> <p>These findings advance our understanding of meiotic genes, gene expression and regulation, especially the transcript profiles of MGI genes and TE genes, and provide a framework for functional analysis of genes in meiosis.</p

    Spatiotemporal SNP analysis reveals pronounced biocomplexity at the northern range margin of Atlantic cod Gadus morhua

    Get PDF
    Accurate prediction of species distribution shifts in the face of climate change requires a sound understanding of population diversity and local adaptations. Previous modeling has suggested that global warming will lead to increased abundance of Atlantic cod (Gadus morhua) in the ocean around Greenland, but the dynamics of earlier abundance fluctuations are not well understood. We applied a retrospective spatiotemporal population genomics approach to examine the temporal stability of cod population structure in this region and to search for signatures of divergent selection over a 78-year period spanning major demographic changes. Analyzing >900 gene-associated single nucleotide polymorphisms in 847 individuals, we identified four genetically distinct groups that exhibited varying spatial distributions with considerable overlap and mixture. The genetic composition had remained stable over decades at some spawning grounds, whereas complete population replacement was evident at others. Observations of elevated differentiation in certain genomic regions are consistent with adaptive divergence between the groups, indicating that they may respond differently to environmental variation. Significantly increased temporal changes at a subset of loci also suggest that adaptation may be ongoing. These findings illustrate the power of spatiotemporal population genomics for revealing biocomplexity in both space and time and for informing future fisheries management and conservation efforts

    w

    Full text link

    Generation, annotation, analysis and database integration of 16,500 white spruce EST clusters

    Get PDF
    BACKGROUND: The sequencing and analysis of ESTs is for now the only practical approach for large-scale gene discovery and annotation in conifers because their very large genomes are unlikely to be sequenced in the near future. Our objective was to produce extensive collections of ESTs and cDNA clones to support manufacture of cDNA microarrays and gene discovery in white spruce (Picea glauca [Moench] Voss). RESULTS: We produced 16 cDNA libraries from different tissues and a variety of treatments, and partially sequenced 50,000 cDNA clones. High quality 3' and 5' reads were assembled into 16,578 consensus sequences, 45% of which represented full length inserts. Consensus sequences derived from 5' and 3' reads of the same cDNA clone were linked to define 14,471 transcripts. A large proportion (84%) of the spruce sequences matched a pine sequence, but only 68% of the spruce transcripts had homologs in Arabidopsis or rice. Nearly all the sequences that matched the Populus trichocarpa genome (the only sequenced tree genome) also matched rice or Arabidopsis genomes. We used several sequence similarity search approaches for assignment of putative functions, including blast searches against general and specialized databases (transcription factors, cell wall related proteins), Gene Ontology term assignation and Hidden Markov Model searches against PFAM protein families and domains. In total, 70% of the spruce transcripts displayed matches to proteins of known or unknown function in the Uniref100 database (blastx e-value < 1e-10). We identified multigenic families that appeared larger in spruce than in the Arabidopsis or rice genomes. Detailed analysis of translationally controlled tumour proteins and S-adenosylmethionine synthetase families confirmed a twofold size difference. Sequences and annotations were organized in a dedicated database, SpruceDB. Several search tools were developed to mine the data either based on their occurrence in the cDNA libraries or on functional annotations. CONCLUSION: This report illustrates specific approaches for large-scale gene discovery and annotation in an organism that is very distantly related to any of the fully sequenced genomes. The ArboreaSet sequences and cDNA clones represent a valuable resource for investigations ranging from plant comparative genomics to applied conifer genetics

    The Expansion of the PRAME Gene Family in Eutheria

    Get PDF
    The PRAME gene family belongs to the group of cancer/testis genes whose expression is restricted primarily to the testis and a variety of cancers. The expansion of this gene family as a result of gene duplication has been observed in primates and rodents. We analyzed the PRAME gene family in Eutheria and discovered a novel Y-linked PRAME gene family in bovine, PRAMEY, which underwent amplification after a lineage-specific, autosome-to-Y transposition. Phylogenetic analyses revealed two major evolutionary clades. Clade I containing the amplified PRAMEYs and the unamplified autosomal homologs in cattle and other eutherians is under stronger functional constraints; whereas, Clade II containing the amplified autosomal PRAMEs is under positive selection. Deep-sequencing analysis indicated that eight of the identified 16 PRAMEY loci are active transcriptionally. Compared to the bovine autosomal PRAME that is expressed predominantly in testis, the PRAMEY gene family is expressed exclusively in testis and is up-regulated during testicular maturation. Furthermore, the sense RNA of PRAMEY is expressed specifically whereas the antisense RNA is expressed predominantly in spermatids. This study revealed that the expansion of the PRAME family occurred in both autosomes and sex chromosomes in a lineage-dependent manner. Differential selection forces have shaped the evolution and function of the PRAME family. The positive selection observed on the autosomal PRAMEs (Clade II) may result in their functional diversification in immunity and reproduction. Conversely, selective constraints have operated on the expanded PRAMEYs to preserve their essential function in spermatogenesis

    ZNF280BY and ZNF280AY: autosome derived Y-chromosome gene families in Bovidae

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Recent progress in exploring the Y-chromosome gene content in humans, mice and cats have suggested that "autosome-to-Y" transposition of the male fertility genes is a recurrent theme during the mammalian Y-chromosome evolution. These transpositions are lineage-dependent. The purpose of this study is to investigate the lineage-specific Y-chromosome genes in bovid.</p> <p>Results</p> <p>We took a direct testis cDNA selection strategy and discovered two novel gene families, <it>ZNF280BY </it>and <it>ZNF280AY</it>, on the bovine (<it>Bos taurus</it>) Y-chromosome (BTAY), which originated from the transposition of a gene block on the bovine chromosome 17 (BTA17) and subsequently amplified. Approximately 130 active <it>ZNF280BY </it>loci (and ~240 pseudogenes) and ~130 pseudogenized <it>ZNF280AY </it>copies are present over the majority of the male-specific region (MSY). Phylogenetic analysis indicated that both gene families fit with the "birth-and-death" model of evolution. The active <it>ZNF280BY </it>loci share high sequence similarity and comprise three major genomic structures, resulted from insertions/deletions (indels). Assembly of a 1.2 Mb BTAY sequence in the MSY ampliconic region demonstrated that <it>ZNF280BY </it>and <it>ZNF280AY</it>, together with <it>HSFY </it>and <it>TSPY </it>families, constitute the major elements within the repeat units. The <it>ZNF280BY </it>gene family was found to express in different developmental stages of testis with sense RNA detected in all cell types of the seminiferous tubules while the antisense RNA detected only in the spermatids. Deep sequencing of the selected cDNAs revealed that different loci of <it>ZNF280BY </it>were differentially expressed up to 60-fold. Interestingly, different copies of the <it>ZNF280AY </it>pseudogenes were also found to differentially express up to 10-fold. However, expression level of the <it>ZNF280AY </it>pseudogenes was almost 6-fold lower than that of the <it>ZNF280BY </it>genes. <it>ZNF280BY </it>and <it>ZNF280AY </it>gene families are present in bovid, but absent in other mammalian lineages.</p> <p>Conclusions</p> <p><it>ZNF280BY </it>and <it>ZNF280AY </it>are lineage-specific, multi-copy Y-gene families specific to <it>Bovidae</it>, and are derived from the transposition of an autosomal gene block. The temporal and spatial expression patterns of <it>ZNF280BY</it>s in testis suggest a role in spermatogenesis. This study offers insights into the genomic organization of the bovine MSY and gene regulation in spermatogenesis, and provides a model for studying evolution of multi-copy gene families in mammals.</p

    Amplification and detection of lentiviral DNA inside cells.

    No full text
    corecore