68 research outputs found

    Phylostratigraphic tracking of cancer genes suggests a link to the emergence of multicellularity in metazoa

    Get PDF
    Background: Phylostratigraphy is a method used to correlate the evolutionary origin of founder genes (that is, functional founder protein domains) of gene families with particular macroevolutionary transitions. It is based on a model of genome evolution that suggests that the origin of complex phenotypic innovations will be accompanied by the emergence of such founder genes, the descendants of which can still be traced in extant organisms. The origin of multicellularity can be considered to be a macroevolutionary transition, for which new gene functions would have been required. Cancer should be tightly connected to multicellular life since it can be viewed as a malfunction of interaction between cells in a multicellular organism. A phylostratigraphic tracking of the origin of cancer genes should, therefore, also provide insights into the origin of multicellularity. Results: We find two strong peaks of the emergence of cancer related protein domains, one at the time of the origin of the first cell and the other around the time of the evolution of the multicellular metazoan organisms. These peaks correlate with two major classes of cancer genes, the 'caretakers', which are involved in general functions that support genome stability and the 'gatekeepers', which are involved in cellular signalling and growth processes. Interestingly, this phylogenetic succession mirrors the ontogenetic succession of tumour progression, where mutations in caretakers are thought to precede mutations in gatekeepers. Conclusions: A link between multicellularity and formation of cancer has often been predicted. However, this has not so far been explicitly tested. Although we find that a significant number of protein domains involved in cancer predate the origin of multicellularity, the second peak of cancer protein domain emergence is, indeed, connected to a phylogenetic level where multicellular animals have emerged. The fact that we can find a strong and consistent signal for this second peak in the phylostratigraphic map implies that a complex multi-level selection process has driven the transition to multicellularity

    ProteinHistorian: Tools for the Comparative Analysis of Eukaryote Protein Origin

    Get PDF
    The evolutionary history of a protein reflects the functional history of its ancestors. Recent phylogenetic studies identified distinct evolutionary signatures that characterize proteins involved in cancer, Mendelian disease, and different ontogenic stages. Despite the potential to yield insight into the cellular functions and interactions of proteins, such comparative phylogenetic analyses are rarely performed, because they require custom algorithms. We developed ProteinHistorian to make tools for performing analyses of protein origins widely available. Given a list of proteins of interest, ProteinHistorian estimates the phylogenetic age of each protein, quantifies enrichment for proteins of specific ages, and compares variation in protein age with other protein attributes. ProteinHistorian allows flexibility in the definition of protein age by including several algorithms for estimating ages from different databases of evolutionary relationships. We illustrate the use of ProteinHistorian with three example analyses. First, we demonstrate that proteins with high expression in human, compared to chimpanzee and rhesus macaque, are significantly younger than those with human-specific low expression. Next, we show that human proteins with annotated regulatory functions are significantly younger than proteins with catalytic functions. Finally, we compare protein length and age in many eukaryotic species and, as expected from previous studies, find a positive, though often weak, correlation between protein age and length. ProteinHistorian is available through a web server with an intuitive interface and as a set of command line tools; this allows biologists and bioinformaticians alike to integrate these approaches into their analysis pipelines. ProteinHistorian's modular, extensible design facilitates the integration of new datasets and algorithms. The ProteinHistorian web server, source code, and pre-computed ages for 32 eukaryotic genomes are freely available under the GNU public license at http://lighthouse.ucsf.edu/ProteinHistorian/

    The oyster genome reveals stress adaptation and complexity of shell formation

    Get PDF
    The Pacific oyster Crassostrea gigas belongs to one of the most species-rich but genomically poorly explored phyla, the Mollusca. Here we report the sequencing and assembly of the oyster genome using short reads and a fosmid-pooling strategy, along with transcriptomes of development and stress response and the proteome of the shell. The oyster genome is highly polymorphic and rich in repetitive sequences, with some transposable elements still actively shaping variation. Transcriptome studies reveal an extensive set of genes responding to environmental stress. The expansion of genes coding for heat shock protein 70 and inhibitors of apoptosis is probably central to the oyster's adaptation to sessile life in the highly stressful intertidal zone. Our analyses also show that shell formation in molluscs is more complex than currently understood and involves extensive participation of cells and their exosomes. The oyster genome sequence fills a void in our understanding of the Lophotrochozoa. © 2012 Macmillan Publishers Limited. All rights reserved

    Comparative analysis of the transcriptome across distant species

    Get PDF
    The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters

    A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns

    No full text
    Parallels between phylogeny and ontogeny have been discussed for almost two centuries, and a number of theories have been proposed to explain such patterns1. Especially elusive is the phylotypic stage, a phase during development where species within a phylum are particularly similar to each other2, 3, 4, 5, 6. Although this has formerly been interpreted as a recapitulation of phylogeny1, it is now thought to reflect an ontogenetic progression phase2, where strong constraints on developmental regulation and gene interactions exist2, 3. Several studies have shown that genes expressed during this stage evolve at a slower rate, but it has so far not been possible to derive an unequivocal molecular signature associated with this stage7, 8, 9, 10, 11, 12, 13, 14, 15. Here we use a combination of phylostratigraphy16 and stage-specific gene expression data to generate a cumulative index that reflects the evolutionary age of the transcriptome at given ontogenetic stages. Using zebrafish ontogeny and adult development as a model, we find that the phylotypic stage does indeed express the oldest transcriptome set and that younger sets are expressed during early and late development, thus faithfully mirroring the hourglass model of morphological divergence2, 3. Reproductively active animals show the youngest transcriptome, with major differences between males and females. Notably, ageing animals express increasingly older genes. Comparisons with similar data sets from flies and nematodes show that this pattern occurs across phyla. Our results indicate that an old transcriptome marks the phylotypic phase and that phylogenetic differences at other ontogenetic stages correlate with the expression of newly evolved genes

    Evolutionary origin of orphan genes

    No full text
    Orphangenes are genes that occur in specific evolutionary lineages without similarity to genes outside of these lineages and have, therefore, alternatively been named taxonomically restricted genes. They were so far considered to emerge through duplication–divergence processes, but it isnowbecomingclear that they can also arise de novo out of noncoding deoxyribonucleic acid (DNA). This latter process may even occur much more frequently than previously assumed. It appears that genomes harbour many transcripts in a transition stage from nonfunctional to functional genes, also knownas protogenes, which are exposed to evolutionary testing and can become fixed when they turn out to be useful. Orphan genes may have played key roles in generating lineagespecific adaptations and could be a continuous source of evolutionary novelties. Their existence suggests that functional ribonucleic acids (RNAs) and proteins can relatively easily arise out of randomnucleotide sequences, although these processes still need to be experimentally explored
    corecore