69 research outputs found

    OMA 2011: orthology inference among 1000 complete genomes

    Get PDF
    OMA (Orthologous MAtrix) is a database that identifies orthologs among publicly available, complete genomes. Initiated in 2004, the project is at its 11th release. It now includes 1000 genomes, making it one of the largest resources of its kind. Here, we describe recent developments in terms of species covered; the algorithmic pipeline—in particular regarding the treatment of alternative splicing, and new features of the web (OMA Browser) and programming interface (SOAP API). In the second part, we review the various representations provided by OMA and their typical applications. The database is publicly accessible at http://omabrowser.org

    A revised evolutionary history of the CYP1A subfamily : gene duplication, gene conversion, and positive selection

    Get PDF
    Author Posting. © The Authors, 2005. This is the author's version of the work. It is posted here by permission of Springer for personal use, not for redistribution. The definitive version was published in Journal of Molecular Evolution 62 (2006): 708-717, doi:10.1007/s00239-005-0134-z.Members of cytochrome P450 subfamily 1A (CYP1As) are involved in detoxification and bioactivation of common environmental pollutants. Understanding the functional evolution of these genes is essential to predicting and interpreting species differences in sensitivity to toxicity by such chemicals. The CYP1A gene subfamily comprises a single ancestral representative in most fish species and two paralogs in higher vertebrates, including birds and mammals. Phylogenetic analysis of complete coding sequences suggests that mammalian and bird paralog pairs (CYP1A1/2 and CYP1A4/5, respectively) are the result of independent gene duplication events. However, comparison of vertebrate genome sequences revealed that CYP1A genes lie within an extended region of conserved fine-scale synteny, suggesting that avian and mammalian CYP1A paralogs share a common genomic history. Algorithms designed to detect recombination between nucleotide sequences indicate that gene conversion has homogenized most of the length of the chicken CYP1A genes, as well as the 5’ end of mammalian CYP1As. Together, these data indicate that avian and mammalian CYP1A paralog pairs resulted from a single gene duplication event and that extensive gene conversion is responsible for the exceptionally high degree of sequence similarity between CYP1A4 and CYP1A5. Elevated non-synonymous/synonymous substitution ratios within a putatively unconverted stretch of ~250 bp suggests that positive selection may have reduced the effective rate of gene conversion in this region, which contains two substrate recognition sites. This work significantly alters our understanding of functional evolution in the CYP1A subfamily, suggesting that gene conversion and positive selection have been the dominant processes of sequence evolution.Funding for this work was provided by the NIH Superfund Basic Research Program at Boston University (5-P42-ES-07381) and by the Woods Hole Oceanographic Institution

    The Development of Three Long Universal Nuclear Protein-Coding Locus Markers and Their Application to Osteichthyan Phylogenetics with Nested PCR

    Get PDF
    BACKGROUND: Universal nuclear protein-coding locus (NPCL) markers that are applicable across diverse taxa and show good phylogenetic discrimination have broad applications in molecular phylogenetic studies. For example, RAG1, a representative NPCL marker, has been successfully used to make phylogenetic inferences within all major osteichthyan groups. However, such markers with broad working range and high phylogenetic performance are still scarce. It is necessary to develop more universal NPCL markers comparable to RAG1 for osteichthyan phylogenetics. METHODOLOGY/PRINCIPAL FINDINGS: We developed three long universal NPCL markers (>1.6 kb each) based on single-copy nuclear genes (KIAA1239, SACS and TTN) that possess large exons and exhibit the appropriate evolutionary rates. We then compared their phylogenetic utilities with that of the reference marker RAG1 in 47 jawed vertebrate species. In comparison with RAG1, each of the three long universal markers yielded similar topologies and branch supports, all in congruence with the currently accepted osteichthyan phylogeny. To compare their phylogenetic performance visually, we also estimated the phylogenetic informativeness (PI) profile for each of the four long universal NPCL markers. The PI curves indicated that SACS performed best over the whole timescale, while RAG1, KIAA1239 and TTN exhibited similar phylogenetic performances. In addition, we compared the success of nested PCR and standard PCR when amplifying NPCL marker fragments. The amplification success rate and efficiency of the nested PCR were overwhelmingly higher than those of standard PCR. CONCLUSIONS/SIGNIFICANCE: Our work clearly demonstrates the superiority of nested PCR over the conventional PCR in phylogenetic studies and develops three long universal NPCL markers (KIAA1239, SACS and TTN) with the nested PCR strategy. The three markers exhibit high phylogenetic utilities in osteichthyan phylogenetics and can be widely used as pilot genes for phylogenetic questions of osteichthyans at different taxonomic levels

    The genome, transcriptome, and proteome of the nematode Steinernema carpocapsae: Evolutionary signatures of a pathogenic lifestyle

    Get PDF
    The entomopathogenic nematode Steinernema carpocapsae has been widely used for the biological control of insect pests. It shares a symbiotic relationship with the bacterium Xenorhabdus nematophila, and is emerging as a genetic model to study symbiosis and pathogenesis. We obtained a high-quality draft of the nematode’s genome comprising 84,613,633 bp in 347 scaffolds, with an N50 of 1.24 Mb. To improve annotation, we sequenced both short and long RNA and conducted shotgun proteomic analyses. S. carpocapsae shares orthologous genes with other parasitic nematodes that are absent in the free-living nematode C. elegans, it has ncRNA families that are enriched in parasites, and expresses proteins putatively associated with parasitism and pathogenesis, suggesting an active role for the nematode during the pathogenic process. Host and parasites might engage in a co-evolutionary arms-race dynamic with genes participating in their interaction showing signatures of positive selection. Our analyses indicate that the consequence of this arms race is better characterized by positive selection altering specific functions instead of just increasing the number of positively selected genes, adding a new perspective to these co-evolutionary theories. We identified a protein, ATAD-3, that suggests a relevant role for mitochondrial function in the evolution and mechanisms of nematode parasitism

    Origin and Evolution of TRIM Proteins: New Insights from the Complete TRIM Repertoire of Zebrafish and Pufferfish

    Get PDF
    Tripartite motif proteins (TRIM) constitute a large family of proteins containing a RING-Bbox-Coiled Coil motif followed by different C-terminal domains. Involved in ubiquitination, TRIM proteins participate in many cellular processes including antiviral immunity. The TRIM family is ancient and has been greatly diversified in vertebrates and especially in fish. We analyzed the complete sets of trim genes of the large zebrafish genome and of the compact pufferfish genome. Both contain three large multigene subsets - adding the hsl5/trim35-like genes (hltr) to the ftr and the btr that we previously described - all containing a B30.2 domain that evolved under positive selection. These subsets are conserved among teleosts. By contrast, most human trim genes of the other classes have only one or two orthologues in fish. Loss or gain of C-terminal exons generated proteins with different domain organizations; either by the deletion of the ancestral domain or, remarkably, by the acquisition of a new C-terminal domain. Our survey of fish trim genes in fish identifies subsets with different evolutionary dynamics. trims encoding RBCC-B30.2 proteins show the same evolutionary trends in fish and tetrapods: they evolve fast, often under positive selection, and they duplicate to create multigenic families. We could identify new combinations of domains, which epitomize how new trim classes appear by domain insertion or exon shuffling. Notably, we found that a cyclophilin-A domain replaces the B30.2 domain of a zebrafish fintrim gene, as reported in the macaque and owl monkey antiretroviral TRIM5α. Finally, trim genes encoding RBCC-B30.2 proteins are preferentially located in the vicinity of MHC or MHC gene paralogues, which suggests that such trim genes may have been part of the ancestral MHC

    Finding Single Copy Genes Out of Sequenced Genomes for Multilocus Phylogenetics in Non-Model Fungi

    Get PDF
    Historically, fungal multigene phylogenies have been reconstructed based on a small number of commonly used genes. The availability of complete fungal genomes has given rise to a new wave of model organisms that provide large number of genes potentially useful for building robust gene genealogies. Unfortunately, cross-utilization of these resources to study phylogenetic relationships in the vast majority of non-model fungi (i.e. “orphan” species) remains an unexamined question. To address this problem, we developed a method coupled with a program named “PHYLORPH” (PHYLogenetic markers for ORPHans). The method screens fungal genomic databases (107 fungal genomes fully sequenced) for single copy genes that might be easily transferable and well suited for studies at low taxonomic levels (for example, in species complexes) in non-model fungal species. To maximize the chance to target genes with informative regions, PHYLORPH displays a graphical evaluation system based on the estimation of nucleotide divergence relative to substitution type. The usefulness of this approach was tested by developing markers in four non-model groups of fungal pathogens. For each pathogen considered, 7 to 40% of the 10–15 best candidate genes proposed by PHYLORPH yielded sequencing success. Levels of polymorphism of these genes were compared with those obtained for some genes traditionally used to build fungal phylogenies (e.g. nuclear rDNA, β-tubulin, γ-actin, Elongation factor EF-1α). These genes were ranked among the best-performing ones and resolved accurately taxa relationships in each of the four non-model groups of fungi considered. We envision that PHYLORPH will constitute a useful tool for obtaining new and accurate phylogenetic markers to resolve relationships between closely related non-model fungal species

    Identification of Hyaloperonospora arabidopsidis Transcript Sequences Expressed during Infection Reveals Isolate-Specific Effectors

    Get PDF
    Biotrophic plant pathogens secrete effector proteins that are important for infection of the host. The aim of this study was to identify effectors of the downy mildew pathogen Hyaloperonospora arabidopsidis (Hpa) that are expressed during infection of its natural host Arabidopsis thaliana. Infection-related transcripts were identified from Expressed Sequence Tags (ESTs) derived from leaves of the susceptible Arabidopsis Ws eds1-1 mutant inoculated with the highly virulent Hpa isolate Waco9. Assembly of 6364 ESTs yielded 3729 unigenes, of which 2164 were Hpa-derived. From the translated Hpa unigenes, 198 predicted secreted proteins were identified. Of these, 75 were found to be Hpa-specific and six isolate Waco9-specific. Among 42 putative effectors identified there were three Elicitin-like proteins, 16 Cysteine-rich proteins and 18 host-translocated RXLR effectors. Sequencing of alleles in different Hpa isolates revealed that five RXLR genes show signatures of diversifying selection. Thus, EST analysis of Hpa-infected Arabidopsis is proving to be a powerful method for identifying pathogen effector candidates expressed during infection. Delivery of the Waco9-specific protein RXLR29 in planta revealed that this effector can suppress PAMP-triggered immunity and enhance disease susceptibility. We propose that differences in host colonization can be conditioned by isolate-specific effectors
    corecore