Search CORE

16 research outputs found

Widespread eukaryotic sequences, highly similar to bacterial DNA polymerase I, looking for functions

Author: Sonnhammer Erik L.L.
Wootton John C.
Publication venue: Published by Elsevier Ltd.
Publication date: 01/08/1997
Field of study

Advantages of combined transmembrane topology and signal peptide prediction—the Phobius web server

Author: Krogh Anders
Käll Lukas
Sonnhammer Erik L.L.
Publication venue: Oxford University Press
Publication date: 01/01/2007
Field of study

When using conventional transmembrane topology and signal peptide predictors, such as TMHMM and SignalP, there is a substantial overlap between these two types of predictions. Applying these methods to five complete proteomes, we found that 30–65% of all predicted signal peptides and 25–35% of all predicted transmembrane topologies overlap. This impairs predictions of 5–10% of the proteome, hence this is an important issue in protein annotation

Crossref

PubMed Central

Copenhagen University Research Information System

Big data and other challenges in the quest for orthologs

Author: Boeckmann Brigitte
Dessimoz Christophe
Gabaldón Toni
Martin Maria
Robinson-Rechavi Marc
Sonnhammer Erik L.L.
Sousa da Silva Alan W.
Thomas Paul D.
Publication venue
Publication date: 02/08/2017
Field of study

Given the rapid increase of species with a sequenced genome, the need to identify orthologous genes between them has emerged as a central bioinformatics task. Many different methods exist for orthology detection, which makes it difficult to decide which one to choose for a particular application. Here, we review the latest developments and issues in the orthology field, and summarize the most recent results reported at the third ‘Quest for Orthologs' meeting. We focus on community efforts such as the adoption of reference proteomes, standard file formats and benchmarking. Progress in these areas is good, and they are already beneficial to both orthology consumers and providers. However, a major current issue is that the massive increase in complete proteomes poses computational challenges to many of the ortholog database providers, as most orthology inference algorithms scale at least quadratically with the number of proteomes. The Quest for Orthologs consortium is an open community with a number of working groups that join efforts to enhance various aspects of orthology analysis, such as defining standard formats and datasets, documenting community resources and benchmarking. Availability and implementation: All such materials are available at http://questfororthologs.org. Contact: [email protected] or [email protected]

RERO DOC Digital Library

Global networks of functional coupling in eukaryotes from comprehensive data integration

Author: Alexeyenko Andrey
Sonnhammer Erik L.L.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date
Field of study

No single experimental method can discover all connections in the interactome. A computational approach can help by integrating data from multiple, often unrelated, proteomics and genomics pipelines. Reconstructing global networks of functional coupling (FC) faces the challenges of scale and heterogeneity—how to efficiently integrate huge amounts of diverse data from multiple organisms, yet ensuring high accuracy. We developed FunCoup, an optimized Bayesian framework, to resolve these issues. Because interactomes comprise functional coupling of many types, FunCoup annotates network edges with confidence scores in support of different kinds of interactions: physical interaction, protein complex member, metabolic, or signaling link. This capability boosted overall accuracy. On the whole, the constructed framework was comprehensively tested to optimize the overall confidence and ensure seamless, automated incorporation of new data sets of heterogeneous types. Using over 50 data sets in seven organisms and extensively transferring information between orthologs, FunCoup predicted global networks in eight eukaryotes. For the Ciona intestinalis network, only orthologous information was used, and it recovered a significant number of experimental facts. FunCoup predictions were validated on independent cancer mutation data. We show how FunCoup can be used for discovering candidate members of the Parkinson and Alzheimer pathways. Cross-species pathway conservation analysis provided further support to these observations

Crossref

PubMed Central

Genomic Gene Clustering Analysis of Pathways in Eukaryotes

Author: Lee Jennifer M.
Sonnhammer Erik L.L.
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 01/05/2003
Field of study

Genomic clustering of genes in a pathway is commonly found in prokaryotes due to transcriptional operons, but these are not present in most eukaryotes. Yet, there might be clustering to a lesser extent of pathway members in eukaryotic genomes, that assist coregulation of a set of functionally cooperating genes. We analyzed five sequenced eukaryotic genomes for clustering of genes assigned to the same pathway in the KEGG database. Between 98% and 30% of the analyzed pathways in a genome were found to exhibit significantly higher clustering levels than expected by chance. In descending order by the level of clustering, the genomes studied were Saccharomyces cerevisiae, Homo sapiens, Caenorhabditis elegans, Arabidopsis thaliana, and Drosophila melanogaster. Surprisingly, there is not much agreement between genomes in terms of which pathways are most clustered. Only seven of 69 pathways found in all species were significantly clustered in all five of them. This species-specific pattern of pathway clustering may reflect adaptations or evolutionary events unique to a particular lineage. We note that although operons are common in C. elegans, only 58% of the pathways showed significant clustering, which is less than in human. Virtually all pathways in S. cerevisiae showed significant clustering

Crossref

PubMed Central