3,149 research outputs found

Revealing mammalian evolutionary relationships by comparative analysis of gene clusters

Author: Abi-Rached
Akahoshi
Bailey
Benjamin Dickins
Birney
Cadavid
Cathy Riemer
Chen
Chih-Hao Hsu
Chiu
Colobran
Datta
Degenhardt
Dewey
Dufayard
Edwards
Eric D. Green
Fitch
Giltae Song
Gish
Gonzalez
Goodstadt
Graef
Guethlein
Han
Hardies
Hardison
Harris
Hie Lim Kim
Hoffmann
Hou
Hsu
Hu
Huerta-Cepas
Jensen
Johnson
Kim
Kristensen
Lee
Levy
Li
Lopez-Vazquez
Louxin Zhang
Margulies
Martin
Matsuya
Mi
Miyata
Muller
Murphy
NISC Comparative Sequencing Program
Opazo
Ostlund
Ouzounis
Parham
Pianezza
Rajalingam
Ross C. Hardison
Sambrook
Shilling
Siepel
Smit
Song
Sonnhammer
Su
Tatusov
The ENCODE Project Consortium
Uchiyama
van der Heijden
Vilella
Wang
Wapinski
Waterhouse
Webb Miller
Wilson
Woelk
Yu Zhang
Zhang
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2012

Many software tools for comparative analysis of genomic sequence data have been released in recent decades. Despite this, it remains challenging to determine evolutionary relationships in gene clusters due to their complex histories involving duplications, deletions, inversions, and conversions. One concept describing these relationships is orthology. Orthologs derive from a common ancestor by speciation, in contrast to paralogs, which derive from duplication. Discriminating orthologs from paralogs is a necessary step in most multispecies sequence analyses, but doing so accurately is impeded by the occurrence of gene conversion events. We propose a refined method of orthology assignment based on two paradigms for interpreting its definition: by genomic context or by sequence content. X-orthology (based on context) traces orthology resulting from speciation and duplication only, while N-orthology (based on content) includes the influence of conversion events

Crossref

Nottingham Trent Institutional Repository (IRep)

PubMed Central

ScholarBank@NUS

SATCHMO-JS: a webserver for simultaneous protein multiple sequence alignment and phylogenetic tree construction.

Author: Datta Ruchira S
Davidson John R
Hagopian Raffi
Jarvis Glen R
Samad Bushra
Sjölander Kimmen
Publication venue: eScholarship, University of California
Publication date: 29/04/2010

We present the jump-start simultaneous alignment and tree construction using hidden Markov models (SATCHMO-JS) web server for simultaneous estimation of protein multiple sequence alignments (MSAs) and phylogenetic trees. The server takes as input a set of sequences in FASTA format, and outputs a phylogenetic tree and MSA; these can be viewed online or downloaded from the website. SATCHMO-JS is an extension of the SATCHMO algorithm, and employs a divide-and-conquer strategy to jump-start SATCHMO at a higher point in the phylogenetic tree, reducing the computational complexity of the progressive all-versus-all HMM-HMM scoring and alignment. Results on 983 structurally aligned pairs from the PREFAB benchmark dataset show that SATCHMO-JS provides a statistically significant improvement in alignment accuracy over MUSCLE, Multiple Alignment using Fast Fourier Transform (MAFFT), ClustalW and the original SATCHMO algorithm. The SATCHMO-JS webserver is available at http://phylogenomics.berkeley.edu/satchmo-js. The datasets used in these experiments are available for download at http://phylogenomics.berkeley.edu/satchmo-js/supplementary/

PubMed Central

eScholarship - University of California

Toward community standards in the quest for orthologs

Author: Dessimoz C.
Gabaldon T.
Herrero J.
Quest for Ortholog Consortium
Roos D.S.
Sonnhammer E.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 10/06/2014

CGSpace

Toward community standards in the quest for orthologs

Author: Altenhoff Adrian
Apweiler Rolf
Ashburner Michael
Blake Judith
Boeckmann Brigitte
Bridge Alan
Bruford Elspeth
Cherry Mike
Conte Matthieu
Dannie Durand
Datta Ruchira
Dessimoz Christophe
Domelevo Entfellner Jean-Baka
Ebersberger Ingo
Gabaldón Toni
Galperin Michael
Herrero Javier
Joseph Jacob
Koestler Tina
Kriventseva Evgenia
Lecompte Odile
Leunissen Jack
Lewis Suzanna
Linard Benjamin
Livstone Michael S.
Lu Hui-Chun
Martin Maria
Mazumder Raja
Messina David
Miele Vincent
Muffato Matthieu
Perrière Guy
Punta Marco
Roos David
Roos David S.
Rouard Mathieu
Schmitt Thomas
Schreiber Fabian
Silva Alan
Sjölander Kimmen
Sonnhammer Erik
Sonnhammer Erik L. L.
Stanley Eleanor
Szklarczyk Radek
Thomas Paul
Uchiyama Ikuo
Van Bel Michiel
Vandepoele Klaas
Vilella Albert J.
Yates Andrew
Zdobnov Evgeny
Škunca Nives
Publication date: 02/08/2017

The identification of orthologs (gene pairs descended from a common ancestor through speciation, rather than duplication) has emerged as an essential component of many bioinformatics applications, ranging from the annotation of new genomes to experimental target prioritization. Yet, the development and application of orthology inference methods is hampered by the lack of consensus on source proteomes, file formats and benchmarks. The second ‘Quest for Orthologs’ meeting brought together stakeholders from various communities to address these challenges. We report on achievements and outcomes of this meeting, focusing on topics of particular relevance to the research community at large. The Quest for Orthologs consortium is an open community that welcomes contributions from all researchers interested in orthology research and applications. Contact: [email protected]

RERO DOC Digital Library

A MOSAIC of methods: Improving ortholog detection through integration of algorithmic diversity

Author: Hernandez Ryan D.
Maher M. Cyrus
Publication date: 18/04/2014

Ortholog detection (OD) is a critical step for comparative genomic analysis of protein-coding sequences. In this paper, we begin with a comprehensive comparison of four popular, methodologically diverse OD methods: MultiParanoid, Blat, Multiz, and OMA. In head-to-head comparisons, these methods are shown to significantly outperform one another 12-30% of the time. This high complementarity motivates the presentation of the first tool for integrating methodologically diverse OD methods. We term this program MOSAIC, or Multiple Orthologous Sequence Analysis and Integration by Cluster optimization. Relative to component and competing methods, we demonstrate that MOSAIC more than quintuples the number of alignments for which all species are present, while simultaneously maintaining or improving functional-, phylogenetic-, and sequence identity-based measures of ortholog quality. Further, we demonstrate that this improvement in alignment quality yields 40-280% more confidently aligned sites. Combined, these factors translate to higher estimated levels of overall conservation, while at the same time allowing for the detection of up to 180% more positively selected sites. MOSAIC is available as a Python package. MOSAIC alignments, source code, and full documentation are available at http://pythonhosted.org/bio-MOSAIC

arXiv.org e-Print Archive

FigShare

SICLE: A high-throughput tool for extracting evolutionary relationships from phylogenetic trees

Author: DeBlasio Dan
Wisecaver Jennifer
Publication venue: 'PeerJ'
Publication date: 16/06/2016

We present the phylogeny analysis software SICLE (Sister Clade Extractor), an easy-to-use, high-throughput tool to describe the nearest neighbors to a node of interest in a phylogenetic tree as well as the support value for the relationship. The application is a command line utility that can be embedded into a phylogenetic analysis pipeline or can be used as a subroutine within another C++ program. As a test case, we applied this new tool to the published phylome of Salinibacter ruber, a species of halophilic Bacteroidetes, identifying 13 unique sister relationships to S. ruber across the 4589 gene phylogenies. S. ruber grouped with bacteria, most often other Bacteroidetes, in the majority of phylogenies, but 91 phylogenies showed a branch-supported sister association between S. ruber and Archaea, an evolutionarily intriguing relationship indicative of horizontal gene transfer. This test case demonstrates how SICLE makes it possible to summarize the phylogenetic information produced by automated phylogenetic pipelines to rapidly identify and quantify the possible evolutionary relationships that merit further investigation. SICLE is available for free for noncommercial use at http://eebweb.arizona.edu/sicle/.
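
The idea of extracting the sister clade of a node of interest can be illustrated with a minimal Python sketch. This is not SICLE's implementation (SICLE is a C++ command line utility, and it also reports support values, which are omitted here). The nested-tuple tree encoding, the function names `leaves` and `sister_leaves`, and the toy taxon labels are all assumptions made for illustration.

```python
def leaves(node):
    """Return the set of leaf names under a node.

    Assumed encoding: a leaf is a string, an internal node is a tuple of children.
    """
    if isinstance(node, str):
        return {node}
    result = set()
    for child in node:
        result |= leaves(child)
    return result


def sister_leaves(tree, target):
    """Return the leaf names in the sister clade(s) of leaf `target`.

    Returns None if `target` is not a leaf below `tree`.
    """
    if isinstance(tree, str):
        return None
    # If the target is a direct child here, its siblings form the sister clade(s).
    for index, child in enumerate(tree):
        if child == target:
            sisters = set()
            for other_index, other in enumerate(tree):
                if other_index != index:
                    sisters |= leaves(other)
            return sisters
    # Otherwise keep searching deeper in each subtree.
    for child in tree:
        found = sister_leaves(child, target)
        if found is not None:
            return found
    return None


# Hypothetical toy gene tree; the labels are illustrative only.
toy_tree = ((("S_ruber", "Archaeon_A"), "Bacteroidetes_B"), "Outgroup_C")
print(sister_leaves(toy_tree, "S_ruber"))
```

Running the same function over many gene trees and tallying the taxonomic group of each sister set would give a summary in the spirit of the test case described above.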

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

Big data and other challenges in the quest for orthologs

Author: Boeckmann Brigitte
Dessimoz Christophe
Gabaldón Toni
Martin Maria
Robinson-Rechavi Marc
Sonnhammer Erik L.L.
Sousa da Silva Alan W.
Thomas Paul D.
Publication date: 02/08/2017

Given the rapid increase of species with a sequenced genome, the need to identify orthologous genes between them has emerged as a central bioinformatics task. Many different methods exist for orthology detection, which makes it difficult to decide which one to choose for a particular application. Here, we review the latest developments and issues in the orthology field, and summarize the most recent results reported at the third ‘Quest for Orthologs’ meeting. We focus on community efforts such as the adoption of reference proteomes, standard file formats and benchmarking. Progress in these areas is good, and they are already beneficial to both orthology consumers and providers. However, a major current issue is that the massive increase in complete proteomes poses computational challenges to many of the ortholog database providers, as most orthology inference algorithms scale at least quadratically with the number of proteomes. The Quest for Orthologs consortium is an open community with a number of working groups that join efforts to enhance various aspects of orthology analysis, such as defining standard formats and datasets, documenting community resources and benchmarking. Availability and implementation: All such materials are available at http://questfororthologs.org. Contact: [email protected] or [email protected]
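
To give a rough sense of the quadratic scaling mentioned in the abstract, the sketch below counts the number of distinct proteome pairs an all-against-all comparison would have to consider. This is only an illustration of quadratic growth, assuming one comparison per unordered pair; the function name `proteome_pairs` is hypothetical and the abstract does not specify how any particular algorithm counts its work.

```python
def proteome_pairs(n):
    """Number of unordered pairs among n proteomes: n * (n - 1) / 2."""
    return n * (n - 1) // 2


# Growing the number of proteomes tenfold grows the pair count roughly a hundredfold.
for n in (10, 100, 1000):
    print(n, proteome_pairs(n))
```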

RERO DOC Digital Library