Search CORE

36 research outputs found

Bidirectional Best Hits Miss Many Orthologs in Duplication-Rich Clades such as Plants and Animals.

Author: Dalquen DA
Dessimoz C
Publication date: 01/01/2013

Bidirectional best hits (BBH), which entails identifying the pairs of genes in two different genomes that are more similar to each other than either is to any other gene in the other genome, is a simple and widely used method to infer orthology. A recent study has analysed the link between BBH and orthology in bacteria and archaea and concluded that, given the very high consistency in BBH they observed among triplets of neighboring genes, a high proportion of BBH are likely to be bona fide orthologs. However, limited by their analysis setup, the previous study could not easily test the reverse question: which proportion of orthologs are BBH? In this follow-up study, we consider this question in theory and answer it based on conceptual arguments, simulated data, and real biological data from all three domains of life. Our analyses corroborate the findings of the previous study, but also show that because of the high rate of gene duplication in plants and animals, as much as 60% of orthologous relations are missed by the BBH criterion.
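
The BBH criterion described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the gene identifiers, and the similarity scores are hypothetical, and the scores stand in for whatever pairwise similarity measure (for example, an alignment score) is actually used. Ties are broken arbitrarily in favour of the first hit seen.

```python
def best_hits(scores):
    """Map each query gene to its highest-scoring gene in the other genome.

    `scores` maps (query, target) pairs to a similarity score.
    On ties, the first pair encountered wins (an arbitrary choice).
    """
    best = {}
    for (query, target), score in scores.items():
        if query not in best or score > best[query][1]:
            best[query] = (target, score)
    return {query: target for query, (target, _) in best.items()}


def bidirectional_best_hits(a_vs_b, b_vs_a):
    """Return sorted (a, b) pairs in which each gene is the other's best hit."""
    best_ab = best_hits(a_vs_b)
    best_ba = best_hits(b_vs_a)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)


# Hypothetical toy data: gene b1 was duplicated in genome B after speciation,
# giving b1 and b1p, so both are orthologs of a1.
a_vs_b = {("a1", "b1"): 90, ("a1", "b1p"): 85, ("a2", "b2"): 70}
b_vs_a = {("b1", "a1"): 90, ("b1p", "a1"): 85, ("b2", "a2"): 70}

bbh_pairs = bidirectional_best_hits(a_vs_b, b_vs_a)
print(bbh_pairs)  # [('a1', 'b1'), ('a2', 'b2')]
```

In this toy case the pair (a1, b1p) is not reported, because a1's single best hit is b1. This is the kind of orthologous relation that the abstract says BBH misses in duplication-rich lineages.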

CiteSeerX

UCL Discovery

PubMed Central

Who Watches the Watchmen? An Appraisal of Benchmarks for Multiple Sequence Alignment

Author: A Löytynoja
B Sipos
BG Hall
BP Blackburne
C Chothia
C Dessimoz
C Kemena
C Notredame
CB Do
CL Strope
DA Dalquen
DA Morrison
DH Mathews
ER Mardis
G Blackshields
G Jordan
G Landan
GP Raghava
I Walle Van
J Kim
J Stoye
JD Thompson
JH Havgaard
JP Huelsenbeck
K Mizuguchi
LA Stebbings
M Anisimova
M Pop
MR Aniba
P Gardner
RA Cartwright
RB Russell
RC Edgar
SA Berger
SF Altschul
T Golubchik
T Koestler
T Lassmann
W Fletcher
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 09/11/2012

Multiple sequence alignment (MSA) is a fundamental and ubiquitous technique in bioinformatics used to infer related residues among biological sequences. Thus alignment accuracy is crucial to a vast range of analyses, often in ways difficult to assess in those analyses. To compare the performance of different aligners and help detect systematic errors in alignments, a number of benchmarking strategies have been pursued. Here we present an overview of the main strategies--based on simulation, consistency, protein structure, and phylogeny--and discuss their different advantages and associated risks. We outline a set of desirable characteristics for effective benchmarking, and evaluate each strategy in light of them. We conclude that there is currently no universally applicable means of benchmarking MSA, and that developers and users of alignment tools should base their choice of benchmark depending on the context of application--with a keen awareness of the assumptions underlying each benchmarking strategy.

arXiv.org e-Print Archive

Crossref

UCL Discovery

Fast and robust multiple sequence alignment with phylogeny-aware gap placement

Author: A Biegert
A Löytynoja
A Viterbi
Adam M Szalkowski
AM Altenhoff
AM Szalkowski
B Paten
C Dessimoz
C Grasso
C Lee
D Robinson
DA Dalquen
G Gonnet
GH Gonnet
GW Stuart
J Felsenstein
JD Thompson
JL Thorne
JM Sauder
K Katoh
M Anisimova
M Kimura
O Gascuel
O Gotoh
R Durbin
RC Edgar
S Pascarella
S Whelan
SA Benner
SB Needleman
Publication venue: 'Springer Science and Business Media LLC'

Crossref

The paralog-to-contig assignment problem: high quality gene models from fragmented assemblies

Author: AJ Vilella
BL Harty
C Burge
C Camacho
DA Dalquen
DH Huson
E Birney
EW Sayers
F Cortesi
F Cunningham
F Sievers
G Celniker
G Gremme
G Parra
G Pavesi
Henrike Indrischek
HM Wain
J Eid
JP Silva
K Hatje
K Nowick
L Lovász
M Burset
M Stanke
MA Larkin
MR Brent
ND Setta
Nicolas Wieseke
O Keller
Peter F. Stadler
R Burkard
R Guigó
RM Karp
S Scherer
SF Altschul
SL Renninger
Sonja J. Prohaska
SR Eddy
TD Wu
The UniProt consortium
V Curwen
V Shepelev
Publication venue: 'Springer Science and Business Media LLC'

Crossref

Standardised Benchmarking in the Quest for Orthologs

Author: Altenhoff Adrian M.
Boeckmann Brigitte
Bork Peer
Capella-Gutierrez Salvador
Dalquen Daniel A.
DeLuca Todd
Dessimoz Christophe
Forslund Kristoffer
Gabaldón Toni
Huerta-Cepas Jaime
Juhl Jensen Lars
Lecompte Odile
Lewis Suzanna E.
Linard Benjamin
Martin Maria J.
Muffato Matthieu
Pereira Cécile
Pryszcz Leszek P.
Schreiber Fabian
Sjölander Kimmen
Sonnhammer Erik
Sousa da Silva Alan
Szklarczyk Damian
Thomas Paul D.
Train Clément-Marie
von Mering Christian
Xenarios Ioannis
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/11/2016

The identification of evolutionarily related genes across different species, orthologs in particular, forms the backbone of many comparative, evolutionary, and functional genomic analyses. Achieving high accuracy in orthology inference is thus essential. Yet the true evolutionary history of genes, required to ascertain orthology, is generally unknown. Furthermore, orthologs are used for very different applications across different phyla, with different requirements in terms of the precision-recall trade-off. As a result, assessing the performance of orthology inference methods remains difficult for both users and method developers. Here, we present a community effort to establish standards in orthology benchmarking and facilitate orthology benchmarking through an automated web-based service (http://orthology.benchmarkservice.org). Using this new service, we characterise the performance of 15 well-established orthology inference methods and resources on a battery of 20 different benchmarks. Standardised benchmarking provides a way for users to identify the most effective methods for the problem at hand, sets a minimal requirement for new tools and resources, and guides the development of more accurate orthology inference methods.
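
The precision-recall trade-off mentioned in this abstract can be made concrete with a small Python sketch. It is an illustration only, not the procedure used by the benchmark service: the function name, the gene identifiers, and the reference set are hypothetical, and in practice (as the abstract notes) the true set of orthologs is generally unknown.

```python
def precision_recall(predicted, reference):
    """Score predicted ortholog pairs against a reference set of pairs.

    Pairs are treated as unordered, so (a, b) and (b, a) count as the same pair.
    Returns (precision, recall); an empty set yields 0.0 for that measure.
    """
    predicted = {frozenset(pair) for pair in predicted}
    reference = {frozenset(pair) for pair in reference}
    true_positives = len(predicted & reference)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(reference) if reference else 0.0
    return precision, recall


# Hypothetical reference: a1 has two co-orthologs (b1 and b1p) in the other genome.
reference = [("a1", "b1"), ("a1", "b1p"), ("a2", "b2")]

# A conservative method: every call is correct, but one ortholog pair is missed.
conservative = [("a1", "b1"), ("a2", "b2")]

# A permissive method: all ortholog pairs are found, plus one wrong call.
permissive = [("a1", "b1"), ("a1", "b1p"), ("a2", "b2"), ("a2", "b3")]

print(precision_recall(conservative, reference))  # precision 1.0, recall 2/3
print(precision_recall(permissive, reference))    # precision 0.75, recall 1.0
```

Which of the two hypothetical methods is preferable depends on the application, which is the difficulty the abstract points to.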

Harvard University - DASH