Search CORE

6 research outputs found

Reversing gene erosion: reconstructing ancestral bacterial genomes from gene-content and gene-order data

Author: Earnest-DeYoung J.
Lerat E.
Moret B. M. E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 12/12/2006
Field of study

Infoscience - École polytechnique fédérale de Lausanne

Approximating the true evolutionary distance between genomes

Author: Earnest-DeYoung J. V.
M Swenson K.
Marron M.
Moret B. M. E.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 12/12/2006
Field of study

Infoscience - École polytechnique fédérale de Lausanne

A framework for orthology assignment from gene rearrangement data

Author: A. Caprara
B. Larget
B.M.E. Moret
C. Thach Nguyen
D. Bryant
D. Sankoff
D. Sankoff
D. Sankoff
D. Sankoff
D.A. Bader
G. Tesler
J. Earnest-DeYoung
J. Tang
J.L. Boore
J.L. Boore
K.M. Swenson
M. Blanchette
M. Marron
M.E. Cosner
N. El-Mabrouk
N. El-Mabrouk
S. Hannenhalli
S.R. Downie
X. Chen
Publication venue: Springer
Publication date: 01/01/2005
Field of study

Abstract. Gene rearrangements have successfully been used in phylogenetic reconstruction and comparative genomics, but usually under the assumption that all genomes have the same gene content and that no gene is duplicated. While these assumptions allow one to work with organellar genomes, they are too restrictive when comparing nuclear genomes. The main challenge is how to deal with gene families, specifically, how to identify orthologs. While searching for orthologies is a common task in computational biology, it is usually done using sequence data. We approach that problem using gene rearrangement data, provide an optimization framework in which to phrase the problem, and present some preliminary theoretical results.

CiteSeerX

Crossref

A Methodological Framework for the Reconstruction of Contiguous Regions of Ancestral Genomes and Its Application to Mammalian Genomes

Author: A Bergeron
A Bergeron
A Bhutkar
A Caprara
A Darling
A Sinha
A Sturtevant
C Kemkemer
Cedric Chauve
D Fulkerson
D Karolchik
D Sankoff
Eric Tannier
F Alizadeh
F Richard
F Swidan
F Yang
G Bourque
G Bourque
G Bourque
G Bourque
G Landau
J Earnest-DeYoung
J Ma
J Ma
J Meidanis
J Tang
J Wienberg
Jens Stoye
K Booth
K Lindblad-Toh
L Froenicke
L Froenicke
M Alekseyev
M Belcaid
M Blanchette
M Dom
M Dom
M Habib
M Hajiaghayi
M Muffato
M Rocchi
M Svartman
M Svartman
MJ Benton
MP Beal
N El-Mabrouk
N Eriksen
N Luc
P Goldberg
P Pevzner
Pavel A. Pevzner
R Karp
R McConnell
S Bérard
S Bérard
S Pasek
T Christof
T Faraut
T Mikkelsen
VL Rascol
W Murphy
W Murphy
Y Nakatani
Y van de Peer
Z Adam
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

The reconstruction of ancestral genome architectures and gene orders from homologies between extant species is a long-standing problem, considered by both cytogeneticists and bioinformaticians. A comparison of the two approaches was recently investigated and discussed in a series of papers, sometimes with diverging points of view regarding the performance of these two approaches. We describe a general methodological framework for reconstructing ancestral genome segments from conserved syntenies in extant genomes. We show that this problem, from a computational point of view, is naturally related to physical mapping of chromosomes and benefits from using combinatorial tools developed in this scope. We develop this framework into a new reconstruction method considering conserved gene clusters with similar gene content, mimicking principles used in most cytogenetic studies, although on a different kind of data. We implement and apply it to datasets of mammalian genomes. We perform intensive theoretical and experimental comparisons with other bioinformatics methods for ancestral genome segments reconstruction. We show that the method that we propose is stable and reliable: it gives convergent results using several kinds of data at different levels of resolution, and all predicted ancestral regions are well supported. The results come eventually very close to cytogenetics studies. It suggests that the comparison of methods for ancestral genome reconstruction should include the algorithmic aspects of the methods as well as the disciplinary differences in data aquisition

Public Library of Science (PLOS)

Crossref

INRIA a CCSD electronic archive server

Directory of Open Access Journals

PubMed Central

Simon Fraser University Institutional Repository

HAL Descartes

Hal-Diderot

Genes Order and Phylogenetic Reconstruction: Application to γ-Proteobacteria

Author: A. Bergeron
D. Sankoff
D. Sankoff
D. Sankoff
E. Belda
E. Lerat
G. Blin
G. Bourque
G. Bourque
J.-F. Lefebvre
J.T. Herbeck
J.V. Earnest-DeYoung
K.M. Swenson
M. Blanchette
S. Bérard
S.F. Altschul
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2005
Field of study

Crossref

Linear programming for phylogenetic reconstruction based on gene rearrangements

Author: A. Caprara
A. Caprara
A.C. Siepel
B.M.E. Moret
B.M.E. Moret
B.M.E. Moret
D. Eppstein
D. Sankoff
D.R. Robinson
G. Bourque
G. Lancia
J. Earnest-DeYoung
J. Tang
J. Tang
J.D. Palmer
K.M. Swenson
L.-S. Wang
L.A. Raubeson
M. Blanchette
M.E. Cosner
N. El-Mabrouk
N. Saitou
S.R. Downie
Publication venue: Springer
Publication date: 01/01/2005
Field of study

Phylogenetic reconstruction from gene rearrangements has attracted increasing attention from biologists and computer scientists over the last few years. Methods used in reconstruction include distance-based methods, parsimony methods using sequence-based encodings, and direct optimization. The latter, pioneered by Sankoff and extended by us with the software suite GRAPPA, is the most accurate approach, but has been limited to small genomes because the running time of its scoring algorithm grows exponentially with the number of genes in the genome. We report here on a new method to compute a tight lower bound on the score of a given tree, using a set of linear constraints generated through selective applications of the triangle inequality. Our method generates an integer linear program with a carefully limited number of constraints, rapidly solves its relaxed version, and uses the result to provide a tight lower bound. Since this bound is very close to the optimal tree score, it can be used directly as a selection criterion, thereby enabling us to bypass entirely the expensive scoring procedure. We have implemented this method within our GRAPPA software and run several series of experiments on both biological and simulated datasets to assess its accuracy. Our results show that using the bound as a selection criterion yields excellent trees, with error rates below 5 % up to very large evolutionary distances, consistently beating the baseline Neighbor-Joining. Our new method enables us to extend the range of applicability of the direct optimization method to chromosomes of size comparable to those of bacteria, as well as to datasets with complex combinations of evolutionary events.

Infoscience - École polytechnique fédérale de Lausanne

CiteSeerX

Crossref