10 research outputs found

On the PATHGROUPS approach to rapid small phylogeny

Author: A Caprara
AC Siepel
AW Xu
C Zheng
Chunfang Zheng
D Sankoff
David Sankoff
E Tannier
G Fertin
KP Byrne
N El-Mabrouk
R Warren
S Yancopoulos
SM Hedtke
Z Adam
Publication venue: BioMed Central
Publication date: 01/01/2011

We present a data structure enabling a rapid heuristic solution to the ancestral genome reconstruction problem for given phylogenies under genomic rearrangement metrics. The efficiency of the greedy algorithm is due to fast updating of the structure during run time and a simple priority scheme for choosing the next step. Since accuracy deteriorates for sets of highly divergent genomes, we investigate strategies for improving accuracy and expanding the range of data sets where accurate reconstructions can be expected. These include a more refined priority system and a two-step look-ahead, as well as iterative local improvements based on the median version of the problem, incorporating simulated annealing. We apply this to a set of yeast genomes to corroborate a recent gene sequence-based phylogeny.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

Author: Chunfang Zheng
David Sankoff
Publication venue: Springer Nature
Publication date: 25/06/2012

BACKGROUND: Ancestral gene order reconstruction for flowering plants has lagged behind developments in yeasts, insects and higher animals, because of the recency of widespread plant genome sequencing, sequencers' embargoes on public data use, paralogies due to whole genome duplication (WGD) and fractionation of undeleted duplicates, extensive paralogy from other sources, and the computational cost of existing methods. RESULTS: We address these problems, using the gene order of four core eudicot genomes (cacao, castor bean, papaya and grapevine) that have escaped any recent WGD events, and two others (poplar and cucumber) that descend from independent WGDs, in inferring the ancestral gene order of the rosid clade and those of its main subgroups, the fabids and malvids. We improve and adapt techniques including the OMG method for extracting large, paralogy-free, multiple orthologies from conflated pairwise synteny data among the six genomes and the PATHGROUPS approach for ancestral gene order reconstruction in a given phylogeny, where some genomes may be descendants of WGD events. We use the gene order evidence to evaluate the hypothesis that the order Malpighiales belongs to the malvids rather than, as traditionally assigned, to the fabids. CONCLUSIONS: Gene orders of ancestral eudicot species, involving 10,000 or more genes, can be reconstructed in an efficient, parsimonious and consistent way, despite paralogies due to WGD and other processes. Pairwise genomic syntenies provide appropriate input to a parameter-free procedure of multiple ortholog identification followed by gene-order reconstruction in solving instances of the "small phylogeny" problem.

Springer - Publisher Connector

PubMed Central

Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

Author: A Muñoz
A Ouangraoua
AP Chan
B Thomas
B Wang
C Zheng
Chunfang Zheng
D Bertrand
D Sankoff
David Sankoff
DE Soltis
E Lyons
E Tannier
F Forest
G Moore
GA Tuskan
H Tang
J Ma
JG Burleigh
JL Gordon
N El-Mabrouk
O Jaillon
R Ming
R Velasco
RJ Langham
S Huang
S Warshall
S Yancopoulos
V Shulaev
WJ Murphy
X Argout
Z Adam
Publication venue: Springer Science and Business Media LLC

Crossref

Analysis of Gene Order Evolution Beyond Single-Copy Genes

Author: A Bergeron
A Siepel
A Xu
B Arden
B Ma
B Moret
B Vernot
C Chauve
C Zheng
C. Chauve
CM Zmasek
D Bader
D Bertrand
D Durand
D Fulkerson
D Sankoff
D Soltis
E Eichler
E Lyons
F Murat
F. Murat
G Blanc
G Blin
G Bourque
G Fertin
G Glusman
G Landau
G Shi
G Tesler
G Watterson
H Gavranovic
H Gavranović
I Wapinski
J Bowers
J Cotton
J Demuth
J Gordon
J Mixtacki
J Nadeau
J Salse
J-P Doyon
K Chen
K O’Brien
K Wolfe
L Zhang
M Alekseyev
M Goodman
M Hahn
M Lajoie
M Lynch
M Muffato
M Sanderson
M Shannon
N El-Mabrouk
O Elemento
O Eulenstein
O Tremblay-Savard
P Bonizzoni
P Gorecki
P Pevzner
Q Zhu
R Guigó
R Hoberman
R LaRue
R Page
R Tatusov
R Warren
S Angibaud
S Hannenhalli
S Pham
S Schwartz
S Yancopoulos
T Blomme
T Uno
T Vinař
V Bafna
V Shoja
W Fitch
W Li
WJ Kent
Z Adam
Z Fu
Z Yang
Publication venue: Springer Science and Business Media LLC

Crossref

Ancestral Gene Synteny Reconstruction Improves Extant Species Scaffolding

Author: Anselmetti Yoann
Berry Vincent
Bérard Sèverine
Chateau Annie
Chauve Cedric
Tannier Eric
Publication date: 01/01/2015

We exploit the methodological similarity between ancestral genome reconstruction and extant genome scaffolding. We present a method, called ARt-DeCo, that constructs neighborhood relationships between genes or contigs, in both ancestral and extant genomes, in a phylogenetic context. It is able to handle dozens of complete genomes, including genes with complex histories, by using gene phylogenies reconciled with a species tree, that is, annotated with speciation, duplication and loss events. Reconstructed ancestral or extant synteny comes with a support computed from an exhaustive exploration of the solution space. We compare our method with a previously published one that pursues the same goal on a small number of genomes with universal unicopy genes. Then we test it on the whole Ensembl database, by proposing partial ancestral genome structures, as well as a more complete scaffolding for many partially assembled genomes on 69 eukaryote species. We carefully analyze a couple of extant adjacencies proposed by our method, and show that they are indeed real links in the extant genomes that were missing in the current assembly. On a reduced data set of 39 eutherian mammals, we estimate the precision and sensitivity of ARt-DeCo by simulating a fragmentation in some well assembled genomes, and measure how many adjacencies are recovered. We find a very high precision, while the sensitivity depends on the quality of the data and on the proximity of closely related genomes.

Crossref

Springer - Publisher Connector

INRIA a CCSD electronic archive server

Simon Fraser University Institutional Repository

Ancestral gene synteny reconstruction improves extant species scaffolding

Publication venue: BioMed Central

Springer - Publisher Connector

Duplication, Rearrangement and Reconciliation: A Follow-Up 13 Years Later

Author: A. Ouangraoua
A. Sturtevant
A. Vilella
B. Boussau
B. Cai
B. Zhu
B.R. Jones
C. Chauve
C. Zheng
C.L. Kahn
C.N. Dewey
D. Bertrand
D. Doerr
D. Durand
D. Graur
D. Sankoff
E. Zuckerkandl
F. Hu
G. Fertin
G.A. Watterson
G.J. Szöllosi
H. Shimodaira
I. Miklos
I. Wapinski
J. Felsenstein
J. Jun
J. Ma
J. Maňuch
J. Tang
J.P. Doyon
L. Bulteau
L. Goodstadt
L. Zhang
M. Csurös
M. Goodman
M. Lajoie
M. Muffato
M. Stolzer
M. Tang
M.D. Rasmussen
N. El-Mabrouk
O. Akerborg
O. Elemento
O. Gascuel
O. Tremblay-Savard
P. Feijão
P.A. Pevzner
R. Durbin
R. Page
S. Angibaud
S. Bérard
S. Guindon
S. Hannenhalli
T. Makino
W. Fitch
Y. Gagnon
Y. Lin
Y. Zhang
Y.C. Wu
Z. Fu
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2013

Crossref

Phylogenetic assembly of paleogenomes integrating ancient DNA data

Author: Luhmann Nina
Publication venue: Universität Bielefeld
Publication date: 01/01/2017

Luhmann N. Phylogenetic assembly of paleogenomes integrating ancient DNA data. Bielefeld: Universität Bielefeld; 2017. In comparative genomics, reconstructing the genomes of ancestral species in a given phylogeny is an important problem in order to analyze genome evolution over time. The diversity of present-day genomes in terms of local mutations and genome rearrangements sheds light on the dynamics of evolutionary processes that led from a common ancestor to a set of extant genomes. This speciation history is depicted in a phylogenetic tree. Comparative genome reconstruction methods aim to infer genomic features such as an order of markers (e.g. genes) for extinct species at internal nodes of the tree by applying different evolutionary models, relying only on the information available for the extant genomes at the leaves of the phylogenetic tree. Recently, the steady progress in sequencing technologies led to the emergence of the field of paleogenomics, where the study of ancient DNA (aDNA) found in conserved organic material is moving rapidly towards the sequencing and analysis of complete paleogenomes. Such "genetic time travel" allows direct insight into specific phases of the evolution of specific genomes that are not only implicitly inferred from extant DNA sequences. However, as DNA is naturally degraded over time after the death of an organism and environmental conditions interfere with the conservation of DNA material, an assembly of these paleogenomes is usually fragmented, preventing a detailed analysis of genome rearrangements along the branches of the phylogenetic tree. In this thesis, we aim to combine the study of aDNA and comparative ancestral reconstruction in a phylogenetic framework.
The comparison with extant related genomes can naturally assist in scaffolding a fragmented aDNA assembly, while the aDNA sequencing data can be included as an additional source of information for comparative reconstruction methods to improve the reconstructions of all related genomes in the phylogenetic tree. Our first focus is on integrative methods to reconstruct marker orders globally in a phylogeny under the assumption of parsimony. An underlying rearrangement model can describe the evolutionary operations that occurred along the edges of the tree. However, as much as complex rearrangement scenarios can give insights into underlying biological mechanisms during evolution, from a computational point of view the ancestral reconstruction problem under rearrangement distances is an NP-hard problem. One exception is the Single-Cut-or-Join (SCJ) distance, which uses a marker order-based representation of the involved genomes to model the cut and join of marker adjacencies as evolutionary operations. We build upon this rearrangement model and describe parsimony-based reconstruction methods aiming to minimize the SCJ distance in the tree. In addition, we require the reconstructed solutions to be consistent, such that they represent linear or circular regions of the ancestral genome. Our first polynomial-time method is based on the Sankoff-Rousseau algorithm and directly includes an aDNA assembly graph at one internal node of the tree. We show that including branch lengths in the underlying tree can avoid ambiguity in practice. Our second approach follows a more general strategy and includes the aDNA sequencing data as local weights for adjacencies next to the SCJ distance in the objective. We describe a fixed-parameter-tractable algorithm that also allows sampling of co-optimal solutions.
Finally, we describe an approach to fill gaps between potentially adjacent markers by aDNA data to reconstruct the complete genome sequence of a paleogenome guided by the related extant genome sequences. In addition, this approach enables us to select the adjacencies that are supported by the sequencing information from sets of conflicting adjacencies. We evaluate our proposed models and algorithms on simulated and biological data. In particular, we integrate two aDNA sequencing data sets for ancient strains of the pathogen Yersinia pestis, which is understood to be the cause of several pandemics in medieval times. We show that the combination of aDNA sequencing reads and a parsimonious reconstruction in the phylogenetic tree reduces the fragmentation of an initial aDNA assembly substantially and explore alternative reconstructions to emphasize reliably reconstructed regions of the ancient genomes.
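
As an illustration of the Single-Cut-or-Join model mentioned in this abstract, the following is a minimal sketch, not the thesis's implementation. It assumes two genomes with identical marker content, each marker written as a nonzero signed integer, and computes the SCJ distance as the number of adjacencies present in one genome but not the other (each such adjacency costs one cut or one join). The function names and the `(markers, circular)` genome encoding are hypothetical choices made for this example.

```python
def chromosome_adjacencies(markers, circular=False):
    """Return the adjacency set of one chromosome.

    Each signed marker g contributes two extremities, tail (g, "t") and
    head (g, "h"); a negative sign means the marker is read head-first.
    An adjacency is the unordered pair of extremities that touch.
    """
    extremities = []
    for m in markers:
        name = abs(m)
        if m > 0:
            extremities += [(name, "t"), (name, "h")]
        else:
            extremities += [(name, "h"), (name, "t")]
    # Pair the right end of each marker with the left end of the next one.
    adjacencies = {
        frozenset((extremities[i], extremities[i + 1]))
        for i in range(1, len(extremities) - 1, 2)
    }
    # A circular chromosome has one extra adjacency closing the cycle.
    if circular and extremities:
        adjacencies.add(frozenset((extremities[-1], extremities[0])))
    return adjacencies


def genome_adjacencies(chromosomes):
    """Union of the adjacency sets of all (markers, circular) chromosomes."""
    result = set()
    for markers, circular in chromosomes:
        result |= chromosome_adjacencies(markers, circular)
    return result


def scj_distance(genome_a, genome_b):
    """Cuts (adjacencies only in A) plus joins (adjacencies only in B)."""
    a = genome_adjacencies(genome_a)
    b = genome_adjacencies(genome_b)
    return len(a - b) + len(b - a)


# Hypothetical toy genomes: one linear chromosome, marker 2 inverted.
ancestor = [([1, 2, 3], False)]
descendant = [([1, -2, 3], False)]
print(scj_distance(ancestor, descendant))  # 4: two cuts and two joins
```

Because the distance reduces to set operations on adjacencies, each adjacency can be treated independently along the tree, which is consistent with the abstract's remark that SCJ is an exception to the NP-hardness of reconstruction under other rearrangement distances.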

Publications at Bielefeld University

Algorithmes pour la reconstruction de génomes ancestraux

Author: Gagnon Yves
Publication date: 01/05/2012

Ancestral genome inference is a decisive step in studying genome evolution. Knowing the genomes of extinct species, one can propose biological mechanisms explaining the divergences between the genomes of extant species. Various methods have been developed, falling into two categories: distance-based methods and synteny-based methods. Since the state of the art in genomic distances currently permits only a limited repertoire of rearrangements, synteny-based methods are more appropriate in practice for the time being.
We propose a synteny method for ancestral genome reconstruction based on a relaxed definition of gene adjacencies, permitting unequal gene content in extant genomes caused by gene losses and whole genome duplications (WGD). Simulation results demonstrate our method’s ability to assemble a solution into fewer contiguous ancestral regions (CARs) than other methods, while maintaining good reliability. Applications to data sets from yeasts and cereal species show results agreeing with other publications, notably the existence of nested chromosome fusion during the evolution of cereals.

Dépôt Institutionnel Numérique