INRIA a CCSD electronic archive server

Reconstructing the History of Yeast Genomes

Author: A Bhutkar
AU Sinha
B Dutrillaux
B Llorente
C Seoighe
C Zheng
D Sankoff
David Sankoff
E Tannier
FS Dietrich
Jianzhi Zhang
JL Gordon
KH Wolfe
KP Byrne
M Kellis
N Martin
P Pevzner
WJ Murphy
Publication venue: Public Library of Science
Publication date: 01/05/2009

Directory of Open Access Journals

arXiv.org e-Print Archive

A Unifying Model of Genome Evolution Under Parsimony

Author: A Bergeron
A Caprara
AE Darling
AW Xu
B Paten
B Raphael
Benedict Paten
C Chauve
D Bienstock
Daniel R Zerbino
David Haussler
E Tannier
G Bourque
Glenn Hickey
I Elias
J Edmonds
J Felsenstein
J Kim
J Ma
L Chindelevitch
LL Wang
M Alekseyev
M Bader
M Blanchette
M Shao
MD Braga
N El-Mabrouk
O Westesson
P Medvedev
S Hannenhalli
S Yancopoulos
W Day
W Miller
YS Song
Publication date: 12/05/2014

We present a data structure called a history graph that offers a practical basis for the analysis of genome evolution. It conceptually simplifies the study of parsimonious evolutionary histories by representing both substitutions and double cut and join (DCJ) rearrangements in the presence of duplications. The problem of constructing parsimonious history graphs thus subsumes related maximum parsimony problems in the fields of phylogenetic reconstruction and genome rearrangement. We show that tractable functions can be used to define upper and lower bounds on the minimum number of substitutions and DCJ rearrangements needed to explain any history graph. These bounds become tight for a special type of unambiguous history graph called an ancestral variation graph (AVG), which constrains in its combinatorial structure the number of operations required. We finally demonstrate that for a given history graph G, a finite set of AVGs describes all parsimonious interpretations of G, and this set can be explored with a few sampling moves.
Comment: 52 pages, 24 figures

eScholarship - University of California

On the PATHGROUPS approach to rapid small phylogeny

Author: A Caprara
AC Siepel
AW Xu
C Zheng
Chunfang Zheng
D Sankoff
David Sankoff
E Tannier
G Fertin
KP Byrne
N El-Mabrouk
R Warren
S Yancopoulos
SM Hedtke
Z Adam
Publication venue: BioMed Central
Publication date: 01/01/2011

We present a data structure enabling rapid heuristic solution to the ancestral genome reconstruction problem for given phylogenies under genomic rearrangement metrics. The efficiency of the greedy algorithm is due to fast updating of the structure during run time and a simple priority scheme for choosing the next step. Since accuracy deteriorates for sets of highly divergent genomes, we investigate strategies for improving accuracy and expanding the range of data sets where accurate reconstructions can be expected. These include a more refined priority system and a two-step look-ahead, as well as iterative local improvements based on the median version of the problem, incorporating simulated annealing. We apply this to a set of yeast genomes to corroborate a recent gene sequence-based phylogeny.

Directory of Open Access Journals

Multichromosomal median and halving problems under different genomic distances

Author: A Bergeron
A Caprara
C Zheng
Chunfang Zheng
D Bryant
D Sankoff
David Sankoff
E Ohlebusch
E Tannier
Eric Tannier
G Bourque
G Fertin
G Jean
G Tesler
G Watterson
I Pe'er
J Aury
J Mixtacki
L Lovasz
M Alekseyev
M Bernt
M Ozery-Flato
MR Garey
N El-Mabrouk
P Berman
P Pevzner
R Lenne
R Warren
S Hannenhalli
S Otto
S Yancopoulos
W Xu
X Chen
Y Lin
YC Lin
Z Adam
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Genome median and genome halving are combinatorial optimization problems that aim at reconstructing ancestral genomes as well as the evolutionary events leading from the ancestor to extant species. Exploring complexity issues is a first step towards devising efficient algorithms. The complexity of the median problem for unichromosomal genomes (permutations) has been settled for both the breakpoint distance and the reversal distance. Although the multichromosomal case has often been assumed to be a simple generalization of the unichromosomal case, it is also a relaxation, so complexity in this context does not follow from existing results and is open for all distances. Results: We settle here the complexity of several genome median and halving problems, including a surprising polynomial result for the breakpoint median and guided halving problems in genomes with circular and linear chromosomes, showing that the multichromosomal problem is actually easier than the unichromosomal problem. Still other variants of these problems are NP-complete, including the DCJ double distance problem, previously mentioned as an open question. We list the remaining open problems. Conclusion: This theoretical study clears up a wide swathe of the algorithmic study of genome rearrangements with multiple multichromosomal genomes.

Directory of Open Access Journals

INRIA a CCSD electronic archive server

Hal-Diderot

Sampling and counting genome rearrangement scenarios

Author: A Bergeron
A Caprara
A Darling
A Karzanov
A Ouangraoua
A Rajaraman
AC Siepel
B Larget
C Chauve
C Zheng
D Sankoff
DVM Braga
E Tannier
G Brightwell
Heather Smith
I Miklós
István Miklós
JS Liu
KM Swenson
L Lovász
LG Valiant
MA Alekseyev
MR Jerrum
N Metropolis
P Feijão
PL Erdős
R Durrett
R Warren
S Geman
S Hannenhalli
W Hastings
WM Fitch
Y Ajana
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2015

Even for moderate-size inputs, there is a tremendous number of optimal rearrangement scenarios, regardless of what the model is and which specific question is to be answered. Therefore, giving one optimal solution might be misleading and cannot be used for statistical inference. Statistically well-founded methods are necessary to sample uniformly from the solution space, after which a small number of samples is sufficient for statistical inference.

SZTAKI Publication Repository

Cerebellar ataxia with oculomotor apraxia type 1: clinical and genetic studies

Author: Beis J. M.
Brice A.
Chamayou C.
Demarquay G.
Durr A.
Habert M. O.
Koenig M.
Kuntzer T.
Le Ber I.
Moreira M. C.
Ochsner F.
Rivaud-Pechoux S.
Said G.
Tannier C.
Tardieu M.
Publication venue: Oxford University Press (OUP)
Publication date: 01/12/2003

Ataxia with ocular motor apraxia type 1 (AOA1) is an autosomal recessive cerebellar ataxia (ARCA) associated with oculomotor apraxia, hypoalbuminaemia and hypercholesterolaemia. The gene APTX, which encodes aprataxin, has been identified recently. We studied a large series of 158 families with non-Friedreich progressive ARCA. We identified 14 patients (nine families) with five different missense or truncating mutations in the aprataxin gene (W279X, A198V, D267G, W279R, IVS5+1), four of which were new. We determined the relative frequency of AOA1 which is 5%. Mutation carriers underwent detailed neurological, neuropsychological, electrophysiological, oculographic and biological examinations, as well as brain imaging. The mean age at onset was 6.8 +/- 4.8 years (range 2-18 years). Cerebellar ataxia with cerebellar atrophy on MRI and severe axonal sensorimotor neuropathy were present in all patients. In contrast, oculomotor apraxia (86%), hypoalbuminaemia (83%) and hypercholesterolaemia (75%) were variable. Choreic movements were frequent at onset (79%), but disappeared in the course of the disease in most cases. However, a remarkably severe and persistent choreic phenotype was associated with one of the mutations (A198V). Cognitive impairment was always present. Ocular saccade initiation was normal, but their duration was increased by the succession of multiple hypometric saccades that could clinically be confused with 'slow saccades'. We emphasize the phenotypic variability over the course of the disease. Cerebellar ataxia and/or chorea predominate at onset, but later on they are often partially masked by severe neuropathy, which is the most typical symptom in young adults. The presence of chorea, sensorimotor neuropathy, oculomotor anomalies, biological abnormalities, cerebellar atrophy on MRI and absence of the Babinski sign can help to distinguish AOA1 from Friedreich's ataxia on a clinical basis. 
The frequency of chorea at onset suggests that this diagnosis should also be considered in children with chorea who do not carry the IT15 mutation responsible for Huntington's disease.

Serveur académique lausannois

Sorting by reversals and block-interchanges with various weight assignments

Author: A Bergeron
C Mira
Chun-Yuan Lin
Chunhung Richard Lin
DA Bader
DA Christie
E Tannier
GH Lin
H Kaplan
I Elias
J Feng
KM Swenson
M Bader
MEMT Walter
N El-Mabrouk
N Eriksen
QP Gu
S Gog
S Hannenhalli
S Yancopoulos
T Hartman
V Bafna
VV Vazirani
Y Han
YC Lin
Ying Chih Lin
Publication venue: BioMed Central
Publication date: 01/01/2009

Gene order in rosid phylogeny, inferred from pairwise syntenies among extant genomes

Author: A Muñoz
A Ouangraoua
AP Chan
B Thomas
B Wang
C Zheng
Chunfang Zheng
D Bertrand
D Sankoff
David Sankoff
DE Soltis
E Lyons
E Tannier
F Forest
G Moore
GA Tuskan
H Tang
J Ma
JG Burleigh
JL Gordon
N El-Mabrouk
O Jaillon
R Ming
R Velasco
RJ Langham
S Huang
S Warshall
S Yancopoulos
V Shulaev
WJ Murphy
X Argout
Z Adam
Publication venue: Springer Science and Business Media LLC

Evolution through segmental duplications and losses: A Super-Reconciliation approach

Author: A Bergeron
A Deepak
A Tofigh
AA Abbasi
AV Aho
B Moret
B Vernot
C Chauve
C Semple
CM Zmasek
CW Stevens
David Sankoff
DEK Ferrier
E Tannier
G Bourque
G Brightwell
G Fertin
G Pruesse
G Sundstrom
GJ Szöllősi
I Holyer
J Garcia-Fernàndez
J Ma
J Paszek
J Sjöstrand
JD Thompson
JP Doyon
LX Zhang
M Constantinescu
M Goodman
M Hafeez
M Lafond
MP Ng
MS Bansal
N El-Mabrouk
O Akerborg
R Chaudhary
R Dondi
S Bérard
S Dreborg
S Kumar
TA Larsson
W Ajmal
Y Anselmetti
YC Wu
Publication venue: Springer Science and Business Media LLC
Publication date: 26/05/2020

The classical gene and species tree reconciliation, used to infer the history of gene gain and loss explaining the evolution of gene families, assumes independent evolution for each family. While this assumption is reasonable for genes that are far apart in the genome, it is not appropriate for genes grouped into syntenic blocks, which are more plausibly the result of concerted evolution. Here, we introduce the Super-Reconciliation problem, which consists in inferring a history of segmental duplication and loss events (involving a set of neighboring genes) leading to a set of present-day syntenies from a single ancestral one. In other words, we extend the traditional Duplication-Loss reconciliation problem of a single gene tree to a set of trees, accounting for segmental duplications and losses. Existence of a Super-Reconciliation depends on individual gene tree consistency. In addition, ignoring rearrangements implies that existence also depends on gene order consistency. We first show that the problem of reconstructing a most parsimonious Super-Reconciliation, if any, is NP-hard and give an exact exponential-time algorithm to solve it. Alternatively, we show that accounting for rearrangements in the evolutionary model, but still only minimizing segmental duplication and loss events, leads to an exact polynomial-time algorithm. We finally assess the time efficiency of the former exponential-time algorithm for the Duplication-Loss model on simulated datasets, and give a proof of concept on the opioid receptor genes.