75 research outputs found
Cassis: detection of genomic rearrangement breakpoints
Summary: Genomes undergo large structural changes that alter their organization. The chromosomal regions affected by these rearrangements are called breakpoints, while those which have not been rearranged are called synteny blocks. Lemaitre et al. presented a new method to precisely delimit rearrangement breakpoints in a genome by comparison with the genome of a related species. Receiving as input a list of one2one orthologous genes found in the genomes of two species, the method builds a set of reliable and non-overlapping synteny blocks and refines the regions that are not contained into them. Through the alignment of each breakpoint sequence against its specific orthologous sequences in the other species, we can look for weak similarities inside the breakpoint, thus extending the synteny blocks and narrowing the breakpoints. The identification of the narrowed breakpoints relies on a segmentation algorithm and is statistically assessed. Here, we present the package Cassis that implements this method of precise detection of genomic rearrangement breakpoints
A Unifying Model of Genome Evolution Under Parsimony
We present a data structure called a history graph that offers a practical
basis for the analysis of genome evolution. It conceptually simplifies the
study of parsimonious evolutionary histories by representing both substitutions
and double cut and join (DCJ) rearrangements in the presence of duplications.
The problem of constructing parsimonious history graphs thus subsumes related
maximum parsimony problems in the fields of phylogenetic reconstruction and
genome rearrangement. We show that tractable functions can be used to define
upper and lower bounds on the minimum number of substitutions and DCJ
rearrangements needed to explain any history graph. These bounds become tight
for a special type of unambiguous history graph called an ancestral variation
graph (AVG), which constrains in its combinatorial structure the number of
operations required. We finally demonstrate that for a given history graph ,
a finite set of AVGs describe all parsimonious interpretations of , and this
set can be explored with a few sampling moves.Comment: 52 pages, 24 figure
On the PATHGROUPS approach to rapid small phylogeny
We present a data structure enabling rapid heuristic solution to the ancestral genome reconstruction problem for given phylogenies under genomic rearrangement metrics. The efficiency of the greedy algorithm is due to fast updating of the structure during run time and a simple priority scheme for choosing the next step. Since accuracy deteriorates for sets of highly divergent genomes, we investigate strategies for improving accuracy and expanding the range of data sets where accurate reconstructions can be expected. This includes a more refined priority system, and a two-step look-ahead, as well as iterative local improvements based on a the median version of the problem, incorporating simulated annealing. We apply this to a set of yeast genomes to corroborate a recent gene sequence-based phylogeny
Multichromosomal median and halving problems under different genomic distances
<p>Abstract</p> <p>Background</p> <p>Genome median and genome halving are combinatorial optimization problems that aim at reconstructing ancestral genomes as well as the evolutionary events leading from the ancestor to extant species. Exploring complexity issues is a first step towards devising efficient algorithms. The complexity of the median problem for unichromosomal genomes (permutations) has been settled for both the breakpoint distance and the reversal distance. Although the multichromosomal case has often been assumed to be a simple generalization of the unichromosomal case, it is also a relaxation so that complexity in this context does not follow from existing results, and is open for all distances.</p> <p>Results</p> <p>We settle here the complexity of several genome median and halving problems, including a surprising polynomial result for the breakpoint median and guided halving problems in genomes with circular and linear chromosomes, showing that the multichromosomal problem is actually easier than the unichromosomal problem. Still other variants of these problems are NP-complete, including the DCJ double distance problem, previously mentioned as an open question. We list the remaining open problems.</p> <p>Conclusion</p> <p>This theoretical study clears up a wide swathe of the algorithmical study of genome rearrangements with multiple multichromosomal genomes.</p
Sampling and counting genome rearrangement scenarios
Even for moderate size inputs, there are a tremendous number of optimal rearrangement scenarios, regardless what the model is and which specific question is to be answered. Therefore giving one optimal solution might be misleading and cannot be used for statistical inferring. Statistically well funded methods are necessary to sample uniformly from the solution space and then a small number of samples are sufficient for statistical inferring
Cerebellar ataxia with oculomotor apraxia type 1: clinical and genetic studies
Ataxia with ocular motor apraxia type 1 (AOA1) is an autosomal recessive cerebellar ataxia (ARCA) associated with oculomotor apraxia, hypoalbuminaemia and hypercholesterolaemia. The gene APTX, which encodes aprataxin, has been identified recently. We studied a large series of 158 families with non-Friedreich progressive ARCA. We identified 14 patients (nine families) with five different missense or truncating mutations in the aprataxin gene (W279X, A198V, D267G, W279R, IVS5+1), four of which were new. We determined the relative frequency of AOA1 which is 5%. Mutation carriers underwent detailed neurological, neuropsychological, electrophysiological, oculographic and biological examinations, as well as brain imaging. The mean age at onset was 6.8 +/- 4.8 years (range 2-18 years). Cerebellar ataxia with cerebellar atrophy on MRI and severe axonal sensorimotor neuropathy were present in all patients. In contrast, oculomotor apraxia (86%), hypoalbuminaemia (83%) and hypercholesterolaemia (75%) were variable. Choreic movements were frequent at onset (79%), but disappeared in the course of the disease in most cases. However, a remarkably severe and persistent choreic phenotype was associated with one of the mutations (A198V). Cognitive impairment was always present. Ocular saccade initiation was normal, but their duration was increased by the succession of multiple hypometric saccades that could clinically be confused with 'slow saccades'. We emphasize the phenotypic variability over the course of the disease. Cerebellar ataxia and/or chorea predominate at onset, but later on they are often partially masked by severe neuropathy, which is the most typical symptom in young adults. The presence of chorea, sensorimotor neuropathy, oculomotor anomalies, biological abnormalities, cerebellar atrophy on MRI and absence of the Babinski sign can help to distinguish AOA1 from Friedreich's ataxia on a clinical basis. The frequency of chorea at onset suggests that this diagnosis should also be considered in children with chorea who do not carry the IT15 mutation responsible for Huntington's disease
Evolution through segmental duplications and losses : A Super-Reconciliation approach
The classical gene and species tree reconciliation, used to infer the history of gene gain and loss explaining the evolution of gene families, assumes an independent evolution for each family. While this assumption is reasonable for genes that are far apart in the genome, it is not appropriate for genes grouped into syntenic blocks, which are more plausibly the result of a concerted evolution. Here, we introduce the Super-Reconciliation problem which consists in inferring a history of segmental duplication and loss events (involving a set of neighboring genes) leading to a set of present-day syntenies from a single ancestral one. In other words, we extend the traditional Duplication-Loss reconciliation problem of a single gene tree, to a set of trees, accounting for segmental duplications and losses. Existency of a Super-Reconciliation depends on individual gene tree consistency. In addition, ignoring rearrangements implies that existency also depends on gene order consistency. We first show that the problem of reconstructing a most parsimonious Super-Reconciliation, if any, is NP-hard and give an exact exponential-time algorithm to solve it. Alternatively, we show that accounting for rearrangements in the evolutionary model, but still only minimizing segmental duplication and loss events, leads to an exact polynomial-time algorithm. We finally assess time efficiency of the former exponential time algorithm for the Duplication-Loss model on simulated datasets, and give a proof of concept on the opioid receptor genes
- âŠ