Search CORE

5 research outputs found

Efficient Sampling of Parsimonious Inversion Histories with Application to Genome Rearrangement in Yersinia

Author: Aaron E. Darling
Achaz
Ajana
Alekseyev
Altekar
Bader
Bader
Bergeron
Braga
Chain
Darling
Darling
Deng
Durrett
Geyer
Hannenhalli
Hastings
István Miklós
Kaplan
Larget
Larget
Liu
Metropolis
Miklós
Miklós
Miklós
Mélykúti
Perna
Sankoff
Siepel
Siepel
Sinclair
Song
Swenson
Tannier
von Neumann
York
York
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

Inversions are among the most common mutations acting on the order and orientation of genes in a genome, and polynomial-time algorithms exist to obtain a minimal length series of inversions that transform one genome arrangement to another. However, the minimum length series of inversions (the optimal sorting path) is often not unique as many such optimal sorting paths exist. If we assume that all optimal sorting paths are equally likely, then statistical inference on genome arrangement history must account for all such sorting paths and not just a single estimate. No deterministic polynomial algorithm is known to count the number of optimal sorting paths nor sample from the uniform distribution of optimal sorting paths

Crossref

SZTAKI Publication Repository

OPUS - University of Technology Sydney

PubMed Central

A heuristic approach to determine an appropriate number of topics in topic modeling

Author
Publication venue: BioMed Central
Publication date: 01/12/2015
Field of study

Springer - Publisher Connector

Moments Of Genome Evolution By Double Cut-and-join

Author: Biller
Eric
Laurent
Priscila
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/06/2016
Field of study

Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)We study statistical estimators of the number of genomic events separating two genomes under a Double Cut-and Join (DCJ) rearrangement model, by a method of moment estimation. We first propose an exact, closed, analytically invertible formula for the expected number of breakpoints after a given number of DCJs. This improves over the heuristic, recursive and computationally slower previously proposed one. Then we explore the analogies of genome evolution by DCJ with evolution of binary sequences under substitutions, permutations under transpositions, and random graphs. Each of these are presented in the literature with intuitive justifications, and are used to import results from better known fields. We formalize the relations by proving a correspondence between moments in sequence and genome evolution, provided substitutions appear four by four in the corresponding model. Eventually we prove a bounded error on two estimators of the number of cycles in the breakpoint graph after a given number of rearrangements, by an analogy with cycles in permutations and components in random graphs.1614Agence Nationale pour la Recherche, Ancestrome project [ANR-10-BINF-01-01]Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)FAPESP [2013/25084-2

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio da Producao Cientifica e Intelectual da Unicamp

Evolution of whole genomes through inversions:models and algorithms for duplicates, ancestors, and edit scenarios

Author: Swenson Krister
Publication venue: Lausanne, EPFL
Publication date: 15/10/2009
Field of study

Advances in sequencing technology are yielding DNA sequence data at an alarming rate – a rate reminiscent of Moore's law. Biologists' abilities to analyze this data, however, have not kept pace. On the other hand, the discrete and mechanical nature of the cell life-cycle has been tantalizing to computer scientists. Thus in the 1980s, pioneers of the field now called Computational Biology began to uncover a wealth of computer science problems, some confronting modern Biologists and some hidden in the annals of the biological literature. In particular, many interesting twists were introduced to classical string matching, sorting, and graph problems. One such problem, first posed in 1941 but rediscovered in the early 1980s, is that of sorting by inversions (also called reversals): given two permutations, find the minimum number of inversions required to transform one into the other, where an inversion inverts the order of a subpermutation. Indeed, many genomes have evolved mostly or only through inversions. Thus it becomes possible to trace evolutionary histories by inferring sequences of such inversions that led to today's genomes from a distant common ancestor. But unlike the classic edit distance problem where string editing was relatively simple, editing permutation in this way has proved to be more complex. In this dissertation, we extend the theory so as to make these edit distances more broadly applicable and faster to compute, and work towards more powerful tools that can accurately infer evolutionary histories. In particular, we present work that for the first time considers genomic distances between any pair of genomes, with no limitation on the number of occurrences of a gene. Next we show that there are conditions under which an ancestral genome (or one close to the true ancestor) can be reliably reconstructed. Finally we present new methodology that computes a minimum-length sequence of inversions to transform one permutation into another in, on average, O(n log n) steps, whereas the best worst-case algorithm to compute such a sequence uses O(n√n log n) steps

Infoscience - École polytechnique fédérale de Lausanne

The Metropolized Partial Importance Sampling MCMC mixes slowly on minimum reversal rearrangement paths

Author: Miklós István
Mélykúti B.
Swenson K.
Publication venue
Publication date: 01/01/2010
Field of study

SZTAKI Publication Repository