543 research outputs found

Reconstructing the Genomic Architecture of Mammalian Ancestors Using Multispecies Comparative Maps

Author: Bourque Guillaume
Murphy William J.
O'Brien Stephen J.
Pevzner Pavel
Tesler Glenn
Publication venue: NSUWorks
Publication date: 01/11/2003

Rapidly developing comparative gene maps in selected mammal species are providing an opportunity to reconstruct the genomic architecture of mammalian ancestors and study rearrangements that transformed this ancestral genome into existing mammalian genomes. Here, the recently developed Multiple Genome Rearrangement (MGR) algorithm is applied to human, mouse, cat and cattle comparative maps (with 311-470 shared markers) to impute the ancestral mammalian genome. Reconstructed ancestors consist of 70-100 conserved segments shared across the genomes that have been exchanged by rearrangement events along the ordinal lineages leading to modern species genomes. Genomic distances between species, dominated by inversions (reversals) and translocations, are presented in a first multispecies attempt using ordered mapping data to reconstruct the evolutionary exchanges that preceded modern placental mammal genomes

PubMed Central

NSU Works

Balanced Vertices in Trees and a Simpler Algorithm to Compute the Genomic Distance

Author: Bergeron
Gyárfás
Hannenhalli
Jens Stoye
Lajos Soukup
Péter L. Erdős
Publication venue: 'Elsevier BV'
Publication date: 15/04/2010

This paper provides a short and transparent solution for the covering cost of white-grey trees, which play a crucial role in the algorithm of Bergeron et al. to compute the rearrangement distance between two multichromosomal genomes in linear time (Theor. Comput. Sci., 410:5300-5316, 2009). In the process it introduces a new notion of center for trees, which seems to be interesting in its own right.

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Crossref

Repository of the Academy's Library

The Tandem Duplication Distance Is NP-Hard

Author: Lafond Manuel
Zhu Binhai
Zou Peng
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 37th International Symposium on Theoretical Aspects of Computer Science (STACS 2020)
Publication date: 12/06/2019

In computational biology, tandem duplication is an important biological phenomenon which can occur either at the genome or at the DNA level. A tandem duplication takes a copy of a genome segment and inserts it right after the segment; this can be represented as the string operation AXB → AXXB. Tandem exon duplications have been found in many species such as human, fly or worm, and have been largely studied in computational biology. The Tandem Duplication (TD) distance problem we investigate in this paper is defined as follows: given two strings S and T over the same alphabet, compute the smallest sequence of tandem duplications required to convert S to T. The natural question of whether the TD distance can be computed in polynomial time was posed in 2004 by Leupold et al. and had remained open, despite the fact that tandem duplications have received much attention ever since. In this paper, we prove that this problem is NP-hard, settling the 16-year old open problem. We further show that this hardness holds even if all characters of S are distinct. This is known as the exemplar TD distance, which is of special relevance in bioinformatics. One of the tools we develop for the reduction is a new problem called the Cost-Effective Subgraph, for which we obtain W[1]-hardness results that might be of independent interest. We finally show that computing the exemplar TD distance between S and T is fixed-parameter tractable. Our results open the door to many other questions, and we conclude with several open problems.
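
As a minimal illustration of the string operation described in this abstract (not the authors' method, which is a hardness proof), the sketch below applies a tandem duplication to a Python string and computes the TD distance of very short inputs by exhaustive breadth-first search. The function names `tandem_duplicate` and `td_distance` are hypothetical, and the search is exponential, so it is only practical for tiny strings.

```python
from collections import deque


def tandem_duplicate(s, i, j):
    """Copy the segment s[i:j] and insert the copy right after it (AXB -> AXXB)."""
    return s[:j] + s[i:j] + s[j:]


def td_distance(source, target):
    """Smallest number of tandem duplications turning source into target.

    Brute-force BFS over all reachable strings; returns None if target
    cannot be reached. Assumed helper for illustration only.
    """
    if source == target:
        return 0
    seen = {source}
    queue = deque([(source, 0)])
    while queue:
        current, steps = queue.popleft()
        n = len(current)
        for i in range(n):
            for j in range(i + 1, n + 1):
                nxt = tandem_duplicate(current, i, j)
                # Duplications only lengthen a string, so prune overshoots.
                if len(nxt) > len(target) or nxt in seen:
                    continue
                if nxt == target:
                    return steps + 1
                seen.add(nxt)
                queue.append((nxt, steps + 1))
    return None
```

For example, `tandem_duplicate("AXB", 1, 2)` duplicates the middle segment `X`, and `td_distance("AXB", "AXXB")` needs a single step.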

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

Approximating the double-cut-and-join distance between unsigned genomes

Author: A Bergeron
A Caprara
CM Papadimitriou
G Lin
H Jiang
JD Kececioglu
Jiadong Yu
MM Halldórsson
R Sun
Ruimin Sun
S Hannenhalli
S Yancopoulos
V Bafna
X Chen
Xin Chen
Publication venue: BioMed Central
Publication date: 01/01/2011

In this paper we study the problem of sorting unsigned genomes by double-cut-and-join operations, where genomes allow a mix of linear and circular chromosomes to be present. First, we formulate an equivalent optimization problem, called maximum cycle/path decomposition, which is aimed at finding a largest collection of edge-disjoint cycles/AA-paths/AB-paths in a breakpoint graph. Then, we show that the problem of finding a largest collection of edge-disjoint cycles/AA-paths/AB-paths of length no more than l can be reduced to the well-known degree-bounded k-set packing problem with k = 2l. Finally, a polynomial-time approximation algorithm for the problem of sorting unsigned genomes by double-cut-and-join operations is devised, which achieves the approximation ratio for any positive ε. For the restricted variation where each genome contains only one linear chromosome, the approximation ratio can be further improved t

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DR-NTU (Digital Repository of NTU)

Multichromosomal median and halving problems under different genomic distances

Author: A Bergeron
A Caprara
C Zheng
Chunfang Zheng
D Bryant
D Sankoff
David Sankoff
E Ohlebusch
E Tannier
Eric Tannier
G Bourque
G Fertin
G Jean
G Tesler
G Watterson
I Pe'er
J Aury
J Mixtacki
L Lovasz
M Alekseyev
M Bernt
M Ozery-Flato
MR Garey
N El-Mabrouk
P Berman
P Pevzner
R Lenne
R Warren
S Hannenhalli
S Otto
S Yancopoulos
W Xu
X Chen
Y Lin
YC Lin
Z Adam
Publication venue: BioMed Central
Publication date: 01/01/2009

Background: Genome median and genome halving are combinatorial optimization problems that aim at reconstructing ancestral genomes as well as the evolutionary events leading from the ancestor to extant species. Exploring complexity issues is a first step towards devising efficient algorithms. The complexity of the median problem for unichromosomal genomes (permutations) has been settled for both the breakpoint distance and the reversal distance. Although the multichromosomal case has often been assumed to be a simple generalization of the unichromosomal case, it is also a relaxation so that complexity in this context does not follow from existing results, and is open for all distances. Results: We settle here the complexity of several genome median and halving problems, including a surprising polynomial result for the breakpoint median and guided halving problems in genomes with circular and linear chromosomes, showing that the multichromosomal problem is actually easier than the unichromosomal problem. Still other variants of these problems are NP-complete, including the DCJ double distance problem, previously mentioned as an open question. We list the remaining open problems. Conclusion: This theoretical study clears up a wide swathe of the algorithmical study of genome rearrangements with multiple multichromosomal genomes.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

INRIA a CCSD electronic archive server

PubMed Central

Hal-Diderot

Gossip is synteny: Incomplete gossip and the syntenic distance between genomes

Author: Bafna
Baker
Blanchette
Bradley Barbazuk
Bumby
Caprara
Chang
Christie
Cot
DasGupta
Ehrlich
Eriksson
Ferretti
Gu
Hajnal
Hannenhalli
Hedetniemi
Hurkens
Kececioglu
Kleinberg
Lee
Liben-Nowell
McLysaght
Ranz
Richards
Sankoff
Seoighe
Tijdeman
Trachtulec
Voss
Publication venue: 'Elsevier BV'
Publication date

Crossref

Sobre modelos de rearranjo de genomas [On genome rearrangement models]

Author: Feijão Pedro Cipriano, 1975-
Publication venue: [s.n.]
Publication date: 21/08/2018

Advisor: João Meidanis. Doctoral thesis, Universidade Estadual de Campinas, Instituto de Computação.
Abstract: Genome rearrangements are events where large blocks of DNA exchange places during evolution. With the growing availability of whole genome data, the analysis of these events can be a very important and promising tool for understanding evolutionary genomics. Several mathematical models of genome rearrangement have been proposed in the last 20 years. In this thesis, we propose two new rearrangement models. The first was introduced as an alternative definition of the breakpoint distance. The breakpoint distance is one of the most straightforward genome comparison measures, but when it comes to defining it precisely for multichromosomal genomes, there is more than one way to go about it. Pevzner and Tesler gave a definition in a 2003 paper, and Tannier et al. defined it differently in 2008. In this thesis we provide yet another alternative, calling it single-cut-or-join (SCJ). We show that several genome rearrangement problems, such as genome median, genome halving and small parsimony, become easy for SCJ, and provide polynomial time algorithms for them. The second model we introduce is the Adjacency Algebraic Theory, an extension of the Algebraic Formalism proposed by Meidanis and Dias that allows the modeling of linear chromosomes, the main limitation of the original formalism, which could deal with circular chromosomes only. We believe that the algebraic formalism is an interesting alternative for solving rearrangement problems, with a different perspective that could complement the more commonly used combinatorial graph-theoretic approach. We present polynomial time algorithms to compute the algebraic distance and find rearrangement scenarios between two genomes. We show how to compute the rearrangement distance from the adjacency graph, for an easier comparison with other rearrangement distances. Finally, we show how all classic rearrangement operations can be modeled using the algebraic theory.
Degree: Doctorate in Computer Science (Doutor em Ciência da Computação).
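
To make the single-cut-or-join idea in this abstract concrete, here is a minimal sketch. It assumes the adjacency-set formulation used in the SCJ literature, in which the distance between two genomes over the same genes is the size of the symmetric difference of their adjacency sets; that formula is not stated in this listing. The genome encoding (signed integers, a per-chromosome circularity flag) and the names `adjacencies` and `scj_distance` are hypothetical choices for illustration.

```python
def adjacencies(chromosomes):
    """Return the set of adjacencies of a genome.

    chromosomes: list of (genes, is_circular) pairs, where genes is a list
    of nonzero signed integers; the sign gives the gene's orientation.
    Each gene g has two extremities, tail (g, "t") and head (g, "h").
    """
    def left(g):
        # Extremity met first when reading gene g in chromosome order.
        return (abs(g), "t" if g > 0 else "h")

    def right(g):
        # Extremity met last when reading gene g in chromosome order.
        return (abs(g), "h" if g > 0 else "t")

    adj = set()
    for genes, is_circular in chromosomes:
        for a, b in zip(genes, genes[1:]):
            adj.add(frozenset((right(a), left(b))))
        if is_circular and genes:
            # A circular chromosome also joins its last gene to its first.
            adj.add(frozenset((right(genes[-1]), left(genes[0]))))
    return adj


def scj_distance(genome_a, genome_b):
    """Cuts needed to remove a's extra adjacencies plus joins to add b's."""
    return len(adjacencies(genome_a) ^ adjacencies(genome_b))
```

Under this encoding, inverting gene 2 in the linear chromosome `[1, 2, 3]` destroys two adjacencies and creates two new ones, so `scj_distance([([1, 2, 3], False)], [([1, -2, 3], False)])` is 4, while reading a chromosome from the other end (`[-3, -2, -1]`) leaves the adjacency set unchanged.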

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Repositorio da Producao Cientifica e Intelectual da Unicamp

A Linear Time Algorithm for an Extended Version of the Breakpoint Double Distance

Author: Brockmann Leonie R.
Klerx Katharina
Stoye Jens
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 22nd International Workshop on Algorithms in Bioinformatics (WABI 2022)
Publication date: 01/01/2022

Dagstuhl Research Online Publication Server