Search CORE

3 research outputs found

New Genome Similarity Measures based on Conserved Gene Adjacencies

Author: Araujo Eloi
Dantas Simone
Deshpande Shachi
Doerr Daniel
Kowada Luis Antonio B.
Moret Bernard M. E.
Stoye Jens
Publication venue: 'Mary Ann Liebert Inc'
Publication date: 01/01/2017
Field of study

Many important questions in molecular biology, evolution, and biomedicine can be addressed by comparative genomic approaches. One of the basic tasks when comparing genomes is the definition of measures of similarity (or dissimilarity) between two genomes, for example, to elucidate the phylogenetic relationships between species. The power of different genome comparison methods varies with the underlying formal model of a genome. The simplest models impose the strong restriction that each genome under study must contain the same genes, each in exactly one copy. More realistic models allow several copies of a gene in a genome. One speaks of gene families, and comparative genomic methods that allow this kind of input are called gene family-based. The most powerfulbut also most complexmodels avoid this preprocessing of the input data and instead integrate the family assignment within the comparative analysis. Such methods are called gene family-free. In this article, we study an intermediate approach between family-based and family-free genomic similarity measures. Introducing this simpler model, called gene connections, we focus on the combinatorial aspects of gene family-free genome comparison. While in most cases, the computational costs to the general family-free case are the same, we also find an instance where the gene connections model has lower complexity. Within the gene connections model, we define three variants of genomic similarity measures that have different expression powers. We give polynomial-time algorithms for two of them, while we show NP-hardness for the third, most powerful one. We also generalize the measures and algorithms to make them more robust against recent local disruptions in gene order. Our theoretical findings are supported by experimental results, proving the applicability and performance of our newly defined similarity measures

Infoscience - École polytechnique fédérale de Lausanne

Publications at Bielefeld University

Dspace at IIT Bombay

New Genome Similarity Measures based on Conserved Gene Adjacencies

Author: Dantas Simone
Doerr Daniel
Kowada Luis Antonio B.
Singh Mona
Stoye Jens
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Kowada LAB, Doerr D, Dantas S, Stoye J. New Genome Similarity Measures based on Conserved Gene Adjacencies. In: Singh M, ed. Research in Computational Molecular Biology. RECOMB 2016. Lecture Notes in Bioinformatics (LNBI). Vol 9649. Springer; 2016: 204-224

Publications at Bielefeld University

Embracing heterogeneity: coalescing the Tree of Life and the future of phylogenomics

Author: Alexander Schliep
Alexandre Antonelli
Bengt Oxelman
Bernard Pfeil
Christine D. Bacon
Fernanda P. Werneck
Graham Jones
Gustavo A. Bravo
Hélène Morlon
John Wiedenhoeft
Krzysztof Bartoszek
L. Lacey Knowles
Luay K. Nakhleh
Mozes P. K. Blom
Niklas Wahlberg
Sandi Willows-Munro
Sangeet Lamichhaney
Scott V. Edwards
Stella Huynh
Thomas Marcussen
Publication venue: 'PeerJ'
Publication date: 01/01/2019
Field of study

Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are made possible with routine whole-genome sequencing of taxa of interest. We discuss limitations and potential extensions of a model supporting innovation in phylogenomics today, the multispecies coalescent model (MSC). Macroevolutionary models that use phylogenies, such as character mapping, often ignore the heterogeneity on which building phylogenies increasingly rely and suggest that assimilating such heterogeneity is an important goal moving forward. Finally, we argue that an integrative cyberinfrastructure linking all steps of the process of building the ToL, from specimen acquisition in the field to publication and tracking of phylogenomic data, as well as a culture that values contributors at each step, are essential for progress

Publikationer från Linköpings universitet

Lund University Publications

Directory of Open Access Journals

Chalmers Research

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Repositório do INPA

NORA - Norwegian Open Research Archives