Background: Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics composed of multiple duplicated fragments. This complex genomic organization complicates analysis of the evolutionary history of these sequences. One model proposed to explain these mosaic patterns is repeated aggregation and subsequent duplication of genomic sequences. Results: We describe a polynomial-time exact algorithm to compute duplication distance, a genomic distance defined as the most parsimonious way to build a target string by repeatedly copying substrings of a fixed source string. This distance models the process of repeated aggregation and duplication. We also describe extensions of this distance to include certain types of substring deletions and inversions. Finally, we provide a description of a sequence of duplication events as a context-free grammar (CFG). Conclusion: These new genomic distances will permit more biologically realistic analyses of segmental duplications in genomes.
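To make the definition of duplication distance concrete, the following is a minimal brute-force sketch and not the polynomial-time algorithm of the paper. It assumes that each operation copies one contiguous substring of the source and inserts it at any position of the string built so far; that reading of "copying" is an assumption, as is the function name `duplication_distance`. Under this assumption, undoing the last operation means deleting one contiguous block of the current string that matches a substring of the source, so the distance is the minimum number of such deletions needed to empty the target.

```python
from functools import lru_cache


def duplication_distance(source: str, target: str) -> int:
    """Brute-force duplication distance (exponential time; toy inputs only).

    Assumption: one operation copies a contiguous substring of `source`
    and inserts it anywhere in the string under construction.
    """
    if not set(target) <= set(source):
        raise ValueError("target uses characters that do not occur in source")

    # Every non-empty contiguous substring of the source string.
    pieces = {
        source[i:j]
        for i in range(len(source))
        for j in range(i + 1, len(source) + 1)
    }

    @lru_cache(maxsize=None)
    def solve(current: str) -> int:
        # The empty string needs no operations.
        if not current:
            return 0
        # Upper bound: copy each character with its own operation.
        best = len(current)
        # Undo a candidate "last" operation: delete one contiguous block
        # that matches a substring of the source, then recurse.
        for i in range(len(current)):
            for j in range(i + 1, len(current) + 1):
                if current[i:j] in pieces:
                    rest = current[:i] + current[j:]
                    best = min(best, 1 + solve(rest))
        return best

    return solve(target)
```

For example, with source `abc` the target `aabcbc` needs two operations under this model: copy `abc`, then insert a second copy of `abc` after the first `a`. The search is exponential in the target length, so it is useful only for checking small cases by hand.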

Benjamin J Raphael

CL Kahn

Crystal L Kahn

D Bertrand

D Sankoff

J Bailey

J Ma

K Chaudhuri

M Johnson

M Lajoie

M Marron

MA Alekseyev

N El-Mabrouk

O Elemento

P Pevzner

Shay Mozes

X Chen

Y Zhang

Z Jiang

English

PubMed

Springer - Publisher Connector

Efficient algorithms for analyzing segmental duplications with deletions and inversions in genomes


Crossref



Directory of Open Access Journals

Algorithms for Molecular Biology

