Search CORE

6 research outputs found

Hidden breakpoints in genome alignments

Author: A. Rambaut
A.C.E. Darling
A.E. Darling
A.L. Delcher
C.D. Greenman
D. Medini
E. Tannier
G. Fudenberg
M. Blanchette
M. Nowacki
M.A. Umbarger
S. De
S. Schwartz
S.V. Angiuoli
V. Kolmogorov
Publication venue
Publication date: 01/01/2012
Field of study

During the course of evolution, an organism's genome can undergo changes that affect the large-scale structure of the genome. These changes include gene gain, loss, duplication, chromosome fusion, fission, and rearrangement. When gene gain and loss occurs in addition to other types of rearrangement, breakpoints of rearrangement can exist that are only detectable by comparison of three or more genomes. An arbitrarily large number of these "hidden" breakpoints can exist among genomes that exhibit no rearrangements in pairwise comparisons. We present an extension of the multichromosomal breakpoint median problem to genomes that have undergone gene gain and loss. We then demonstrate that the median distance among three genomes can be used to calculate a lower bound on the number of hidden breakpoints present. We provide an implementation of this calculation including the median distance, along with some practical improvements on the time complexity of the underlying algorithm. We apply our approach to measure the abundance of hidden breakpoints in simulated data sets under a wide range of evolutionary scenarios. We demonstrate that in simulations the hidden breakpoint counts depend strongly on relative rates of inversion and gene gain/loss. Finally we apply current multiple genome aligners to the simulated genomes, and show that all aligners introduce a high degree of error in hidden breakpoint counts, and that this error grows with evolutionary distance in the simulation. Our results suggest that hidden breakpoint error may be pervasive in genome alignments.Comment: 13 pages, 4 figure

arXiv.org e-Print Archive

Crossref

OPUS - University of Technology Sydney

Repository: Freie Universität Berlin (FU), Math Department (fu_mi_publications)

Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii.

Author: Angiuoli S.V.
Carlton J.M.
Kooij T.W.
Suh B.B.
Publication venue: Health Sciences Research Commons
Publication date: 01/01/2002
Field of study

George Washington University: Health Sciences Research Commons (HSRC)

RRCA: Ultra-Fast Multiple In-species Genome Alignments

Author: C. Kemena
C. Notredame
D. Gusfield
H. Carillo
H. Mewes
H.J. Yu
J. Cao
J. Ziv
K. Katoh
K.M. Wong
L. Wang
M. Brudno
M. Larkin
M. Roytberg
M. Schmidt
M.I. Abouelhoda
S. Deorowicz
S. Deorowicz
S. Kreft
S. Wandelt
S.B. Needleman
S.V. Angiuoli
X. Chen
Z. Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Crossref

Genome sequence of Theileria parva, a bovine pathogen that transforms lymphocytes

Author: Allen J.
Angiuoli S.V.
Berriman M.
Bishop Richard P.
Carlton J.M.
Crabtree J.
Creasy T.H.
Domingo A.R.
Feldblyum T.V.
Fitzhugh H.A.
Fraser C.M.
Gardner M.J.
Haas B.
Hall N.
Jiang L.
Lu C.
Lynn , J.
Mann D.J.
Morzaria S.P.
Nene Vishvanath
Nierman W.C.
Pain A.
Paulsen I.T.
Pertea M.
Ralph S.A.
Ren Q.
Salzberg S.L.
Sato S.
Shah Tushaar
Shallom S.J.
Shoaibi A.
Silva Joana C.
Suh B.
Taracha E.L.N.
Utterback T.R.
Venter J. Craig
Villiers Etienne P. de
Wasawo D.
Weaver B.
Weidman J.
White O.R.
Wilson R.J.M.
Wortman J.R.
Xiong Z.
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 03/07/2013
Field of study

We report the genome sequence of Theileria parva, an apicomplexan pathogen causing economic losses to smallholder farmers in Africa. The parasite chromosomes exhibit limited conservation of gene synteny with Plasmodium falciparum, and its plastid-like genome represents the first example where all apicoplast genes are encoded on one DNA strand. We tentatively identify proteins that facilitate parasite segregation during host cell cytokinesis and contribute to persistent infection of transformed host cells. Several biosynthetic pathways are incomplete or absent, suggesting substantial metabolic dependence on the host cell. One protein family that may generate parasite antigenic diversity is not telomere-associated

CGSpace

Draft Genome of the Filarial Nematode Parasite Brugia malayi

Author: Allen J.E.
Allen J.E.
Barton G.J.
Barton G.J.
Ben-Wen Li B-W.
Ben-Wen Li B-W.
Brian Haas B.
Brian Haas B.
Caler E.
Caler E.
Carlow C.K.S.
Carlow C.K.S.
Crabtree J.
Crabtree J.
Crawford M.J.
Crawford M.J.
Daub J.
Daub J.
Delcher A.L.
Delcher A.L.
Dimmic M.W.
Dimmic M.W.
El-Sayed N.M.
El-Sayed N.M.
Estes C.F.
Estes C.F.
Feldblyum T.
Feldblyum T.
Foster J.M.
Foster J.M.
Ganatra M.
Ganatra M.
Ghedin E.
Ghedin E.
Gregory W.F.
Gregory W.F.
Guiliano D.B.
Guiliano D.B.
Ian Korf I.
Ian Korf I.
Jinming Jin J.
Jinming Jin J.
Johnson N.M.
Johnson N.M.
Koo H.
Koo H.
Lindblom T.H.
Lindblom T.H.
Lustigman S.
Lustigman S.
Ma D.
Ma D.
Maina C.V.
Maina C.V.
Martin D.M.A.
Martin D.M.A.
McCarter J.P.
McCarter J.P.
McReynolds L.
McReynolds L.
Miranda-Saavedra D.
Miranda-Saavedra D.
Mitreva M.
Mitreva M.
Paolo Amedeo P.
Paolo Amedeo P.
Pertea M.
Pertea M.
Pop M.
Pop M.
Richard Komuniecki R.
Richard Komuniecki R.
Salzberg S.L.
Salzberg S.L.
Samuel V. Angiuoli S.V.
Samuel V. Angiuoli S.V.
Sandra Laney S.
Sandra Laney S.
Sanjay Kumar S.
Sanjay Kumar S.
Schatz M.
Schatz M.
Schobel S.
Schobel S.
Shumway M.
Shumway M.
Spiro D.
Spiro D.
Tallon L.
Tallon L.
Todd Creasy T.
Todd Creasy T.
Wang S.
Wang S.
Wen Li W.
Wen Li W.
White O.
White O.
Wortman J.R.
Wortman J.R.
Zhao Q.
Zhao Q.
Publication venue: 'American Association for the Advancement of Science (AAAS)'
Publication date: 01/01/2007
Field of study

Parasitic nematodes that cause elephantiasis and river blindness threaten hundreds of millions of people in the developing world. We have sequenced the ∼90 megabase (Mb) genome of the human filarial parasite Brugia malayi and predict ∼11,500 protein coding genes in 71 Mb of robustly assembled sequence. Comparative analysis with the free-living, model nematode Caenorhabditis elegans revealed that, despite these genes having maintained little conservation of local synteny during ∼350 million years of evolution, they largely remain in linkage on chromosomal units. More than 100 conserved operons were identified. Analysis of the predicted proteome provides evidence for adaptations of B. malayi to niches in its human and vector hosts and insights into the molecular basis of a mutualistic relationship with its Wolbachia endosymbiont. These findings offer a foundation for rational drug design

WestminsterResearch

Large-scale multiple sequence alignment and phylogeny estimation

Author: A. Bouchard-Côté
A. Darling
A. Darling
A. Drummond
A. Graybeal
A. Lemmon
A. Lobkovsky
A. Loytynoja
A. Löytynoja
A. Mokaddem
A. Neuwald
A. Novák
A. Stamatakis
A. Toth
B. Baum
B. Blackburne
B. Blaisdell
B. Boussau
B. Boussau
B. Dwivedi
B. Larget
B. Ma
B. Moret
B. Paten
B. Qian
B. Rannala
B. Raphael
B. Redelings
B. Redelings
B. Sennblad
B. Thatte
B.G. Hall
C. Blair
C. Chauve
C. Daskalakis
C. Daskalakis
C. Daskalakis
C. Daskalakis
C. Dessimoz
C. Do
C. Do
C. Kemena
C. Kosiol
C. Lakner
C. Linder
C. Roquet
C. Semple
C. Tuffley
C. Wilke
C.R. Linder
C.W. Dunn
D. Brown
D. Brown
D. Chen
D. Gardner
D. Gerard
D. Hillis
D. Hillis
D. Hillis
D. Huson
D. Huson
D. Liberles
D. McDonald
D. Metzler
D. Mindell
D. Morrison
D. Morrison
D. Neves
D. Pollock
D. Robinson
D. Sankoff
D. Sankoff
D. Swofford
D. Wu
D. Zwickl
D.A. Morrison
D.D. Pollock
D.G. Brown
D.J. Zwickl
E. Allman
E. Koonin
E. Mossel
E. Mossel
E. Mossel
E. Preusse
E. Rivas
E. Rivas
E. Sessa
E.P. Nawrocki
E.S. Allman
E.S. Allman
E.S. Allman
F. Abascal
F. Delsuc
F. Matsen
F. Ronquist
F. Ronquist
F. Sievers
F.A. Matsen
G. Ganapathy
G. Ganapathy
G. Giribet
G. Jin
G. Jin
G. Lunter
G. Lunter
G. Lunter
G. Raghava
G. Reeck
G. Sims
G.A. Lunter
H. Carroll
H. Zhou
I. Dubchak
I. Gronau
I. Holmes
I. Mayrose
I. Miklós
I. Miklós
I.L.V. Walle
J. Adachi
J. Agren
J. Blazewicz
J. Chang
J. Eisen
J. Felsenstein
J. Felsenstein
J. Felsenstein
J. Gogarten
J. Gogarten
J. Hartigan
J. Heled
J. Huelsenbeck
J. Huelsenbeck
J. Pei
J. Pei
J. Thompson
J. Thorne
J. Thorne
J. Wiens
J. Wiens
J. Yang
J.A. Eisen
J.D. Thompson
J.H. Degnan
J.H. Degnan
J.L. Thorne
J.L. Thorne
J.L. Thorne
J.P. Doyon
J.P. Huelsenbeck
J.S. Papadopoulos
J.T. Chang
K. Atteson
K. Katoh
K. Katoh
K. Kjer
K. Liu
K. Liu
K. Liu
K. Liu
K. Liu
K. Muller
K. Rice
K. Sjolander
K. Yang
K. Yoshizawa
K.C. Nixon
L. Arvestad
L. Iersel van
L. Liu
L. Nagy
L. Nakhleh
L. Nakhleh
L. Nakhleh
L. Nakhleh
L. Nakhleh
L. Nakhleh
L. Wang
L. Wang
L. Wang
L. Wang
L. Zhang
L.-S. Wang
L.R. Foulds
M. Aniba
M. Bayzid
M. Bonet
M. Brudno
M. Chang
M. Csurös
M. Csürős
M. Dayhoff
M. Galperin
M. Holder
M. Moody
M. Pagel
M. Price
M. Scherrer
M. Simmons
M. Simmons
M. Stark
M. Steel
M. Steel
M. Swenson
M. Swenson
M.A. Larkin
M.A. Steel
M.A. Steel
M.A. Steel
M.A. Steel
M.A. Suchard
M.J. Claesson
M.N. Price
M.R. Lacey
M.T. Hallett
N. Galtier
N. Goldman
N. Goldman
N. Saitou
N. Stojanovic
N.M. Kopelman
O. Gascuel
O. Gill
O. O’Sullivan
O. Penn
O.R.P. Bininda-Emonds
P. Arunapuram
P. Erdos
P. Erdos
P. Erdos
P. Foster
P. Gardner
P. Goloboff
P. Lapierre
P. Lewis
P. Lopez
P.A. Goloboff
P.T. Chardin de
R. Chaudhary
R. Chowdhury
R. Desper
R. Edgar
R. Finn
R. Fleissner
R. Hagopian
R. Vos
R.C. Edgar
R.C. Edgar
S. Angiuoli
S. Capella-Gutiérrez
S. Eddy
S. Edwards
S. Evans
S. Guindon
S. Hartmann
S. Kumar
S. Le
S. Lehtonen
S. Mirarab
S. Mirarab
S. Nelesen
S. Nelesen
S. Roch
S. Roch
S. Roch
S. Roch
S. Roch
S. Smith
S. Smith
S. Snir
S. Tavaré
S. Vinga
S. Whelan
S. Whelan
S. Whelan
S. Whelan
S. Whelan
S.A. Berger
S.A. Berger
S.F. Altschul
S.R. Jun
S.V. Edwards
T. DeSantis
T. Dobzhansky
T. Lassmann
T. Ogden
T. Ogden
T. Phuong
T. Warnow
T. Warnow
T. Warnow
T. Wheeler
T. Yuri
T.H. Ogden
U. Bergthorsson
U. Bergthorsson
U. Roshan
U. Roshan
U. Roshan
U. Roshan
V. Barriel
V. King
W. Fletcher
W. Maddison
W. Wheeler
W.J. Bruno
X. Deng
X. Gu
X. Liu
Y. Lin
Y. Wolf
Y. Wu
Y. Yu
Y. Yu
Y. Yu
Publication venue
Publication date: 01/01/2013
Field of study

With the advent of next generation sequencing technologies, alignment and phylogeny estimation of datasets with thousands of sequences is being attempted. To address these challenges, new algorithmic approaches have been developed that have been able to provide substantial improvements over standard methods. This paper focuses on new approaches for ultra-large tree estimation, including methods for co-estimation of alignments and trees, estimating trees without needing a full sequence alignment, and phylogenetic placement. While the main focus is on methods with empirical performance advantages, we also discuss the theoretical guarantees of methods under Markov models of evolution. Finally, we include a discussion of the future of large-scale phylogenetic analysis

CiteSeerX

Crossref