Search CORE

University of Queensland eSpace

A two-phase approach for detecting recombination in nucleotide sequences

Author: Beiko Robert G.
Chan Cheong Xin
Ragan Mark A.
Publication venue
Publication date: 01/01/2007
Field of study

Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. Delineating recombination events is important in the study of molecular evolution, as inference of such events provides a clearer picture of the phylogenetic relationships among different gene sequences or genomes. Nevertheless, detecting recombination events can be a daunting task, as the performance of different recombinationdetecting approaches can vary, depending on evolutionary events that take place after recombination. We recently evaluated the effects of postrecombination events on the prediction accuracy of recombination-detecting approaches using simulated nucleotide sequence data. The main conclusion, supported by other studies, is that one should not depend on a single method when searching for recombination events. In this paper, we introduce a two-phase strategy, applying three statistical measures to detect the occurrence of recombination events, and a Bayesian phylogenetic approach in delineating breakpoints of such events in nucleotide sequences. We evaluate the performance of these approaches using simulated data, and demonstrate the applicability of this strategy to empirical data. The two-phase strategy proves to be time-efficient when applied to large datasets, and yields high-confidence results.Comment: 5 pages, 3 figures. Chan CX, Beiko RG and Ragan MA (2007). A two-phase approach for detecting recombination in nucleotide sequences. In Hazelhurst S and Ramsay M (Eds) Proceedings of the First Southern African Bioinformatics Workshop, 28-30 January, Johannesburg, 9-1

arXiv.org e-Print Archive

University of Queensland eSpace

A Model-Based Analysis of GC-Biased Gene Conversion in the Human and Chimpanzee Genomes

Author: A Auton
A Kong
A Navarro
A Necşulea
A Ratnakumar
A Siepel
Adam Siepel
AJ Jeffreys
AJ Webb
AP Boyle
BC Lamb
C Kosiol
CC Spencer
CF Mugal
D Karolchik
D Kostka
Dennis Kostka
E Mancera
G Marais
Graham Coop
J Berglund
J Harrow
J Romiguier
JA Capra
JM Chen
John A. Capra
JW IJdo
K Lindblad-Toh
K Pollard
Katherine S. Pollard
L Arbiza
L Duret
L Duret
LR Meyer
M Blanchette
M Hasegawa
Melissa J. Hubisz
MJ Hubisz
N Galtier
N Galtier
N Lartillot
P Flicek
P Stenson
RD George
S Glémin
S Katzman
S Katzman
S Myers
S Myers
SE Ptak
ST Sherry
T Nagylaki
TC Brown
TR Dreszer
W Winckler
Y Zhang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2013
Field of study

GC-biased gene conversion (gBGC) is a recombination-associated process that favors the fixation of G/C alleles over A/T alleles. In mammals, gBGC is hypothesized to contribute to variation in GC content, rapidly evolving sequences, and the fixation of deleterious mutations, but its prevalence and general functional consequences remain poorly understood. gBGC is difficult to incorporate into models of molecular evolution and so far has primarily been studied using summary statistics from genomic comparisons. Here, we introduce a new probabilistic model that captures the joint effects of natural selection and gBGC on nucleotide substitution patterns, while allowing for correlations along the genome in these effects. We implemented our model in a computer program, called phastBias, that can accurately detect gBGC tracts about 1 kilobase or longer in simulated sequence alignments. When applied to real primate genome sequences, phastBias predicts gBGC tracts that cover roughly 0.3% of the human and chimpanzee genomes and account for 1.2% of human-chimpanzee nucleotide differences. These tracts fall in clusters, particularly in subtelomeric regions; they are enriched for recombination hotspots and fast-evolving sequences; and they display an ongoing fixation preference for G and C alleles. They are also significantly enriched for disease-associated polymorphisms, suggesting that they contribute to the fixation of deleterious alleles. The gBGC tracts provide a unique window into historical recombination processes along the human and chimpanzee lineages. They supply additional evidence of long-term conservation of megabase-scale recombination rates accompanied by rapid turnover of hotspots. Together, these findings shed new light on the evolutionary, functional, and disease implications of gBGC. The phastBias program and our predicted tracts are freely available. © 2013 Capra et al

arXiv.org e-Print Archive

Cold Spring Harbor Laboratory Institutional Repository

eScholarship - University of California

D-Scholarship@Pitt

FigShare

Phylodynamic analysis of porcine circovirus type 2: Methodological approach and datasets

Author: Cortey Martì
Drigo Michele
Franzo Giovanni
Hughes Joseph
Segalés Joaquim
Publication venue: 'Elsevier BV'
Publication date: 01/01/2016
Field of study

Since its first description, PCV2 has emerged as one of the most economically relevant diseases for the swine industry. Despite the introduction of vaccines effective in controlling clinical syndromes, PCV2 spread was not prevented and some potential evidences of vaccine immuno escape have recently been reported (“Complete genome sequence of a novel porcine circovirus type 2b variant present in cases of vaccine failures in the United States” (Xiao and Halbur, 2012) [1], “Genetic and antigenic characterization of a newly emerging porcine circovirus type 2b mutant first isolated in cases of vaccine failure in Korea” (Seo et al., 2014) [2]). In this article, we used a collection of PCV2 full genomes, provided in the present manuscript, and several phylogentic, phylodynamic and bioinformatic methods to investigate different aspects of PCV2 epidemiology, history and evolution (more thoroughly described in “PHYLODYNAMIC ANALYSIS of PORCINE CIRCOVIRUS TYPE 2 REVEALS GLOBAL WAVES of EMERGING GENOTYPES and the CIRCULATION of RECOMBINANT FORMS”[3]). The methodological approaches used to consistently detect recombiantion events and estimate population dymanics and spreading patterns of rapidly evolving ssDNA viruses are herein reported. Programs used are described and original scripts have been provided. Ensembled databases used are also made available. These consist of a broad collection of complete genome sequences (i.e. 843 sequences; 63 complete genomes of PCV2a, 310 of PCV2b, 4 of PCV2c, 217 of PCV2d, 64 of CRF01, 140 of CRF02 and 45 of CRF03.), divided in differnt ORF (i.e. ORF1, ORF2 and intergenic regions), of PCV2 genotypes and major Circulating Recombinat Forms (CRF) properly annotated with respective collection data and country. Globally, all of these data can be used as a starting point for further studies and for classification purpose

Archivio istituzionale della ricerca - Università di Padova

Enlighten

Turnip mosaic potyvirus probably first spread to Eurasian brassica crops from wild orchids about 1000 years ago

Author: A Gibbs
A Luo
Adrian J. Gibbs
AJ Drummond
AJ Drummond
AJ Drummond
AJ Gibbs
AJ Gibbs
AJ Gibbs
AJ Gibbs
ALN. Rao
BY Chung
CC Chen
CE Jenner
CE Jenner
CE Jenner
D Martin
D Posada
DE Lesemann
DH Huson
Dietrich Lesemann
DP Martin
DW Pallett
E Kozubek
EC Holmes
GF Weiller
HE Simmons
Heinrich-Josef Vetten
Huy D. Nguyen
HY Wang
I Pagán
I Pagán
J Chen
J Chen
John A. Walsh
K Ohshima
K Ohshima
K Ohshima
K Tomimura
K Tomimura
Kazusato Ohshima
KP Schliep
KS Lole
MA Larkin
MJ Gibbs
MO Salminen
MW Gardner
N Suehiro
O Nicolas
R Pinhasi
R Sanjuán
S Farzadfar
S Fuji
S Fuji
S Guindon
S Korkmaz
Sebastián Duchêne
Simon Y. W. Ho
Smith Maynard
SYW Ho
SYW Ho
TA Hall
Yasuhiro Tomitaka
Z Tan
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 06/02/2013
Field of study

Turnip mosaic potyvirus (TuMV) is probably the most widespread and damaging virus that infects cultivated brassicas worldwide. Previous work has indicated that the virus originated in western Eurasia, with all of its closest relatives being viruses of monocotyledonous plants. Here we report that we have identified a sister lineage of TuMV-like potyviruses (TuMV-OM) from European orchids. The isolates of TuMV-OM form a monophyletic sister lineage to the brassica-infecting TuMVs (TuMV-BIs), and are nested within a clade of monocotyledon-infecting viruses. Extensive host-range tests showed that all of the TuMV-OMs are biologically similar to, but distinct from, TuMV-BIs and do not readily infect brassicas. We conclude that it is more likely that TuMV evolved from a TuMV-OM-like ancestor than the reverse. We did Bayesian coalescent analyses using a combination of novel and published sequence data from four TuMV genes [helper component-proteinase protein (HC-Pro), protein 3(P3), nuclear inclusion b protein (NIb), and coat protein (CP)]. Three genes (HC-Pro, P3, and NIb), but not the CP gene, gave results indicating that the TuMV-BI viruses diverged from TuMV-OMs around 1000 years ago. Only 150 years later, the four lineages of the present global population of TuMV-BIs diverged from one another. These dates are congruent with historical records of the spread of agriculture in Western Europe. From about 1200 years ago, there was a warming of the climate, and agriculture and the human population of the region greatly increased. Farming replaced woodlands, fostering viruses and aphid vectors that could invade the crops, which included several brassica cultivars and weeds. Later, starting 500 years ago, inter-continental maritime trade probably spread the TuMV-BIs to the remainder of the world

Public Library of Science (PLOS)

Warwick Research Archives Portal Repository

The Australian National University

University of Melbourne Institutional Repository

FigShare

Organellar inheritance in the green lineage: insights from Ostreococcus tauri

Author: Adam Eyre-Walker
Baur
Birky
Bonen
Boynton
Bruen
Correns
De Clerck
Derelle
Duret
Grimsley
Guindon
Gwenael Piganeau
Hasegawa
Hill
Hill
Houliston
Hua
Huang
Hurst
Hutson
Jancek
Kurtz
Larkin
Lewis
Lewontin
Li
MacAlpine
Marin
Marshall
Maréchal
Maynard Smith
McVean
Miyamura
Muller
Nei
Ness
Olson
Piganeau
Posada
Posada
R Development Core Team
Robbens
Rodríguez-Ezpeleta
Romain Blanc-Mathieu
Sager
Sager
Sager
Simpson
Sophie Sanchez-Ferandin
Städler
Sun
Sung
Swofford
Tamura
Tamura
Tsai
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2013
Field of study

Along the green lineage (Chlorophyta and Streptophyta), mitochondria and chloroplast are mainly uniparentally transmitted and their evolution is thus clonal. The mode of organellar inheritance in their ancestor is less certain. The inability to make clear phylogenetic inference is partly due to a lack of information for deep branching organisms in this lineage. Here, we investigate organellar evolution in the early branching green alga Ostreococcus tauri using population genomics data from the complete mitochondrial and chloroplast genomes. The haplotype structure is consistent with clonal evolution in mitochondria, while we find evidence for recombination in the chloroplast genome. The number of recombination events in the genealogy of the chloroplast suggests that recombination, and thus biparental inheritance, is not rare. Consistent with the evidence of recombination, we find that the ratio of the number of nonsynonymous to the synonymous polymorphisms per site is lower in chloroplast than in the mitochondria genome. We also find evidence for the segregation of two selfish genetic elements in the chloroplast. These results shed light on the role of recombination and the evolutionary history of organellar inheritance in the green lineage

arXiv.org e-Print Archive

Sussex Research Online

Recombination rate and selection strength in HIV intra-patient evolution

Author: A Eyre-Walker
A Jung
AE Jetzt
AR Templeton
AS Perelson
B Asquith
C Charpentier
C Kuiken
C Neuhauser
Christophe Fraser
CL Althaus
CTT Edwards
D Shriner
D Shriner
DJ Wilson
DN Levy
E Jones
E Simon-Loriere
G McVean
GA Bazykin
HY Lee
IM Rouzine
IM Rouzine
J Archer
J Chen
J Hunter
J Zhuang
JH Gillespie
L Chen
M Kimura
N Barton
N Barton
R Nielsen
R Shankarappa
RA Kaslow
RA Neher
RC Edgar
RC Griffiths
Richard A. Neher
RR Hudson
S Duffy
SA Seibert
SL Liu
T Leitner
T Nora
T Oliphant
Thomas Leitner
WJ Ewens
Y Yamaguchi
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2009
Field of study

The evolutionary dynamics of HIV during the chronic phase of infection is driven by the host immune response and by selective pressures exerted through drug treatment. To understand and model the evolution of HIV quantitatively, the parameters governing genetic diversification and the strength of selection need to be known. While mutation rates can be measured in single replication cycles, the relevant effective recombination rate depends on the probability of coinfection of a cell with more than one virus and can only be inferred from population data. However, most population genetic estimators for recombination rates assume absence of selection and are hence of limited applicability to HIV, since positive and purifying selection are important in HIV evolution. Here, we estimate the rate of recombination and the distribution of selection coefficients from time-resolved sequence data tracking the evolution of HIV within single patients. By examining temporal changes in the genetic composition of the population, we estimate the effective recombination to be r=1.4e-5 recombinations per site and generation. Furthermore, we provide evidence that selection coefficients of at least 15% of the observed non-synonymous polymorphisms exceed 0.8% per generation. These results provide a basis for a more detailed understanding of the evolution of HIV. A particularly interesting case is evolution in response to drug treatment, where recombination can facilitate the rapid acquisition of multiple resistance mutations. With the methods developed here, more precise and more detailed studies will be possible, as soon as data with higher time resolution and greater sample sizes is available.Comment: to appear in PLoS Computational Biolog

Public Library of Science (PLOS)

CiteSeerX

edoc

Bayesian modeling of recombination events in bacterial populations

Author: A Baldwin
A Baldwin
A Baldwin
A Rambaut
A Skalka
Adam Baldwin
C Fraser
Chris Dowson
CP Robert
CX Chan
D Falush
D Husmeier
D Posada
DJ Hand
E Mahenthiralingam
E Mahenthiralingam
EHL Aarts
Eshwar Mahenthiralingam
FM Cohan
J Corander
J Corander
J Corander
J Corander
J Felsenstein
J Hein
J Maynard Smith
JG Lawrence
JS Sinsheimer
Jukka Corander
JV Braun
M Arenas
M Hasegawa
MA Suchard
MJ Schervish
NC Grassly
P Marttinen
Pekka Marttinen
R Jain
RA Elton
S Sawyer
SA Sisson
VN Minin
VN Minin
William P Hanage
WJ Wiersinga
WP Hanage
X Didelot
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Background: We consider the discovery of recombinant segments jointly with their origins within multilocus DNA sequences from bacteria representing heterogeneous populations of fairly closely related species. The currently available methods for recombination detection capable of probabilistic characterization of uncertainty have a limited applicability in practice as the number of strains in a data set increases. Results: We introduce a Bayesian spatial structural model representing the continuum of origins over sites within the observed sequences, including a probabilistic characterization of uncertainty related to the origin of any particular site. To enable a statistically accurate and practically feasible approach to the analysis of large-scale data sets representing a single genus, we have developed a novel software tool (BRAT, Bayesian Recombination Tracker) implementing the model and the corresponding learning algorithm, which is capable of identifying the posterior optimal structure and to estimate the marginal posterior probabilities of putative origins over the sites. Conclusion: A multitude of challenging simulation scenarios and an analysis of real data from seven housekeeping genes of 120 strains of genus Burkholderia are used to illustrate the possibilities offered by our approach. The software is freely available for download at URL http://web.abo.fi/fak/ mnf//mate/jc/software/brat.html

Online Research @ Cardiff

Springer - Publisher Connector