Search CORE

34 research outputs found

Using phylogenetic models to characterise natural selection from molecular data

Author: Tamuri AU
Publication venue: UCL (University College London)
Publication date: 28/12/2012
Field of study

Molecular phylogenetics is the application of mathematical and computational techniques to analyse molecular sequences and make inferences about their evolutionary relationships. There is substantial interest in developing probabilistic models of evolution that effectively detect, locate and characterise different type of selection in genes, driven by the relationship of selection to protein structural constraints and function. In this thesis we propose novel approaches that can be used not only to detect the presence of selection but also to characterise its kind and strength. We first develop a phylogenetic method to identify changes in selective constraints and use it to identify those mutations that allow influenza viruses from avian origin to spread successfully in the human population. The model explicitly takes into account differences in the equilibrium frequencies of amino acids in different hosts and locations. We then use these results to develop a measure of the level of adaptation of any given influenza virus sequence to the selective constraints imposed by avian or human hosts. We show that adaptation to the human host has been gradual when applied to historical data. Our results also indicate that the 1918 influenza virus had undergone a period of pre-adaptation prior to 1918 when compared to the adaptation of other avian influenza viruses. Finally, we develop a codon-based model of mutation-selection to estimate the distribution of selection coefficients and find that we can recover distributions similar to those expected by population genetics theory. We show that the distribution of mammalian mitochondrial proteins is bimodal with the majority of mutations being deleterious. When we apply the model to the PB2 influenza polymerase protein following a host shift from birds to humans, we find a trimodal distribution with a significant proportion of advantageous substitutions

UCL Discovery

Haplotype assignment of longitudinal viral deep sequencing data using covariation of variant frequencies.

Author: Breuer J
Breuer J
Fullerton C.
Fullerton C.
Goldstein RA
Goldstein RA
Griffiths P
Griffiths P
Pang J
Pang J
Roy S
Roy S
Tamuri AU
Tamuri AU
Venturini C
Venturini C
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2022
Field of study

Longitudinal deep sequencing of viruses can provide detailed information about intra-host evolutionary dynamics including how viruses interact with and transmit between hosts. Many analyses require haplotype reconstruction, identifying which variants are co-located on the same genomic element. Most current methods to perform this reconstruction are based on a high density of variants and cannot perform this reconstruction for slowly evolving viruses. We present a new approach, HaROLD (HAplotype Reconstruction Of Longitudinal Deep sequencing data), which performs this reconstruction based on identifying co-varying variant frequencies using a probabilistic framework. We illustrate HaROLD on both RNA and DNA viruses with synthetic Illumina paired read data created from mixed human cytomegalovirus (HCMV) and norovirus genomes, and clinical datasets of HCMV and norovirus samples, demonstrating high accuracy, especially when longitudinal samples are available

LSBU Research Open

Rapid morphological evolution in placental mammals post-dates the origin of the crown group.

Author: Dos Reis M
Ferguson-Gow H
Goswami A
Halliday TJD
Tamuri AU
Yang Z
Publication venue: The Royal Society
Publication date: 01/03/2019
Field of study

Resolving the timing and pattern of early placental mammal evolution has been confounded by conflict among divergence date estimates from interpretation of the fossil record and from molecular-clock dating studies. Despite both fossil occurrences and molecular sequences favouring a Cretaceous origin for Placentalia, no unambiguous Cretaceous placental mammal has been discovered. Investigating the differing patterns of evolution in morphological and molecular data reveals a possible explanation for this conflict. Here, we quantified the relationship between morphological and molecular rates of evolution. We show that, independent of divergence dates, morphological rates of evolution were slow relative to molecular evolution during the initial divergence of Placentalia, but substantially increased during the origination of the extant orders. The rapid radiation of placentals into a highly morphologically disparate Cenozoic fauna is thus not associated with the origin of Placentalia, but post-dates superordinal origins. These findings predict that early members of major placental groups may not be easily distinguishable from one another or from stem eutherians on the basis of skeleto-dental morphology. This result supports a Late Cretaceous origin of crown placentals with an ordinal-level adaptive radiation in the early Paleocene, with the high relative rate permitting rapid anatomical change without requiring unreasonably fast molecular evolutionary rates. The lack of definitive Cretaceous placental mammals may be a result of morphological similarity among stem and early crown eutherians, providing an avenue for reconciling the fossil record with molecular divergence estimates for Placentalia

University of Birmingham Research Portal

ZENODO

Dryad Digital Repository (Duke University)

UCL Discovery

Electronic Archiving System

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Queen Mary Research Online

Finding Direction in the Search for Selection.

Author: AL Halpern
AL Hughes
AL Hughes
AL Hughes
AM Wensing
AU Tamuri
AU Tamuri
AU Tamuri
B Knudsen
B Murrell
BW Matthews
CJ Creevey
GE Crooks
Grant Thiltgen
KS Dorman
L Carroll
L Valen Van
M Hasegawa
M Kimura
M Reis dos
M Reis dos
Mario dos Reis
MK Hughes
N Rodrigue
R Nielsen
Richard A. Goldstein
S Bonhoeffer
S Woolley
SJ Spielman
SLK Pond
SLK Pond
T Endo
T Tanaka
W Messier
X Gu
Z Yang
Z Yang
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Tests for positive selection have mostly been developed to look for diversifying selection where change away from the current amino acid is often favorable. However, in many cases we are interested in directional selection where there is a shift toward specific amino acids, resulting in increased fitness in the species. Recently, a few methods have been developed to detect and characterize directional selection on a molecular level. Using the results of evolutionary simulations as well as HIV drug resistance data as models of directional selection, we compare two such methods with each other, as well as against a standard method for detecting diversifying selection. We find that the method to detect diversifying selection also detects directional selection under certain conditions. One method developed for detecting directional selection is powerful and accurate for a wide range of conditions, while the other can generate an excessive number of false positives

Crossref

Springer - Publisher Connector

UCL Discovery

PubMed Central

Queen Mary Research Online

FigShare

Epistasis not needed to explain low dN/dS

Author: AL Halpern
AS Kondrashov
AU Tamuri
David M. McCandlish
DM Fowler
Etienne Rajon
J da Silva
Joshua B. Plotkin
MA DePristo
MLM Salverda
MS Breen
N Rodrigue
Premal Shah
S Kryazhimskiy
SC Choi
TF Hansen
WH Li
Yang Ding
Z Yang
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/12/2012
Field of study

An important question in molecular evolution is whether an amino acid that occurs at a given position makes an independent contribution to fitness, or whether its effect depends on the state of other loci in the organism's genome, a phenomenon known as epistasis. In a recent letter to Nature, Breen et al. (2012) argued that epistasis must be "pervasive throughout protein evolution" because the observed ratio between the per-site rates of non-synonymous and synonymous substitutions (dN/dS) is much lower than would be expected in the absence of epistasis. However, when calculating the expected dN/dS ratio in the absence of epistasis, Breen et al. assumed that all amino acids observed in a protein alignment at any particular position have equal fitness. Here, we relax this unrealistic assumption and show that any dN/dS value can in principle be achieved at a site, without epistasis. Furthermore, for all nuclear and chloroplast genes in the Breen et al. dataset, we show that the observed dN/dS values and the observed patterns of amino acid diversity at each site are jointly consistent with a non-epistatic model of protein evolution.Comment: This manuscript is in response to "Epistasis as the primary factor in molecular evolution" by Breen et al. Nature 490, 535-538 (2012

arXiv.org e-Print Archive

Crossref

Cold Spring Harbor Laboratory Institutional Repository

INRIA a CCSD electronic archive server

HAL Descartes

Human cytomegalovirus haplotype reconstruction reveals high diversity due to superinfection and evidence of within-host recombination

Author: Breuer J
Bryant JM
Cudini J
Depledge DP
Goldstein RA
Houldcroft CJ
Roy S
Tamuri AU
Tutill H
Veys P
Williams R
Worth AJJ
Publication venue
Publication date: 19/03/2019
Field of study

Recent sequencing efforts have led to estimates of human cytomegalovirus (HCMV) genome-wide intrahost diversity that rival those of persistent RNA viruses [Renzette N, Bhattacharjee B, Jensen JD, Gibson L, Kowalik TF (2011) PLoS Pathog 7:e1001344]. Here, we deep sequence HCMV genomes recovered from single and longitudinally collected blood samples from immunocompromised children to show that the observations of high within-host HCMV nucleotide diversity are explained by the frequent occurrence of mixed infections caused by genetically distant strains. To confirm this finding, we reconstructed within-host viral haplotypes from short-read sequence data. We verify that within-host HCMV nucleotide diversity in unmixed infections is no greater than that of other DNA viruses analyzed by the same sequencing and bioinformatic methods and considerably less than that of human immunodeficiency and hepatitis C viruses. By resolving individual viral haplotypes within patients, we reconstruct the timing, likely origins, and natural history of superinfecting strains. We uncover evidence for within-host recombination between genetically distinct HCMV strains, observing the loss of the parental virus containing the nonrecombinant fragment. The data suggest selection for strains containing the recombinant fragment, generating testable hypotheses about HCMV evolution and pathogenesis. These results highlight that high HCMV diversity present in some samples is caused by coinfection with multiple distinct strains and provide reassurance that within the host diversity for single-strain HCMV infections is no greater than for other herpesviruses

UCL Discovery

Glycosylation Focuses Sequence Variation in the Influenza A Virus H1 Hemagglutinin Globular Domain

Author: AJ Caton
AJ Petrescu
AS Gambaryan
AU Tamuri
BP Blackburne
CC Wang
CJ Wei
Darrell E. Hurt
DC Wiley
DC Wiley
DJ Vigerust
DN Hebert
E Tsuchiya
Esteban Domingo
G Neumann
G Russ
HD Klenk
IA Wilson
IT Schulze
J Stevens
Jack R. Bennink
JJ Skehel
JJ Skehel
JL Cherry
JM Nicholls
Jonathan W. Yewdell
JW Yewdell
JW Yewdell
KL Deshpande
L Kasturi
M Igarashi
M Zhang
P Gallagher
P Wessa
Pere Puigbò
PJ Gallagher
R Daniels
R Ohuchi
R Wagner
R Xu
RC Edgar
RJ Garten
S Ben-Dor
Scott E. Hensley
Suman R. Das
SY Mir-Shekari
T Francis
T Francis Jr
W Breuer
W Gerhard
Y Itoh
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

Antigenic drift in the influenza A virus hemagglutinin (HA) is responsible for seasonal reformulation of influenza vaccines. Here, we address an important and largely overlooked issue in antigenic drift: how does the number and location of glycosylation sites affect HA evolution in man? We analyzed the glycosylation status of all full-length H1 subtype HA sequences available in the NCBI influenza database. We devised the “flow index” (FI), a simple algorithm that calculates the tendency for viruses to gain or lose consensus glycosylation sites. The FI predicts the predominance of glycosylation states among existing strains. Our analyses show that while the number of glycosylation sites in the HA globular domain does not influence the overall magnitude of variation in defined antigenic regions, variation focuses on those regions unshielded by glycosylation. This supports the conclusion that glycosylation generally shields HA from antibody-mediated neutralization, and implies that fitness costs in accommodating oligosaccharides limit virus escape via HA hyperglycosylation

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Exploring the Evolution of Novel Enzyme Functions within Structurally Defined Protein Superfamilies

Author: A Andreeva
AE Todd
AL Cuff
Alison L. Cuff
AU Tamuri
BE Engelhardt
BH Dessailly
C Chothia
CA Orengo
Christine A. Orengo
DA Benson
DE Almonacid
DM Schmidt
DS Tawfik
G Caetano-Anolles
GA Reeves
Gemma L. Holliday
GJ Bartlett
GJ Binford
GL Holliday
GL Holliday
GL Holliday
HS Park
I Nobeli
Ian Sillitoe
J Ruan
J Shi
Janet M. Thornton
JP Overington
K Katoh
LH Greene
M Bashton
M Groll
M Xu
ME Glasner
MT Murakami
N Furnham
N Gallastegui
Nicholas Furnham
NJ Mulder
O Khersonsky
PF Gherardini
PJ O'Brien
Roman A. Laskowski
SC Pegg
SD Brown
SF Altschul
W Heinemeyer
WS Valdar
Yanay Ofran
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

In order to understand the evolution of enzyme reactions and to gain an overview of biological catalysis we have combined sequence and structural data to generate phylogenetic trees in an analysis of 276 structurally defined enzyme superfamilies, and used these to study how enzyme functions have evolved. We describe in detail the analysis of two superfamilies to illustrate different paradigms of enzyme evolution. Gathering together data from all the superfamilies supports and develops the observation that they have all evolved to act on a diverse set of substrates, whilst the evolution of new chemistry is much less common. Despite that, by bringing together so much data, we can provide a comprehensive overview of the most common and rare types of changes in function. Our analysis demonstrates on a larger scale than previously studied, that modifications in overall chemistry still occur, with all possible changes at the primary level of the Enzyme Commission (E.C.) classification observed to a greater or lesser extent. The phylogenetic trees map out the evolutionary route taken within a superfamily, as well as all the possible changes within a superfamily. This has been used to generate a matrix of observed exchanges from one enzyme function to another, revealing the scale and nature of enzyme evolution and that some types of exchanges between and within E.C. classes are more prevalent than others. Surprisingly a large proportion (71%) of all known enzyme functions are performed by this relatively small set of 276 superfamilies. This reinforces the hypothesis that relatively few ancient enzymatic domain superfamilies were progenitors for most of the chemistry required for life

Public Library of Science (PLOS)

CiteSeerX

Crossref

LSHTM Research Online

Directory of Open Access Journals

PubMed Central

UCL Discovery

FigShare

Modeling HIV-1 Drug Resistance as Episodic Directional Selection

The evolution of substitutions conferring drug resistance to HIV-1 is both episodic, occurring when patients are on antiretroviral therapy, and strongly directional, with site-specific resistant residues increasing in frequency over time. While methods exist to detect episodic diversifying selection and continuous directional selection, no evolutionary model combining these two properties has been proposed. We present two models of episodic directional selection (MEDS and EDEPS) which allow the a priori specification of lineages expected to have undergone directional selection. The models infer the sites and target residues that were likely subject to directional selection, using either codon or protein sequences. Compared to its null model of episodic diversifying selection, MEDS provides a superior fit to most sites known to be involved in drug resistance, and neither one test for episodic diversifying selection nor another for constant directional selection are able to detect as many true positives as MEDS and EDEPS while maintaining acceptable levels of false positives. This suggests that episodic directional selection is a better description of the process driving the evolution of drug resistance

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Stellenbosch University SUNScholar Repository

FigShare

New Insights into the Phylogeny and Molecular Classification of Nicotinamide Mononucleotide Deamidases

Author: Agustín Sola-Carvajal
Ana Belén Martínez-Moñino
AU Tamuri
B Martin
BC Smith
BJ Pearce
BM Olivera
C Combet
C Kaimer
C Quast
D Hillyard
DA Rodionov
DM Kinney
EF Pettersen
F Glaser
Francisco García-Carmona
G Schwarz
G Schwarz
G Sánchez-Carrón
G Sánchez-Carrón
GE Crooks
Guiomar Sánchez-Carrón
H Takami
HC Friedmann
Hideto Takami
I Letunic
JD Nichols
JD Thompson
JW Foster
JW Foster
L Cialabrini
L Galeazzi
L Schrodinger
M Punta
MT Liu
MV Han
P Gouet
R Thomsen
RD Finn
S Karlin
SF Altschul
T Imai
U Pieper
UE Park
Valerie de Crécy-Lagard
W Cheng
WF De Azevedo Jr.
Álvaro Sánchez-Ferrer
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 05/12/2013
Field of study

Nicotinamide mononucleotide (NMN) deamidase is one of the key enzymes of the bacterial pyridine nucleotide cycle (PNC). It catalyzes the conversion of NMN to nicotinic acid mononucleotide, which is later converted to NAD+ by entering the Preiss-Handler pathway. However, very few biochemical data are available regarding this enzyme. This paper represents the first complete molecular characterization of a novel NMN deamidase from the halotolerant and alkaliphilic bacterium Oceanobacillus iheyensis (OiPncC). The enzyme was active over a broad pH range, with an optimum at pH 7.4, whilst maintaining 90 % activity at pH 10.0. Surprisingly, the enzyme was quite stable at such basic pH, maintaining 61 % activity after 21 days. As regard temperature, it had an optimum at 65 °C but its stability was better below 50 °C. OiPncC was a Michaelian enzyme towards its only substrate NMN, with a Km value of 0.18 mM and a kcat/Km of 2.1 mM-1 s-1. To further our understanding of these enzymes, a complete phylogenetic and structural analysis was carried out taking into account the two Pfam domains usually associated with them (MocF and CinA). This analysis sheds light on the evolution of NMN deamidases, and enables the classification of NMN deamidases into 12 different subgroups, pointing to a novel domain architecture never before described. Using a Logo representation, conserved blocks were determined, providing new insights on the crucial residues involved in the binding and catalysis of both CinA and MocF domains. The analysis of these conserved blocks within new protein sequences could permit the more efficient data curation of incoming NMN deamidases

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

JAMSTEC Repository

FigShare