49 research outputs found

SNPs Occur in Regions with Less Genomic Sequence Conservation

Author: A Grimson, A Siepel, BP Lewis, CF Baer, CM Wade, D Chasman, E Pennisi, ES Lander, FW Allendorf, GE Crooks, H Zhang, Ilya Ruvinsky, J Stapley, JC Venter, John C. Castle, JV Chamary, K Chen, L Cartegni, M Lynch, M Stratton, MA Saunders, MP Miller, PA Morin, RH Waterston, RM Durbin, RM Kuhn, ST Sherry, V Matys, WG Fairbrother
Publication venue: Public Library of Science
Publication date: 06/06/2011
Rates of SNPs (single nucleotide polymorphisms) and cross-species genomic sequence conservation reflect intra- and inter-species variation, respectively. Here, I report SNP rates and genomic sequence conservation adjacent to mRNA processing regions and show that, as expected, more SNPs occur in less conserved regions and that functional regions have fewer SNPs. Results are confirmed using both mouse and human data. Regions include protein start codons, 3′ splice sites, 5′ splice sites, protein stop codons, predicted miRNA binding sites, and polyadenylation sites. Throughout, SNP rates are lower and conservation is higher at regulatory sites. Within coding regions, SNP rates are highest and conservation is lowest at codon position three, and the fewest SNPs are found at codon position two, reflecting codon degeneracy for amino acid encoding. Exon splice sites show high conservation and very low SNP rates, reflecting both splicing signals and protein coding. Relaxed constraint on the third codon position is seen most clearly when exonic SNP rates are separated by intron phase. At polyadenylation sites, a peak of conservation and a low SNP rate occur from 30 to 17 nt preceding the site. This region is highly enriched for the sequence AAUAAA, reflecting the location of the conserved polyA signal. miRNA 3′ UTR target sites are predicted by incorporating interspecies genomic sequence conservation; SNP rates are low in these sites, again showing fewer SNPs in conserved regions. Together, these results confirm that SNPs, reflecting recent genetic variation, occur more frequently in regions with less evolutionary conservation.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Inferring stabilizing mutations from protein phylogenies : application to influenza hemagglutinin

Author: A Akasako, A Cao, A Martin, A Mitraki, A Rambaut, AA Pakula, AR Dinner, AR Fersht, AS Yang, AV Gribenko, B Steipe, BM Broome, C Pal, C Park, CB Anfinsen, CB Do, CM Dobson, CT Saunders, D Gilis, D Perl, D Shortle, DA Cowan, DA Drummond, DD Loeb, DM Taverna, E Capriotti, E Hoffmann, E van Nimwegen, EPC Rocha, Eugene I. Shakhnovich, F Chiti, F Ronquist, G Parisi, GG Brownlee, H Akashi, H Li, H Schindelin, H Zhao, H Zhou, HW Hellinga, I Keller, IE Sanchez, IMP del Pino, J Felsenstein, J Kyte, JA Wells, JB Garrett, JD Bloom, Jesse D. Bloom, JL Thorne, JM Koshi, JP Huelsenbeck, JR Cochran, JR Lepock, JV Chamary, K Ishikawa, K Katayanagi, KA Bava, KA Gray, KB Zeldovich, KJ Szretter, KL Maxwell, L Giver, L Serrano, M Dai, M Haruki, M Jacob, M Lehmann, M Matrosovich, M Ueda, M Wunderlich, Matthew J. Glassman, MD Kumar, MF Sippl, MM Garcia-Mira, MM Gromiha, MP Canadillas, MS Fornasari, MW Pantoliano, N Amin, N Goldman, N Lartillot, N Tong, R Godoy-Ruiz, R Guerois, R Rabadan, R Sakaue, RC Edgar, RJ Ellis, S Govindarajan, S Kimura, S Nakajima, S Sato, SC Choi, SH White, SJ Gamblin, SS Jaswal, U Bastolla, V Parthiban, VG Dugan, VN Uversky, W Besenmatter, WS Sandberg, WSW Wong, XJ Zhang, Y Bao, YY Tseng, Z Chen
Publication venue: International Society for Computational Biology
Publication date: 01/04/2009
One selection pressure shaping sequence evolution is the requirement that a protein fold with sufficient stability to perform its biological functions. We present a conceptual framework that explains how this requirement causes the probability that a particular amino acid mutation is fixed during evolution to depend on its effect on protein stability. We mathematically formalize this framework to develop a Bayesian approach for inferring the stability effects of individual mutations from homologous protein sequences of known phylogeny. This approach is able to predict published experimentally measured mutational stability effects (ΔΔG values) with an accuracy that exceeds both a state-of-the-art physicochemical modeling program and the sequence-based consensus approach. As a further test, we use our phylogenetic inference approach to predict stabilizing mutations to influenza hemagglutinin. We introduce these mutations into a temperature-sensitive influenza virus with a defect in its hemagglutinin gene and experimentally demonstrate that some of the mutations allow the virus to grow at higher temperatures. Our work therefore describes a powerful new approach for predicting stabilizing mutations that can be successfully applied even to large, complex proteins such as hemagglutinin. This approach also makes a mathematical link between phylogenetics and experimentally measurable protein properties, potentially paving the way for more accurate analyses of molecular evolution.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Caltech Authors

Differential Trends in the Codon Usage Patterns in HIV-1 Genes

Author: A Rambaut, A van Weringh, A Wagner, AC Rencher, AG Fisher, AI Dayton, AL Brass, AO Urrutia, Aridaman Pandit, B Korber, BD Greenbaum, BS Taylor, D Trono, DA Benson, DC Krakauer, E Eisenberg, E Kotsopoulou, EHM Wong, EP Rocha, F Wright, G Jenkins, G Perriere, GH Kijak, GM Jenkins, H Suzuki, I Ahn, J He, J Woo, JB Lucks, JB Plotkin, JL Anderson, JV Chamary, L Deml, L Duret, M Choisy, M Pavon-Eternod, M Worobey, MA Gilchrist, MA Martinez, N Stoletzki, Philip Kim, PL Meintjes, R Grantham, R Hershberg, R Shankarappa, RH Miller, S Andre, S Williamson, SJ Arrigo, SL Kosakovsky Pond, SM Wolinsky, Somdatta Sinha, T Ikemura, V Muller, Y Benjamini, Y Nakamura
Publication venue: Public Library of Science
Publication date: 01/01/2011
Host-pathogen interactions underlie one of the most complex evolutionary phenomena, resulting in continual adaptive genetic changes, where pathogens exploit the host's molecular resources for growth and survival, while hosts try to eliminate the pathogen. Deciphering the molecular basis of host-pathogen interactions is useful in understanding the factors governing pathogen evolution and disease propagation. In a host-pathogen context, a balance between mutation, selection, and genetic drift is known to maintain codon bias in both organisms. Studies revealing determinants of the bias and its dynamics are central to the understanding of host-pathogen evolution. We considered the Human Immunodeficiency Virus (HIV) type 1 and its human host to search for evolutionary signatures in the viral genome. Positive selection is known to dominate intra-host evolution of HIV-1, whereas high genetic variability underlies the belief that neutral processes drive inter-host differences. In this study, we analyze the codon usage patterns of HIV-1 genomes across all subtypes and clades sequenced over a period of 23 years. We show the presence of unique temporal correlations in the codon bias of three HIV-1 genes, illustrating differential adaptation of the HIV-1 genes towards the host-preferred codons. Our results point towards gene-specific translational selection as an important force driving the evolution of HIV-1 at the population level.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

A Universal Trend of Reduced mRNA Stability near the Translation-Initiation Site in Prokaryotes and Eukaryotes

Author: A Eyre-Walker, A Tats, AA Komar, AE Vinogradov, AI Su, AV Komarova, B Lemos, Berend Snel, C Hoede, C Kimchi-Sarfaty, C Pal, Claus O. Wilke, CM Stenstrom, DA Drummond, DH Mathews, EI Gonzalez de Valdivia, F Wright, FCP Holstege, G Kudla, G Qing, G Zhang, H Akashi, H Chen, H Musto, HC Wang, IL Hofacker, J Mandel, J Sanchez, J Shine, JE Brock, JL Parmley, JP Etchegaray, JV Chamary, K Yamagishi, KB Zeldovich, KE Griswold, L Duret, L Katz, M Eames, M Kozak, M Stenico, MW Covert, N Galtier, N Stoletzki, P Cortazzo, P Goymer, PG Higgs, PM Sharp, S Nakagawa, SA Shabalina, T Ikemura, T Warnecke, T Zhou, TA Thanaraj, Tong Zhou, V Stolc, V Vimberg, W Seffens, Wanjun Gu, YI Wolf, YM Zalucki, Z Yang
Publication venue: Public Library of Science
Publication date: 01/02/2010
Recent studies have suggested that the thermodynamic stability of mRNA secondary structure near the start codon can regulate translation efficiency in Escherichia coli, and that translation is more efficient the less stable the secondary structure. We survey the complete genomes of 340 species for signals of reduced mRNA secondary structure near the start codon. Our analysis includes bacteria, archaea, fungi, plants, insects, fishes, birds, and mammals. We find that nearly all species show evidence for reduced mRNA stability near the start codon. The reduction in stability generally increases with increasing genomic GC content. In prokaryotes, the reduction also increases with decreasing optimal growth temperature. Within genomes, there is variation in the stability among genes, and this variation correlates with gene GC content, codon bias, and gene expression level. For birds and mammals, however, we do not find a genome-wide trend of reduced mRNA stability near the start codon. Yet the most GC-rich genes in these organisms do show such a signal. We conclude that reduced stability of the mRNA secondary structure near the start codon is a universal feature of all cellular life. We suggest that the origin of this reduction is selection for efficient recognition of the start codon by initiator-tRNA.

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks

Depletion of somatic mutations in splicing-associated sequences in cancer genomes

Author: A Busch, A Woolfe, B Schuster-Bockler, BJ Blencowe, Cancer Genome Atlas Research Network, DA Denisov, DB Carlini, E Sebestyen, EF Caceres, EP Rocha, F Pagani, F Supek, H Jung, J-V Chamary, JJ Gartner, JL Parmley, JV Chamary, L Chen, Laurence D. Hurst, LB Alexandrov, LD Hurst, M Raponi, M Secrier, MS Lawrence, N Waddell, Nizar N. Batada, O Soukarieh, P Julien, P Polak, PA Futreal, R Savisaar, R Soemedi, RC Hunt, RD Schreiber, RS Hansen, S Kogan, S Nik-Zainal, S Subramanian, SH Lelieveld, T Derrien, T Khare, T Warnecke, VA Blomen, WG Fairbrother, X Chen, X Wu, XM Wu, Y Xing, ZE Sauna
Publication venue: Springer Science and Business Media LLC
Publication date: 01/11/2017
Background: An important goal of cancer genomics is to systematically identify cancer-causing mutations. A common approach is to identify sites with high ratios of non-synonymous to synonymous mutations; however, if synonymous mutations are under purifying selection, this methodology leads to identification of false-positive mutations. Here, using synonymous somatic mutations (SSMs) identified in over 4000 tumours across 15 different cancer types, we sought to test this assumption by focusing on coding regions required for splicing. Results: Exon flanks, which are enriched for sequences required for splicing fidelity, have ~17% lower SSM density compared to exonic cores, even after excluding canonical splice sites. While it is impossible to eliminate a mutation bias of unknown cause, multiple lines of evidence support a purifying selection model above a mutational bias explanation. The flank/core difference is not explained by skewed nucleotide content, replication timing, nucleosome occupancy or deficiency in mismatch repair. The depletion is not seen in tumour suppressors, consistent with their role in positive tumour selection, but is otherwise observed in cancer-associated and non-cancer genes, both essential and non-essential. Consistent with a role in splicing modulation, exonic splice enhancers have a lower SSM density before and after controlling for nucleotide composition; moreover, flanks at the 5′ end of the exons have significantly lower SSM density than at the 3′ end. Conclusions: These results suggest that the observable mutational spectrum of cancer genomes is not simply a product of various mutational processes and positive selection, but might also be shaped by negative selection.

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer

p53 mutations in classic and pleomorphic invasive lobular carcinoma of the breast

Author: A Petitjean, A Storey, AD Thor, AM Thompson, BJ Chae, BW Lisboa, C Noma, CC Harris, D Frolik, D Lohmann, E Gudlaugsson, FC Schmitt, G Lamolle, G Mazoujian, HPR Kini, IB Runnebaum, J Bartek, J Lukas, JM Dixon, JS Bentz, JV Chamary, LP Middleton, M Gasco, M Hollstein, MM Candeias, MM Siddique, N Buyru, N Perry, N Sneige, N Weidner, P Rossner Jr, PA Muller, PD Pharoah, PJ Diest van, PW Derksen, S Kato, T Ohayon, TG Kalemi, TI Andersen, V Eusebi, Y Umekita
Publication venue: Springer Netherlands
Publication date: 01/01/2012
BACKGROUND: p53 is a tumor suppressor that is frequently mutated in human cancers. Although alterations in p53 are common in breast cancer, few studies have specifically investigated TP53 mutations in the breast cancer subtype invasive lobular carcinoma (ILC). Recently reported conditional mouse models have indicated that functional p53 inactivation may play a role in ILC development and progression. Since reports on the detection of TP53 mutations in the relatively favorable classic and more aggressive pleomorphic variants of ILC (PILC) are rare and ambiguous, we performed a comprehensive analysis to determine the mutation status of TP53 in these breast cancer subtypes. METHODS: To increase our understanding of p53-mediated pathways and the roles they may play in the etiology of classic ILC and PILC, we investigated TP53 mutations and p53 accumulation in a cohort of 22 cases of classic ILC and 19 cases of PILC by direct DNA sequencing and immunohistochemistry. RESULTS: We observed 11 potentially pathogenic TP53 mutations, of which 3 were detected in classic ILC (13.6%) and 8 in PILC (42.1%; p = 0.04). While p53 protein accumulation was not significantly different between classic and pleomorphic ILC, mutations that affected structure and protein function were significantly associated with p53 protein levels. CONCLUSION: TP53 mutations occur more frequently in PILC than in classic ILC.

Crossref

Springer - Publisher Connector

PubMed Central

Radboud Repository

Mutation analysis of the MDM4 gene in German breast cancer patients

Background: MDM4 is a negative regulator of p53 and cooperates with MDM2 in the cellular response to DNA damage. It is unknown, however, whether MDM4 gene alterations play some role in the inherited component of breast cancer susceptibility. Methods: We sequenced the whole MDM4 coding region and flanking untranslated regions in genomic DNA samples obtained from 40 German patients with familial breast cancer. Selected variants were subsequently screened by RFLP-based assays in an extended set of breast cancer cases and controls. Results: Our resequencing study uncovered two MDM4 coding variants in 4/40 patients. Three patients carried a silent substitution at codon 74 that was linked with another rare variant in the 5′ UTR. No association of this allele with breast cancer was found in a subsequent screening of 133 patients with bilateral breast cancer and 136 controls. The fourth patient was heterozygous for the missense substitution D153G, which is located in a less conserved region of the MDM4 protein but may affect a predicted phosphorylation site. The D153G substitution only partially segregated with breast cancer in the family and was not identified on an additional 680 chromosomes screened. Conclusion: This study did not reveal clearly pathogenic mutations, although it uncovered two new unclassified variants at a low frequency. We conclude that there is no evidence for a major role of MDM4 coding variants in the inherited susceptibility towards breast cancer in German patients.

CiteSeerX

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Late Replicating Domains Are Highly Recombining in Females but Have Low Male Recombination Rates: Implications for Isochore Evolution

Author: A Cox, A Necsulea, BL Dumont, C Schmegner, C-L Chen, Catherine J. Pink, CJ Pink, CM Malcom, CM Ramsdell, D Karolchik, DJ Gaffney, E Yaffe, G Marais, G McVicker, G Piganeau, GE Magni, GI Lang, H Ellegren, I Hellmann, I Hiratani, J Berglund, J Meunier, J Perry, J-V Chamary, JA Stamatoyannopoulos, JF Crow, JN Strathern, JT Eppig, K Tamura, K Woodfine, KD Makova, KH Wolfe, L Duret, Laurence D. Hurst, M Brudno, M Costantini, M Touchon, M-C Marsolier-Kergoat, MI Jensen-Seaman, MJ Lercher, MT Webster, N Galtier, N Weddington, Pawel Michalak, PD Keightley, S Farkash-Amar, S Ptak, S Shifman, S Tyekucheva, TC Brown, TR Dreszer, WH Li, Y Clément, Y Watanabe
Publication venue: Public Library of Science
Publication date: 20/09/2011
In mammals, sequences that are either late replicating or highly recombining have high rates of evolution at putatively neutral sites. As early replicating domains and highly recombining domains both tend to be GC rich, we a priori expect these two variables to covary. If so, the relative contribution of either of these variables to the local neutral substitution rate might have been wrongly estimated owing to covariance with the other. Against our expectations, we find that sex-averaged recombination rates show little or no correlation with replication timing, suggesting that they are independent determinants of substitution rates. However, this result masks significant sex-specific complexity: late replicating domains tend to have high recombination rates in females but low recombination rates in males. That these trends are antagonistic explains why sex-averaged recombination is not correlated with replication timing. This unexpected result has several important implications. First, although both male and female recombination rates covary significantly with intronic substitution rates, the magnitude of this correlation is moderately underestimated for male recombination and slightly overestimated for female recombination, owing to covariance with replication timing. Second, the result could explain why male recombination is strongly correlated with GC content but female recombination is not. If, to explain the correlation between GC content and replication timing, we suppose that late replication forces reduced GC content, then GC promotion by biased gene conversion during female recombination is partly countered by the antagonistic effect of later replicating sequence tending to increase AT content. Indeed, the strength of the correlation between female recombination rate and local GC content is more than doubled by controlling for replication timing. Our results underpin the need to consider sex-specific recombination rates and potential covariates in analysis of GC content and rates of evolution.

Public Library of Science (PLOS)

Crossref

PubMed Central

The surprising negative correlation of gene length and optimal codon use - disentangling translational selection from GC-biased gene conversion in yeast

Background: Surprisingly, in several multi-cellular eukaryotes optimal codon use correlates negatively with gene length. This contrasts with the expectation under selection for translational accuracy. While suggested explanations focus on variation in strength and efficiency of translational selection, it has rarely been noticed that the negative correlation is reported only in organisms whose optimal codons are biased towards codons that end with G or C (-GC). This raises the question of whether forces that affect base composition, such as GC-biased gene conversion, contribute to the negative correlation between optimal codon use and gene length. Results: Yeast is a good organism in which to study this, as equal numbers of optimal codons end in -GC and -AT, and one may hence compare frequencies of optimal GC-ending with optimal AT-ending codons to disentangle the forces. Results of this study demonstrate that in yeast, frequencies of GC-ending (optimal and non-optimal) codons decrease with gene length and increase with recombination. A decrease of GC-ending codons along genes contributes to the negative correlation with gene length. Correlations with recombination and gene expression differentiate between GC-ending and optimal codons, and substitution patterns also support effects of GC-biased gene conversion. Conclusion: While the general effect of GC-biased gene conversion is well known, the negative correlation of optimal codon use with gene length has not been considered in this context before. Initiation of gene conversion events in promoter regions and the presence of a gene conversion gradient most likely explain the observed decrease of GC-ending codons with gene length and gene position.

Crossref

Harvard University - DASH

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Open Access LMU