Search CORE

46 research outputs found

Preferred and avoided codon pairs in three domains of life

Author: Remm Maido
Tats Age
Tenson Tanel
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Highly expressed proteins have an increased frequency of alanine in the second amino acid position

Author: Remm Maido
Tats Age
Tenson Tanel
Publication venue: BioMed Central
Publication date: 01/01/2006
Field of study

BACKGROUND: Although the sequence requirements for translation initiation regions have been frequently analysed, usually the highly expressed genes are not treated as a separate dataset. RESULTS: To investigate this, we analysed the mRNA regions downstream of initiation codons in nine bacteria, three archaea and three unicellular eukaryotes, comparing the dataset of highly expressed genes to the dataset of all genes. In addition to the detailed analysis of the nucleotide and codon frequencies we compared the N-termini of highly expressed proteins to the N-termini of all proteins coded in the genome. CONCLUSION: The most conserved pattern was observed at the amino acid level: strong alanine over-representation was observed at the second amino acid position of highly expressed proteins. This pattern is well conserved in all three domains of life

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Translation initiation region sequence preferences in Escherichia coli

Author: Remm Maido
Tats Age
Tenson Tanel
Vimberg Vladimir
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background The mRNA translation initiation region (TIR) comprises the initiator codon, Shine-Dalgarno (SD) sequence and translational enhancers. Probably the most abundant class of enhancers contains A/U-rich sequences. We have tested the influence of SD sequence length and the presence of enhancers on the efficiency of translation initiation. Results We found that during bacterial growth at 37°C, a six-nucleotide SD (AGGAGG) is more efficient than shorter or longer sequences. The A/U-rich enhancer contributes strongly to the efficiency of initiation, having the greatest stimulatory effect in the exponential growth phase of the bacteria. The SD sequences and the A/U-rich enhancer stimulate translation co-operatively: strong SDs are stimulated by the enhancer much more than weak SDs. The bacterial growth rate does not have a major influence on the TIR selection pattern. On the other hand, temperature affects the TIR preference pattern: shorter SD sequences are preferred at lower growth temperatures. We also performed an <it>in silico </it>analysis of the TIRs in all <it>E. coli </it>mRNAs. The base pairing potential of the SD sequences does not correlate with the codon adaptation index, which is used as an estimate of gene expression level. Conclusion In <it>E. coli </it>the SD selection preferences are influenced by the growth temperature and not influenced by the growth rate. The A/U rich enhancers stimulate translation considerably by acting co-operatively with the SD sequences.</p

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Recommended from our members

Stops making sense: translational trade-offs and stop codon reassignment

Author: A Eyre-Walker
A Tats
B Bonetti
Conrad P Lichtenstein
D Haig
D Kotlar
F Abascal
G Bertram
Greg S Elgar
H Liang
H Naora
H Seligmann
J Swire
James A Cotton
JP McCutcheon
L Kisselev
L Major
Louise J Johnson
M Adachi
M Pinotti
N Amrana
p David Polly
P Sicinski
PJ Keeling
PM Sharp
R Hershberg
RD Knight
RD Knight
Richard A Nichols
S Itzkovitz
S Karlin
S Osawa
Steven C Le Comber
TH Jukes
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Background Efficient gene expression involves a trade-off between (i) premature termination of protein synthesis; and (ii) readthrough, where the ribosome fails to dissociate at the terminal stop. Sense codons that are similar in sequence to stop codons are more susceptible to nonsense mutation, and are also likely to be more susceptible to transcriptional or translational errors causing premature termination. We therefore expect this trade-off to be influenced by the number of stop codons in the genetic code. Although genetic codes are highly constrained, stop codon number appears to be their most volatile feature. Results In the human genome, codons readily mutable to stops are underrepresented in coding sequences. We construct a simple mathematical model based on the relative likelihoods of premature termination and readthrough. When readthrough occurs, the resultant protein has a tail of amino acid residues incorrectly added to the C-terminus. Our results depend strongly on the number of stop codons in the genetic code. When the code has more stop codons, premature termination is relatively more likely, particularly for longer genes. When the code has fewer stop codons, the length of the tail added by readthrough will, on average, be longer, and thus more deleterious. Comparative analysis of taxa with a range of stop codon numbers suggests that genomes whose code includes more stop codons have shorter coding sequences. Conclusions We suggest that the differing trade-offs presented by alternative genetic codes may result in differences in genome structure. More speculatively, multiple stop codons may mitigate readthrough, counteracting the disadvantage of a higher rate of nonsense mutation. This could help explain the puzzling overrepresentation of stop codons in the canonical genetic code and most variants

Central Archive at the University of Reading

Crossref

Springer

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Enlighten

Queen Mary Research Online

Intra-genome variability in the dinucleotide composition of SARS-CoV-2

Author: Atkinson
Azhar
Banerjee
Berkhout
Burns
Chenik
Cooper
Cotten
Cox
Davidson
Deaton
Dev
Ficarelli
Firth
Futcher
Gaunt
Greenbaum
Groenke
Guo
Gutman
Hiscott
Hu
Irigoyen
Kanaya
Khoddami
Kim
Kozak
Kumar
Kunec
Liao
Lin
Lin
Lu
Marra
McClelland
Medvedeva
Moratorio
O'Connor
Odon
Perlman
Rasschaert
Rima
Rota
Ryabova
Sawicki
Sawicki
Schaecher
Schneider
Senanayake
Shi
Simmonds
Simmonds
Squires
Sved
Takata
Tang
Tats
Tomso
Tulloch
van der Hoek
Vlasova
Wise
Woo
Woo
Xia
Xie
Zaki
Zhu
Publication venue: 'Oxford University Press (OUP)'
Publication date: 13/08/2020
Field of study

Crossref

Edinburgh Research Explorer

The +4G Site in Kozak Consensus Is Not Related to the Efficiency of Translation Initiation

Author: A Coghlan
A Robert-Seilaniantz
A Tats
AM Cigan
B Futcher
C Flinta
DC Rowe
Emmanouil Dermitzakis
F Frottin
G Chen
GL Vilas
J Shine
J Shine
JB Plotkin
JH Zar
JS de Vries
L Duret
M Bentham
M Gouy
M Kozak
M Kozak
M Kozak
M Kozak
M Kozak
M Kozak
M Kozak
M Semon
N Sakurai
P Provitera
P Rice
PM Sharp
Q Tian
RP Moerschell
S Breuer
S Dinel
S Grunert
S Harkins
SP Gygi
T Ideker
T Ikemura
TA Farazi
TJ Griffin
TM Lowe
VE Velculescu
X Xia
X Xia
X Xia
X Xia
Xuhua Xia
Publication venue: Public Library of Science
Publication date: 07/02/2007
Field of study

The optimal context for translation initiation in mammalian species is GCCRCCaugG (where R = purine and “aug” is the initiation codon), with the -3R and +4G being particularly important. The presence of +4G has been interpreted as necessary for efficient translation initiation. Accumulated experimental and bioinformatic evidence has suggested an alternative explanation based on amino acid constraint on the second codon, i.e., amino acid Ala or Gly are needed as the second amino acid in the nascent peptide for the cleavage of the initiator Met, and the consequent overuse of Ala and Gly codons (GCN and GGN) leads to the +4G consensus. I performed a critical test of these alternative hypotheses on +4G based on 34169 human protein-coding genes and published gene expression data. The result shows that the prevalence of +4G is not related to translation initiation. Among the five G-starting codons, only alanine codons (GCN), and glycine codons (GGN) to a much smaller extent, are overrepresented at the second codon, whereas the other three codons are not overrepresented. While highly expressed genes have more +4G than lowly expressed genes, the difference is caused by GCN and GGN codons at the second codon. These results are inconsistent with +4G being needed for efficient translation initiation, but consistent with the proposal of amino acid constraint hypothesis

Public Library of Science (PLOS)

Crossref

PubMed Central

RNA virus attenuation by codon pair deoptimisation is an artefact of increases in CpG/UpA dinucleotide frequencies

Author: Atkinson
Bennetzen
Beutler
Boycheva
Buchan
Burns
Chevance
Coleman
Duan
Everitt
Folley
Gingold
Gutman
Hambleton
Irwin
Karlin
Le Nouen
Martrus
Moura
Moura
Mueller
Ni
Pothlichet
Puigbò
Ramsey
Rima
Sharp
Simmen
Simmonds
Simmonds
Tats
Thomas
Wang
Wimmer
Wright
Wu
Yang
Yarus
Publication venue: 'eLife Sciences Publications, Ltd'
Publication date: 09/12/2014
Field of study

Mutating RNA virus genomes to alter codon pair (CP) frequencies and reduce translation efficiency has been advocated as a method to generate safe, attenuated virus vaccines. However, selection for disfavoured CPs leads to unintended increases in CpG and UpA dinucleotide frequencies that also attenuate replication. We designed and phenotypically characterised mutants of the picornavirus, echovirus 7, in which these parameters were independently varied to determine which most influenced virus replication. CpG and UpA dinucleotide frequencies primarily influenced virus replication ability while no fitness differences were observed between mutants with different CP usage where dinucleotide frequencies were kept constant. Contrastingly, translation efficiency was unaffected by either CP usage or dinucleotide frequencies. This mechanistic insight is critical for future rational design of live virus vaccines and their safety evaluation; attenuation is mediated through enhanced innate immune responses to viruses with elevated CpG/UpA dinucleotide frequencies rather the viruses themselves being intrinsically defective

Crossref

PubMed Central

Edinburgh Research Explorer

Warwick Research Archives Portal Repository

University of St. Andrews - Pure

St Andrews Research Repository

A Universal Trend of Reduced mRNA Stability near the Translation-Initiation Site in Prokaryotes and Eukaryotes

Author: A Eyre-Walker
A Tats
AA Komar
AE Vinogradov
AI Su
AV Komarova
B Lemos
Berend Snel
C Hoede
C Kimchi-Sarfaty
C Pal
Claus O. Wilke
CM Stenstrom
DA Drummond
DA Drummond
DH Mathews
EI Gonzalez de Valdivia
F Wright
FCP Holstege
G Kudla
G Kudla
G Qing
G Zhang
H Akashi
H Akashi
H Chen
H Musto
HC Wang
IL Hofacker
IL Hofacker
J Mandel
J Sanchez
J Shine
JE Brock
JL Parmley
JL Parmley
JP Etchegaray
JV Chamary
JV Chamary
K Yamagishi
KB Zeldovich
KE Griswold
L Duret
L Duret
L Katz
M Eames
M Kozak
M Kozak
M Stenico
MW Covert
N Galtier
N Stoletzki
N Stoletzki
P Cortazzo
P Goymer
PG Higgs
PM Sharp
S Nakagawa
SA Shabalina
T Ikemura
T Warnecke
T Zhou
TA Thanaraj
Tong Zhou
V Stolc
V Vimberg
W Seffens
Wanjun Gu
YI Wolf
YM Zalucki
Z Yang
Publication venue: Public Library of Science
Publication date: 01/02/2010
Field of study

Recent studies have suggested that the thermodynamic stability of mRNA secondary structure near the start codon can regulate translation efficiency in Escherichia coli, and that translation is more efficient the less stable the secondary structure. We survey the complete genomes of 340 species for signals of reduced mRNA secondary structure near the start codon. Our analysis includes bacteria, archaea, fungi, plants, insects, fishes, birds, and mammals. We find that nearly all species show evidence for reduced mRNA stability near the start codon. The reduction in stability generally increases with increasing genomic GC content. In prokaryotes, the reduction also increases with decreasing optimal growth temperature. Within genomes, there is variation in the stability among genes, and this variation correlates with gene GC content, codon bias, and gene expression level. For birds and mammals, however, we do not find a genome-wide trend of reduced mRNA stability near the start codon. Yet the most GC rich genes in these organisms do show such a signal. We conclude that reduced stability of the mRNA secondary structure near the start codon is a universal feature of all cellular life. We suggest that the origin of this reduction is selection for efficient recognition of the start codon by initiator-tRNA

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Texas ScholarWorks