Search CORE

AUG_hairpin: prediction of a downstream secondary structure influencing the recognition of a translation start site

Author: Akinori Sarai
Alex V Kochetov
Andrey Palyanov
AV Kochetov
AV Kochetov
AV Kochetov
AV Pisarev
C Touriol
Dmitry Grigorovich
HS Kwon
I Ventoso
IB Rogozin
Igor I Titov
IL Hofacker
IM Meyer
JL Riechmann
JS McCaskill
K Clyde
K Takahashi
K-N Zhao
L Yang
M Ciullo
M Kozak
M Kozak
M Kozak
M Lukaszewicz
M Nguyen
Nikolay A Kolchanov
RJ Jackson
SA Shabalina
SA Shabalina
SD Baird
SV Sawant
W-L Hwang
Y Kobayashi
Publication venue: BioMed Central
Publication date: 01/08/2007
Field of study

Abstract Background The translation start site plays an important role in the control of translation efficiency of eukaryotic mRNAs. The recognition of the start AUG codon by eukaryotic ribosomes is considered to depend on its nucleotide context. However, the fraction of eukaryotic mRNAs with the start codon in a suboptimal context is relatively large. It may be expected that mRNA should possess some features providing efficient translation, including the proper recognition of a translation start site. It has been experimentally shown that a downstream hairpin located in certain positions with respect to start codon can compensate in part for the suboptimal AUG context and also increases translation from non-AUG initiation codons. Prediction of such a compensatory hairpin may be useful in the evaluation of eukaryotic mRNA translation properties. Results We evaluated interdependency between the start codon context and mRNA secondary structure at the CDS beginning: it was found that a suboptimal start codon context significantly correlated with higher base pairing probabilities at positions 13 – 17 of CDS of human and mouse mRNAs. It is likely that the downstream hairpins are used to enhance translation of some mammalian mRNAs <it>in vivo</it>. Thus, we have developed a tool, <it>AUG_hairpin</it>, to predict local stem-loop structures located within the defined region at the beginning of mRNA coding part. The implemented algorithm is based on the available published experimental data on the CDS-located stem-loop structures influencing the recognition of upstream start codons. Conclusion An occurrence of a potential secondary structure downstream of start AUG codon in a suboptimal context (or downstream of a potential non-AUG start codon) may provide researchers with a testable assumption on the presence of additional regulatory signal influencing mRNA translation initiation rate and the start codon choice. <it>AUG_hairpin</it>, which has a convenient Web-interface with adjustable parameters, will make such an evaluation easy and efficient.</p

Springer - Publisher Connector

Global Mapping of DNA Conformational Flexibility on Saccharomyces cerevisiae

Author: A Aranda
A Fungtammasan
A Letessier
A Re
A Sarai
AM Casper
AM Puliti
Andrea Bedini
B Gelfand
B Le Tallec
BR Graveley
C Vaillant
CA Beelman
Christos A. Ouzounis
D Mishmar
D Scannell
DJ Hogan
E Segal
E Zlotorynski
E Zlotorynski
EA Ozonov
EM Prescott
F Ozsolak
Giulia Menconi
H Zhang
I Sbrana
I Tirosh
I Tirosh
I Tirosh
Isabella Sbrana
J Zhao
JD Lieb
JH Graber
JM Perez-Canadillas
K Mimori
K Mrasek
KE Shearwin
KP Byrne
KP O’Brien
L Hurst
M Debatisse
MD Vinces
MTJ van Loenhout
NN Batada
O Shalem
P Milani
PJ Coates
R Gemayel
R Shalgi
Roberto Barale
S Kruglyak
S Semba
SG Durkin
T Tuller
TW Glover
U Nagalakshmi
W1 Lee
Y Field
Y Lai
Y Wang
Y Yang
Z Guo
Z Guo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2015
Field of study

In this study we provide the first comprehensive map of DNA conformational flexibility in Saccharomyces cerevisiae complete genome. Flexibility plays a key role in DNA supercoiling and DNA/protein binding, regulating DNA transcription, replication or repair. Specific interest in flexibility analysis concerns its relationship with human genome instability. Enrichment in flexible sequences has been detected in unstable regions of human genome defined fragile sites, where genes map and carry frequent deletions and rearrangements in cancer. Flexible sequences have been suggested to be the determinants of fragile gene proneness to breakage; however, their actual role and properties remain elusive. Our in silico analysis carried out genome-wide via the StabFlex algorithm, shows the conserved presence of highly flexible regions in budding yeast genome as well as in genomes of other Saccharomyces sensu stricto species. Flexibile peaks in S. cerevisiae identify 175 ORFs mapping on their 3’UTR, a region affecting mRNA translation, localization and stability. (TA)n repeats of different extension shape the central structure of peaks and co-localize with polyadenylation efficiency element (EE) signals. ORFs with flexible peaks share common features. Transcripts are characterized by decreased half-life: this is considered peculiar of genes involved in regulatory systems with high turnover; consistently, their function affects biological processes such as cell cycle regulation or stress response. Our findings support the functional importance of flexibility peaks, suggesting that the flexible sequence may be derived by an expansion of canonical TAYRTA polyadenylation efficiency element. The flexible (TA)n repeat amplification could be the outcome of an evolutionary neofunctionalization leading to a differential 3’-end processing and expression regulation in genes with peculiar function. Our study provides a new support to the functional role of flexibility in genomes and a strategy for its characterization inside human fragile sites

Archivio della Ricerca - Università di Pisa

FigShare

Inferring Binding Energies from Selected Binding Sites

Author: A Sarai
AE Kel
C Tuerk
Christopher Workman
DA Gilchrist
David Granas
DS Fields
DSF Homsi
E Roulet
E Sharon
Gary D. Stormo
GD Stormo
GD Stormo
GD Stormo
GD Stormo
H Ji
HF Teh
HG Roider
J Linnell
J Liu
JB Kinney
JJ Moré
L van Oeffelen
M Djordjevic
M Djordjevic
MF Berger
ML Lee
MQ Zhang
O Berg
PH von Hippel
PV Benos
PV Benos
Q Zhou
R Staden
SJ Maerkl
TH Cormen
TK Blackwell
TK Man
U Gerland
V Mustonen
VH Nagaraj
WE Wright
X Liu
X Meng
Y Takeda
Yue Zhao
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

We employ a biophysical model that accounts for the non-linear relationship between binding energy and the statistics of selected binding sites. The model includes the chemical potential of the transcription factor, non-specific binding affinity of the protein for DNA, as well as sequence-specific parameters that may include non-independent contributions of bases to the interaction. We obtain maximum likelihood estimates for all of the parameters and compare the results to standard probabilistic methods of parameter estimation. On simulated data, where the true energy model is known and samples are generated with a variety of parameter values, we show that our method returns much more accurate estimates of the true parameters and much better predictions of the selected binding site distributions. We also introduce a new high-throughput SELEX (HT-SELEX) procedure to determine the binding specificity of a transcription factor in which the initial randomized library and the selected sites are sequenced with next generation methods that return hundreds of thousands of sites. We show that after a single round of selection our method can estimate binding parameters that give very good fits to the selected site distributions, much better than standard motif identification algorithms

Digital Commons@Becker

PrognoScan: a new database for meta-analysis of the prognostic value of genes

Author: A Elfilali
A Mazumder
A Murat
AB Als
AH Bild
Akinori Sarai
AS Levenson
AV Ivshina
C Desmedt
C Sotiriou
CH Chung
CK Anders
CM Shachaf
D Schrag
DC Brown
DG Altman
DG Beer
E Will
F Vernooij
F Zhan
H Dai
H Parkinson
Hideaki Mizuno
HL Ford
HS Phillips
JP Jais
K Behbakht
K Chin
K Jeganathan
KC Jensen
KE Paulson
Kenta Nakai
KH Metzeler
KJ Reichenberger
KT Ng
Kunio Kitada
KY Bilimoria
LD Miller
LJ van't Veer
M Chanrion
M Hummel
M Mazumdar
M Prosniak
M Raponi
M Raponi
M Schmidt
N Holländer
P Kronqvist
R Mehra
R Miller
RA Irizarry
RD Coletta
RD Coletta
RL Camp
S Draghici
S Loi
S Mizuarai
S Tomida
SS Kim
T Barrett
U Abel
XJ Ma
Y Pawitan
Y Wang
Y Zhou
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background In cancer research, the association between a gene and clinical outcome suggests the underlying etiology of the disease and consequently can motivate further studies. The recent availability of published cancer microarray datasets with clinical annotation provides the opportunity for linking gene expression to prognosis. However, the data are not easy to access and analyze without an effective analysis platform. Description To take advantage of public resources in full, a database named "PrognoScan" has been developed. This is 1) a large collection of publicly available cancer microarray datasets with clinical annotation, as well as 2) a tool for assessing the biological relationship between gene expression and prognosis. PrognoScan employs the minimum <it>P</it>-value approach for grouping patients for survival analysis that finds the optimal cutpoint in continuous gene expression measurement without prior biological knowledge or assumption and, as a result, enables systematic meta-analysis of multiple datasets. Conclusion PrognoScan provides a powerful platform for evaluating potential tumor markers and therapeutic targets and would accelerate cancer research. The database is publicly accessible at <url>http://gibk21.bse.kyutech.ac.jp/PrognoScan/index.html</url>.</p

Springer - Publisher Connector

Aggressive PDACs show hypomethylation of repetitive elements and the execution of an intrinsic IFN program linked to a ductal cell of origin

Author: Arda H. Efsun
Backx Elyne
Brors Benedikt
Büscher Magdalena
Donato Elisa
Eils Roland
Espinet Elisa
Falcone Mattia
Gaida Matthias M.
Giese Nathalia A.
Gu Zuguang
Hackert Thilo
Imbusch Charles D.
Insua-Rodríguez Jacob
Klein Corinna
Kopp Janel L.
Kossi Steffi O.
Lee Alex Y. L.
Muckenhuber Alexander
Pfarr Nicole
Plass Christoph
Reitberger Manuel
Rodríguez-Paredes Manuel
Rooman Ilse
Safavi Mariam
Sarai Karnjit
Schlesner Matthias
Sprick Martin R.
Steiger Katja
Strobel Oliver
Thiel Vera
Trumpp Andreas
Vogel Vanessa
Weichenhan Dieter
Weichert Wilko
Weisenburger Silke
Yen Hsi-Yu
Zarei Soheila
Publication venue: 'American Association for Cancer Research (AACR)'
Publication date: 15/10/2020
Field of study

Pancreatic ductal adenocarcinoma (PDAC) is characterized by extensive desmoplasia, which challenges the molecular analyses of bulk tumor samples. Here we FACS-purified epithelial cells from human PDAC and normal pancreas and derived their genome-wide transcriptome and DNA methylome landscapes. Clustering based on DNA methylation revealed two distinct PDAC groups displaying different methylation patterns at regions encoding repeat elements. Methylation(low) tumors are characterized by higher expression of endogenous retroviral (ERV) transcripts and dsRNA sensors which leads to a cell intrinsic activation of an interferon signature (IFNsign). This results in a pro-tumorigenic microenvironment and poor patient outcome. Methylation(low)/IFNsign(high) and Methylation(high)/IFNsign(low) PDAC cells preserve lineage traits, respective of normal ductal or acinar pancreatic cells. Moreover, ductal-derived Kras(G12D)/Trp53(−/−) mouse PDACs show higher expression of IFNsign compared to acinar-derived counterparts. Collectively, our data point to two different origins and etiologies of human PDACs, with the aggressive Methylation(low)/IFNsign(high) subtype potentially targetable by agents blocking intrinsic IFN-signaling

OPUS Augsburg

Simulation of developmental changes in action potentials with ventricular cell models

Author: CH Cho
D Franco
EG Lakatta
F Chen
G Olivetti
GM Faber
H Masuda
H Satoh
H Takeshima
H Yokoshiki
Hitomi Itoh
I Nakajima
J Sanchez-Chapula
JL Puglisi
JR Couch
K Ono
K Takahashi
K Yasui
KR Chun
KW Linz
L Ferron
L Mahony
L Wang
LH Xie
M Artman
M Artman
M Kojima
M Nagashima
Masaru Tomita
MJ Kilborn
MP Davies
N Hagiwara
N Klugbauer
N Niwa
N Sarai
R Shirokov
S Matsuoka
S Seki
SG Spence
T Kiyosue
TV Huynh
W Liu
Y Kato
Yasuhiro Naito
ZJ Zhang
Publication venue: Kluwer Academic Publishers
Publication date: 01/01/2006
Field of study

During cardiomyocyte development, early embryonic ventricular cells show spontaneous activity that disappears at a later stage. Dramatic changes in action potential are mediated by developmental changes in individual ionic currents. Hence, reconstruction of the individual ionic currents into an integrated mathematical model would lead to a better understanding of cardiomyocyte development. To simulate the action potential of the rodent ventricular cell at three representative developmental stages, quantitative changes in the ionic currents, pumps, exchangers, and sarcoplasmic reticulum (SR) Ca2+ kinetics were represented as relative activities, which were multiplied by conductance or conversion factors for individual ionic systems. The simulated action potential of the early embryonic ventricular cell model exhibited spontaneous activity, which ceased in the simulated action potential of the late embryonic and neonatal ventricular cell models. The simulations with our models were able to reproduce action potentials that were consistent with the reported characteristics of the cells in vitro. The action potential of rodent ventricular cells at different developmental stages can be reproduced with common sets of mathematical equations by multiplying conductance or conversion factors for ionic currents, pumps, exchangers, and SR Ca2+ kinetics by relative activities

Springer - Publisher Connector

Dengue Virus Type 4 Phylogenetics in Brazil 2011: Looking beyond the Veil

Dengue Fever and Dengue Hemorrhagic Fever are diseases affecting approximately 100 million people/year and are a major concern in developing countries. In the present study, the phylogenetic relationship of six strains of the first autochthonous cases of DENV-4 infection occurred in Sao Paulo State, Parana State and Rio Grande do Sul State, Brazil, 2011 were studied. Nucleotide sequences of the envelope gene were determined and compared with sequences representative of the genotypes I, II, III and Sylvatic for DEN4 retrieved from GenBank. We employed a Bayesian phylogenetic approach to reconstruct the phylogenetic relationships of Brazilian DENV-4 and we estimated evolutionary rates and dates of divergence for DENV-4 found in Brazil in 2011. All samples sequenced in this study were located in Genotype II. The studied strains are monophyletic and our data suggest that they have been evolving separately for at least 4 to 6 years. Our data suggest that the virus might have been present in the region for some time, without being noticed by Health Surveillance Services due to a low level of circulation and a higher prevalence of DENV-1 and DENV- 2

CiteSeerX

Local Gene Regulation Details a Recognition Code within the LacI Transcriptional Factor Family

Author: A Glasfeld
A Sandelin
A Sarai
A Ureta-Vidal
AE Kazakov
AV Morozov
BM Hall
BW Matthews
C Francke
CE Bell
CG Kalodimos
CI Jørgensen
CO Pabo
CO Pabo
EJ Alm
Eric J. Alm
FM Camas
Francisco M. Camas
G Kolesov
G Paillard
Gary D. Stormo
GP Smith
J Boch
J Castresana
J Nardelli
J Sartorius
J Schultz
JL Betz
JO Korbel
JR Desjarlais
Juan F. Poyatos
L Milk
M Lewis
M Lewis
M Lewis
M Perros
M Suzuki
MA Schumacher
MA Schumacher
MJ Moscou
MJ Weickert
MM Gromiha
NC Seeman
NM Luscombe
P Baldi
PB Warren
PV Benos
R Hershberg
RC Edgar
RK Salinas
S Mahony
S Mahony
SA Wolfe
SJ Maerlk
T Sera
TA Desai
V Espinosa Angarica
W Thompson
WW Wasserman
Y Choo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2010
Field of study

The specific binding of regulatory proteins to DNA sequences exhibits no clear patterns of association between amino acids (AAs) and nucleotides (NTs). This complexity of protein-DNA interactions raises the question of whether a simple set of wide-coverage recognition rules can ever be identified. Here, we analyzed this issue using the extensive LacI family of transcriptional factors (TFs). We searched for recognition patterns by introducing a new approach to phylogenetic footprinting, based on the pervasive presence of local regulation in prokaryotic transcriptional networks. We identified a set of specificity correlations –determined by two AAs of the TFs and two NTs in the binding sites– that is conserved throughout a dominant subgroup within the family regardless of the evolutionary distance, and that act as a relatively consistent recognition code. The proposed rules are confirmed with data of previous experimental studies and by events of convergent evolution in the phylogenetic tree. The presence of a code emphasizes the stable structural context of the LacI family, while defining a precise blueprint to reprogram TF specificity with many practical applications.Ministerio de Ciencia e Innovación, Spain (Formación de Profesorado Universitario fellowship)Ministerio de Ciencia e Innovación, Spain (grant BFU2008-03632/BMC)Madrid (Spain : Region) (grant CCG08-CSIC/SAL-3651

CiteSeerX

DSpace@MIT

Digital.CSIC

Computational Structural Analysis: Multiple Proteins Bound to DNA

Author: A Sarai
AC Martin
AD McLachlan
Andrija Tomovic
B Jayaram
BP Berman
BP Berman
C Moorman
C Zhang
CK Reddy
CO Pabo
D Eisenberg
D Lejeune
E Krissinel
E Krissinel
Edward J. Oakeley
EG Hutchinson
G Barequet
G Bergen
G Turk
G Wang
G Wang
G Zhao
HM Berman
J Janin
JJ Ellis
JR Williamson
K Nadassy
KI Cho
L Lo Conte
LA Mirny
M Guharoy
M Hughes
M Michael Gromiha
M Teschner
M Treger
Mark Isalan
N Banerjee
NM Luscombe
NM Luscombe
P Chakrabarti
P Hubbard
S Ahmad
S Gottschalk
S Jones
S Jones
S Jones
S Jones
S Jones
S Liu
S Sinha
X Yu
XJ Lu
Y Mandel-Gutfreund
Publication venue: Public Library of Science
Publication date: 19/09/2008
Field of study

BACKGROUND: With increasing numbers of crystal structures of proteinratioDNA and proteinratioproteinratioDNA complexes publically available, it is now possible to extract sufficient structural, physical-chemical and thermodynamic parameters to make general observations and predictions about their interactions. In particular, the properties of macromolecular assemblies of multiple proteins bound to DNA have not previously been investigated in detail. METHODOLOGY/PRINCIPAL FINDINGS: We have performed computational structural analyses on macromolecular assemblies of multiple proteins bound to DNA using a variety of different computational tools: PISA; PROMOTIF; X3DNA; ReadOut; DDNA and DCOMPLEX. Additionally, we have developed and employed an algorithm for approximate collision detection and overlapping volume estimation of two macromolecules. An implementation of this algorithm is available at http://promoterplot.fmi.ch/Collision1/. The results obtained are compared with structural, physical-chemical and thermodynamic parameters from proteinratioprotein and single proteinratioDNA complexes. Many of interface properties of multiple proteinratioDNA complexes were found to be very similar to those observed in binary proteinratioDNA and proteinratioprotein complexes. However, the conformational change of the DNA upon protein binding is significantly higher when multiple proteins bind to it than is observed when single proteins bind. The water mediated contacts are less important (found in less quantity) between the interfaces of components in ternary (proteinratioproteinratioDNA) complexes than in those of binary complexes (proteinratioprotein and proteinratioDNA).The thermodynamic stability of ternary complexes is also higher than in the binary interactions. Greater specificity and affinity of multiple proteins binding to DNA in comparison with binary protein-DNA interactions were observed. However, protein-protein binding affinities are stronger in complexes without the presence of DNA. CONCLUSIONS/SIGNIFICANCE: Our results indicate that the interface properties: interface area; number of interface residues/atoms and hydrogen bonds; and the distribution of interface residues, hydrogen bonds, van der Walls contacts and secondary structure motifs are independent of whether or not a protein is in a binary or ternary complex with DNA. However, changes in the shape of the DNA reduce the off-rate of the proteins which greatly enhances the stability and specificity of ternary complexes compared to binary ones