Search CORE

86 research outputs found

The Escherichia coli transcriptome mostly consists of independently regulated modules

Author: A Anand
A Biton
A Delorme
A Frigyesi
A Hyvärinen
A Santos-Zavaleta
A-M Martoglio
AE Teschendorff
B Dalrymple
B Langmead
B-K Cho
B-K Cho
BM Bolstad
C Vijayendran
CL Turnbough Jr
D Kim
D Marbach
D Risso
D-S Huang
DS Latchman
E Nudler
EJ O’Brien
ENCODE Project Consortium.
ER Gansner
F Pedregosa
GI Guzmán
GI Guzmán
H Zou
HS Rhee
I Kristoficova
IM Keseler
J Pouyssegur
J Utrilla
JE Galagan
JJ Faith
JM Buescher
JM Engreitz
JM Monk
JT Leek
K Valgepea
K-K Yan
KF Jensen
KJ Karczewski
L Wang
M Ester
M Kim
M Lawrence
M Moretto
M Scott
M Scott
MB Gerstein
MI Love
NE Lewis
O Alter
P Chiappetta
P Comon
PR Subbarayan
PV Phaneuf
R De Smet
R Kolter
RA LaCroix
RB D’agostino
S Gama-Castro
S Lin
SJ Larsen
SW Seo
T Baba
T Barrett
TM Henkin
W Kong
W Liebermeister
W Saelens
X Zhang
Xin Fang
XW Zhang
Y Gao
Y Yamanaka
Z Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Underlying cellular responses is a transcriptional regulatory network (TRN) that modulates gene expression. A useful description of the TRN would decompose the transcriptome into targeted effects of individual transcriptional regulators. Here, we apply unsupervised machine learning to a diverse compendium of over 250 high-quality Escherichia coli RNA-seq datasets to identify 92 statistically independent signals that modulate the expression of specific gene sets. We show that 61 of these transcriptomic signals represent the effects of currently characterized transcriptional regulators. Condition-specific activation of signals is validated by exposure of E. coli to new environmental conditions. The resulting decomposition of the transcriptome provides: a mechanistic, systems-level, network-based explanation of responses to environmental and genetic perturbations; a guide to gene and regulator function discovery; and a basis for characterizing transcriptomic differences in multiple strains. Taken together, our results show that signal summation describes the composition of a model prokaryotic transcriptome

Crossref

ScholarWorks@UNIST

eScholarship - University of California

Online Research Database In Technology

Co- and post-translational translocation through the protein-conducting channel:analogous mechanisms at work?

Many proteins are translocated across, or integrated into, membranes. Both functions are fulfilled by the 'translocon/translocase', which contains a membrane-embedded proteinconducting channel (PCC) and associated soluble factors that drive translocation and insertion reactions using nucleotide triphosphates as fuel. This perspective focuses on reinterpreting existing experimental data in light of a recently proposed PCC model comprising a front-to-front dimer of SecY or Sec61 heterotrimeric complexes. In this new framework, we propose (i) a revised model for SRP-SR-mediated docking of the ribosome-nascent polypeptide to the PCC; (ii) that the dynamic interplay between protein substrate, soluble factors and PCC controls the opening and closing of a transmembrane channel across, and/or a lateral gate into, the membrane; and (iii) that co-and post-translational translocation, involving the ribosome and SecA, respectively, not only converge at the PCC but also use analogous mechanisms for coordinating protein translocation

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Factor analysis for gene regulatory networks and transcription factor activity profiles

Author: A Frigyesi
A Utsugi
AL Boulesteix
AM Martoglio
C Sabatti
C Sabatti
E Fokoue
G Hinton
H Kaiser
H Ming
H Salgado
I Pournara
Iosifina Pournara
J Liao
K Kao
L Tran
Lorenz Wernisch
M Tipping
M West
O Aguilar
P Schönemann
W Liebermeister
Z Ghahramani
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

BACKGROUND: Most existing algorithms for the inference of the structure of gene regulatory networks from gene expression data assume that the activity levels of transcription factors (TFs) are proportional to their mRNA levels. This assumption is invalid for most biological systems. However, one might be able to reconstruct unobserved activity profiles of TFs from the expression profiles of target genes. A simple model is a two-layer network with unobserved TF variables in the first layer and observed gene expression variables in the second layer. TFs are connected to regulated genes by weighted edges. The weights, known as factor loadings, indicate the strength and direction of regulation. Of particular interest are methods that produce sparse networks, networks with few edges, since it is known that most genes are regulated by only a small number of TFs, and most TFs regulate only a small number of genes. RESULTS: In this paper, we explore the performance of five factor analysis algorithms, Bayesian as well as classical, on problems with biological context using both simulated and real data. Factor analysis (FA) models are used in order to describe a larger number of observed variables by a smaller number of unobserved variables, the factors, whereby all correlation between observed variables is explained by common factors. Bayesian FA methods allow one to infer sparse networks by enforcing sparsity through priors. In contrast, in the classical FA, matrix rotation methods are used to enforce sparsity and thus to increase the interpretability of the inferred factor loadings matrix. However, we also show that Bayesian FA models that do not impose sparsity through the priors can still be used for the reconstruction of a gene regulatory network if applied in conjunction with matrix rotation methods. Finally, we show the added advantage of merging the information derived from all algorithms in order to obtain a combined result. CONCLUSION: Most of the algorithms tested are successful in reconstructing the connectivity structure as well as the TF profiles. Moreover, we demonstrate that if the underlying network is sparse it is still possible to reconstruct hidden activity profiles of TFs to some degree without prior connectivity information

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Domain Organization of Long Signal Peptides of Single-Pass Integral Membrane Proteins Reveals Multiple Functional Capacity

Author: A Al-Qahtani
A Schreiner
Alexander Schreiner
Anna Starzinski-Powitz
B Jungnickel
B Martoglio
B Martoglio
CA Hansson
CH Wu
E Dultz
E Resch
Eduard Resch
EG Hutchinson
F Foulquier
G Blobel
G Kurys
G Schneider
G von Heijne
G von Heijne
Gisbert Schneider
H-B Shen
J Berger
Jan A. Hiss
Janet Kelso
JD Bendtsen
JW Izard
K-C Chou
K-C Chou
L Gray
L Käll
M Froeschke
M Meissner
M Ouzzine
M Wiedmann
MA Robin
ME Watson
Michael Meissner
N Takasugi
O Emanuelsson
O Emanuelsson
P Horton
PA Champion
RL Szabady
RS Hegde
S Bharti
S Ramanujan
T Tamura
V Jakob
W Nickel
Z Yuan
ZP Feng
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

Targeting signals direct proteins to their extra - or intracellular destination such as the plasma membrane or cellular organelles. Here we investigated the structure and function of exceptionally long signal peptides encompassing at least 40 amino acid residues. We discovered a two-domain organization (“NtraC model”) in many long signals from vertebrate precursor proteins. Accordingly, long signal peptides may contain an N-terminal domain (N-domain) and a C-terminal domain (C-domain) with different signal or targeting capabilities, separable by a presumably turn-rich transition area (tra). Individual domain functions were probed by cellular targeting experiments with fusion proteins containing parts of the long signal peptide of human membrane protein shrew-1 and secreted alkaline phosphatase as a reporter protein. As predicted, the N-domain of the fusion protein alone was shown to act as a mitochondrial targeting signal, whereas the C-domain alone functions as an export signal. Selective disruption of the transition area in the signal peptide impairs the export efficiency of the reporter protein. Altogether, the results of cellular targeting studies provide a proof-of-principle for our NtraC model and highlight the particular functional importance of the predicted transition area, which critically affects the rate of protein export. In conclusion, the NtraC approach enables the systematic detection and prediction of cryptic targeting signals present in one coherent sequence, and provides a structurally motivated basis for decoding the functional complexity of long protein targeting signals

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Hochschulschriftenserver - Universität Frankfurt am Main

PROlocalizer: integrated web service for protein subcellular localization prediction

Author: A Garg
A Krogh
B Eisenhaber
B Eisenhaber
B Eisenhaber
B Martoglio
C Guda
CS Yu
E Castro de
EW Klee
G Neuberger
G Neuberger
GH Schneider
HB Shen
J Sprenger
J Thusberg
JD Bendtsen
K Laurila
K Nakai
KC Chou
KC Chou
Kirsti Laurila
L Käll
M Cokol
Mauno Vihinen
O Emanuelsson
O Emanuelsson
O Emanuelsson
O Emanuelsson
P Dönnes
P Horton
R Falk
R Nair
RM Stroud
SR Sunyaev
TN Davis
Z Lu
Z Yuan
Publication venue: Springer Vienna
Publication date: 01/01/2010
Field of study

Subcellular localization is an important protein property, which is related to function, interactions and other features. As experimental determination of the localization can be tedious, especially for large numbers of proteins, a number of prediction tools have been developed. We developed the PROlocalizer service that integrates 11 individual methods to predict altogether 12 localizations for animal proteins. The method allows the submission of a number of proteins and mutations and generates a detailed informative document of the prediction and obtained results. PROlocalizer is available at http://bioinf.uta.fi/PROlocalizer/

Lund University Publications

Crossref

Springer - Publisher Connector

PubMed Central

The Plasmodium Export Element Revisited

Author: A Shanmugham
AA Zamyatnin
B Martoglio
BM Cooke
C Chothia
CJ Stoeckert Jr
DD Jones
DI Baruch
DM Engelmann
E Knuepfer
F Sargent
Florian Schwarte
G Cochrane
G Schneider
G Schneider
Gisbert Schneider
H Nielsen
I Ansorge
J Benting
J Zuegge
Jan Alexander Hiss
JD Bendtsen
JD Smith
JM Przyborski
Jude Marek Przyborski
Klaus Lingelbach
M Marti
M Marti
M Petter
M Rug
M Schmuker
MC Nunes
ME Wickham
MJ Gardner
N Joannin
NL Hiller
Per Westermark
Q Cheng
RS Hegde
S Baumeister
S Henikoff
SA Kyes
SA Ralph
SF Altschul
T Kohonen
TJ Sargeant
TP Hopp
XZ Su
Publication venue: Public Library of Science
Publication date: 01/01/2008
Field of study

We performed a bioinformatical analysis of protein export elements (PEXEL) in the putative proteome of the malaria parasite Plasmodium falciparum. A protein family-specific conservation of physicochemical residue profiles was found for PEXEL-flanking sequence regions. We demonstrate that the family members can be clustered based on the flanking regions only and display characteristic hydrophobicity patterns. This raises the possibility that the flanking regions may contain additional information for a family-specific role of PEXEL. We further show that signal peptide cleavage results in a positional alignment of PEXEL from both proteins with, and without, a signal peptide

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Hochschulschriftenserver - Universität Frankfurt am Main

A Metabolomic Approach to the Study of Wine Micro-Oxygenation

Author: A Carpentieri
A Castañeda-Ovando
A Cuadros-Inostroza
A-M Martoglio
Adelio Rigo
AJ Bell
AL Waterhouse
Alessandra Biondi Bartolini
BE Boser
C Alcalde-Eon
C Smith
CS McSweeney
D Durner
Daniel J. Kliebenstein
Daniele Perenzoni
Domenico Masuero
E Gómez-Plaza
E Villagra
EJ Calabrese
F Mattivi
FA van Dorsten
Fulvio Mattivi
G Liger-Belair
G Mazerolles
G Theodoridis
H Fulcrand
H-S Son
H-S Son
H-S Son
J Boccard
J Drinkine
JC Danilewicz
JC Danilewicz
K Skogerson
L Pasteur
L Vaclavik
L Yetukuri
LW Sumner
M Bodan
M Cano-Lopez
M Cano-López
M del Carmen Llaudy
M Nassiri-Asl
M Scholz
M Schwarz
M-A Ducasse
M-A Ducasse
Matthias Scholz
MJ Cejudo-Bastante
MJ Cejudo-Bastante
MJ Cejudo-Bastante
MR Guasch-Janè
N Kountoudakis
P Comon
P Ribereau-Gayon
P Ribereau-Gayon
Panagiotis Arapitsas
R Tautenhahn
RB Boulton
RD Gougeon
S Mahadevan
S Pérez-Magariño
Stefano Di Blasi
T Doco
TP Dew
Urska Vrhovsek
V Atanasova
V de Freitas
VL Singleton
VN Vapnik
W Guan
W Liebermeister
Y-S Hong
Z Guadalupe
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Wine micro-oxygenation is a globally used treatment and its effects were studied here by analysing by untargeted LC-MS the wine metabolomic fingerprint. Eight different procedural variations, marked by the addition of oxygen (four levels) and iron (two levels) were applied to Sangiovese wine, before and after malolactic fermentation

Public Library of Science (PLOS)

Crossref

Archivio istituzionale della ricerca - Fondazione Edmund Mach

Directory of Open Access Journals

PubMed Central

The Francis Crick Institute

Large-Scale Discovery and Characterization of Protein Regulatory Motifs in Eukaryotes

Author: A Aitken
A Belle
A Remenyi
AM Benham
B Martoglio
C Stark
CE Lawrence
CL Denis
D Chelsky
D Kalderon
D Schwartz
Daniel S. Lieber
E Birney
EC Hurt
F Diella
F Diella
FN Vogtle
G Blobel
H Dinkel
H Goodarzi
H Yu
I Jonassen
I Rigoutsos
J Ptacek
J Rush
JC Semenza
M Fuxreiter
M Gstaiger
MA Beer
MN Hall
N Slonim
NE Davey
O Elemento
Olivier Elemento
P Puntervoll
P Young
RB Russell
RJ Edwards
RJ Edwards
S Balla
S Subramani
Saeed Tavazoie
SB Ficarro
Sridhar Hannenhalli
TL Bailey
TM Cover
V Neduva
V Neduva
V Neduva
V Neduva
VD Rao
WK Huh
X Xie
Y Gavel
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

The increasing ability to generate large-scale, quantitative proteomic data has brought with it the challenge of analyzing such data to discover the sequence elements that underlie systems-level protein behavior. Here we show that short, linear protein motifs can be efficiently recovered from proteome-scale datasets such as sub-cellular localization, molecular function, half-life, and protein abundance data using an information theoretic approach. Using this approach, we have identified many known protein motifs, such as phosphorylation sites and localization signals, and discovered a large number of candidate elements. We estimate that ∼80% of these are novel predictions in that they do not match a known motif in both sequence and biological context, suggesting that post-translational regulation of protein behavior is still largely unexplored. These predicted motifs, many of which display preferential association with specific biological pathways and non-random positioning in the linear protein sequence, provide focused hypotheses for experimental validation

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Expression and Characterization of Drosophila Signal Peptide Peptidase-Like (sppL), a Gene That Encodes an Intramembrane Protease

Author: A Kilic
A Krogh
A Weihofen
A Weihofen
AB Paaby
AH Brand
AL Parks
AP Grigorenko
B Amarneh
B Biehs
B De Strooper
B Martoglio
Brian Biehs
C Slack
C Werz
CC Spencer
CP Ponting
D Ron
David J. Casso
DJ Casso
E Friedmann
E Friedmann
G Struhl
GE Tusnady
J Loureiro
J McLauchlan
JC Christianson
JD Thompson
JP Miller
KA Matthews
L Martin
M Ashburner
M Stapleton
Maria Gasset
MK Lemberg
MS Wolfe
P Krawitz
R Chenna
R Fluhrer
R Pethica
S Han
S Narayanan
S Roy
S Urban
SF Altschul
SJ Poole
Songmei Liu
ST Thibault
SW Oh
T Ishikawa
Thomas B. Kornberg
V Kirkin
W Song
Y Ye
YM Chan
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Intramembrane proteases of the Signal Peptide Peptidase (SPP) family play important roles in developmental, metabolic and signaling pathways. Although vertebrates have one SPP and four SPP-like (SPPL) genes, we found that insect genomes encode one Spp and one SppL. Characterization of the Drosophila sppL gene revealed that the predicted SppL protein is a highly conserved structural homolog of the vertebrate SPPL3 proteases, with a predicted nine-transmembrane topology, an active site containing aspartyl residues within a transmembrane region, and a carboxy-terminal PAL domain. SppL protein localized to both the Golgi and ER. Whereas spp is an essential gene that is required during early larval stages and whereas spp loss-of-function reduced the unfolded protein response (UPR), sppL loss of function had no apparent phenotype. This was unexpected given that genetic knockdown phenotypes in other organisms suggested significant roles for Spp-related proteases

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Identification of Single- and Multiple-Class Specific Signature Genes from Gene Expression Profiles by Group Marker Index

Author: A Bhattacharjee
A Rocchi
A Yoshimura
AM Martoglio
AM Patel
AS Kostyukova
BJ McHugh
C Han
CH Ooi
CT Yap
D Nadano
DH Campbell
DW Huang
DW Huang
E Davicioni
EB Huerta
F Chiarini
G Agatha
G Gerlitz
H Ishii
H Watanabe
I Aifantis
I Guyon
I-Fang Chung
IM Depaz
J Khan
J Khan
JA Cancelas
JR Downing
K Baird
K Kuroda
K Mengubas
K Scotlandi
Kripamoy Aguan
L Li
L Martins
L Sun
L Zhang
M Bustin
M Kanehisa
M Kanehisa
M Kanehisa
M Linial
M Maekawa
M Salagierski
M Wang
M Yousef
M Yousef
ME Atz
MS Lan
N Yamashita
NH Bishopric
Nikhil R. Pal
NK Mukhopadhyay
NR Pal
P Pavlidis
PA Zweidler-McKay
Q Liu
R Fernández-Chacón
R Fiancette
R Hulshizer
R Nahar
R Opgen-Rhein
RJ van Alphen
S Dudoit
S Niijima
S Ocak
S Seo
S Tavor
SA Armstrong
SL Pomeroy
Sumitra Deb
T Jirapech-Umpai
T Tian
TR Golub
V Cerisano
V Zuber
VG Tusher
VI Taylor JG
WD Liu
WG Dilley
WZ Ren
X Zhou
XX Liu
Y Gu
Y Gu
Y Saeys
Y Yu
YS Tsai
Yu-Shuen Tsai
Ø Bruserud
Publication venue: Public Library of Science
Publication date: 01/09/2011
Field of study

Informative genes from microarray data can be used to construct prediction model and investigate biological mechanisms. Differentially expressed genes, the main targets of most gene selection methods, can be classified as single- and multiple-class specific signature genes. Here, we present a novel gene selection algorithm based on a Group Marker Index (GMI), which is intuitive, of low-computational complexity, and efficient in identification of both types of genes. Most gene selection methods identify only single-class specific signature genes and cannot identify multiple-class specific signature genes easily. Our algorithm can detect de novo certain conditions of multiple-class specificity of a gene and makes use of a novel non-parametric indicator to assess the discrimination ability between classes. Our method is effective even when the sample size is small as well as when the class sizes are significantly different. To compare the effectiveness and robustness we formulate an intuitive template-based method and use four well-known datasets. We demonstrate that our algorithm outperforms the template-based method in difficult cases with unbalanced distribution. Moreover, the multiple-class specific genes are good biomarkers and play important roles in biological pathways. Our literature survey supports that the proposed method identifies unique multiple-class specific marker genes (not reported earlier to be related to cancer) in the Central Nervous System data. It also discovers unique biomarkers indicating the intrinsic difference between subtypes of lung cancer. We also associate the pathway information with the multiple-class specific signature genes and cross-reference to published studies. We find that the identified genes participate in the pathways directly involved in cancer development in leukemia data. Our method gives a promising way to find genes that can involve in pathways of multiple diseases and hence opens up the possibility of using an existing drug on other diseases as well as designing a single drug for multiple diseases

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central