Search CORE

42 research outputs found

Employing conservation of co-expression to improve functional inference

Author: Carsten O Daub
CO Daub
Erik LL Sonnhammer
FD Gibbons
G Glazko
G Yona
H Ge
H Herzel
JM Stuart
KP O'Brien
M Ashburner
M Kanehisa
M Kotlyar
MB Eisen
N Bhardwaj
P Tsaparas
PT Spellman
PW Lord
R Jansen
R Steuer
SA Teichmann
SK Kim
T Beissbarth
TR Hughes
TR Li
V van Noort
WM Fitch
X Wen
Publication venue: BioMed Central
Publication date: 22/09/2008
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Parallel mutual information estimation for inferring gene regulatory networks on GPUs

Author: AJ Butte
AM Fraser
Bertil Schmidt
CO Daub
E Lindholm
Haixiang Shi
I Arsic
J Schäfer
J Wilson
J Zola
J Zola
JPW Pluim
M Tebmann
N CUDA
N Friedman
P D'Haeseleer
SA Manavski
W Liu
Weiguo Liu
Wolfgang Müller-Wittig
X Chen
X Zhou
X Zhou
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Mutual information is a measure of similarity between two variables. It has been widely used in various application domains including computational biology, machine learning, statistics, image processing, and financial computing. Previously used simple histogram based mutual information estimators lack the precision in quality compared to kernel based methods. The recently introduced B-spline function based mutual information estimation method is competitive to the kernel based methods in terms of quality but at a lower computational complexity. Results We present a new approach to accelerate the B-spline function based mutual information estimation algorithm with commodity graphics hardware. To derive an efficient mapping onto this type of architecture, we have used the Compute Unified Device Architecture (CUDA) programming model to design and implement a new parallel algorithm. Our implementation, called CUDA-MI, can achieve speedups of up to 82 using double precision on a single GPU compared to a multi-threaded implementation on a quad-core CPU for large microarray datasets. We have used the results obtained by CUDA-MI to infer gene regulatory networks (GRNs) from microarray data. The comparisons to existing methods including ARACNE and TINGe show that CUDA-MI produces GRNs of higher quality in less time. Conclusions CUDA-MI is publicly available open-source software, written in CUDA and C++ programming languages. It obtains significant speedup over sequential multi-threaded implementation by fully exploiting the compute capability of commonly used CUDA-enabled low-cost GPUs.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Robust Detection of Hierarchical Communities from Escherichia coli Gene Expression Data

Author: A Beyer
AL Barabási
BH Good
BW Kernighan
CO Daub
D Duewer
D Marbach
DFT Veiga
E Bonnet
E Ravasz
E Segal
EH Davidson
F Luo
G Balázsi
G Getz
G Palla
G Palla
H Zare
HW Ma
J Chen
J Duch
J Hubble
J Lemke
J Reichardt
JJ Faith
JJ Faith
JN Weinstein
K Baggerly
Kevin E. Bassler
KY Yeung
M Blatt
M Riley
MB Eisen
MEJ Newman
MEJ Newman
MF Traxler
MM Barker
N Friedman
N Friedman
O Alter
PD Karp
Q Lu
R Guimerà
RA Irizarry
S Fortunato
S Fortunato
S Gama-Castro
S Raychaudhuri
S Tavazoie
Santiago Treviño
Satoru Miyano
SB Seidman
SB Seidman
SP Borgatii
SP Borgatii
TF Cooper
Tim F. Cooper
TS Gardner
U Brandes
UN Raghavan
X Wen
Y Benjamini
Y Sun
Yudong Sun
Z Shi
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 11/01/2012
Field of study

Determining the functional structure of biological networks is a central goal of systems biology. One approach is to analyze gene expression data to infer a network of gene interactions on the basis of their correlated responses to environmental and genetic perturbations. The inferred network can then be analyzed to identify functional communities. However, commonly used algorithms can yield unreliable results due to experimental noise, algorithmic stochasticity, and the influence of arbitrarily chosen parameter values. Furthermore, the results obtained typically provide only a simplistic view of the network partitioned into disjoint communities and provide no information of the relationship between communities. Here, we present methods to robustly detect coregulated and functionally enriched gene communities and demonstrate their application and validity for Escherichia coli gene expression data. Applying a recently developed community detection algorithm to the network of interactions identified with the context likelihood of relatedness (CLR) method, we show that a hierarchy of network communities can be identified. These communities significantly enrich for gene ontology (GO) terms, consistent with them representing biologically meaningful groups. Further, analysis of the most significantly enriched communities identified several candidate new regulatory interactions. The robustness of our methods is demonstrated by showing that a core set of functional communities is reliably found when artificial noise, modeling experimental noise, is added to the data. We find that noise mainly acts conservatively, increasing the relatedness required for a network link to be reliably assigned and decreasing the size of the core communities, rather than causing association of genes into new communities.Comment: Due to appear in PLoS Computational Biology. Supplementary Figure S1 was not uploaded but is available by contacting the author. 27 pages, 5 figures, 15 supplementary file

arXiv.org e-Print Archive

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

FigShare

Detection of regulator genes and eQTLs in gene networks

Author: A Butte
A Chatr-Aryamontri
A Clauset
A Joshi
A Joshi
A Kundaje
AA Shabalin
AJ Enright
AJ Walhout
AS Dimas
B Schwanhausser
B Zhang
B Zhang
C Cenik
CO Daub
D Koller
DA Cusanovich
DM Greenawalt
E Bonnet
E Ravasz
E Segal
EC Neto
EC Neto
EC Neto
EE Schadt
EE Schadt
EE Schadt
EE Schadt
EE Schadt
EJ Foss
F Grubert
F Yue
FA Cubillos
FW Albert
G Hemani
G Nicholson
GD Smith
GH Golub
H Foroughi Asl
H Talukdar
HN Kadarmideen
J Millstein
J Qi
J Zhu
J Zhu
J Zhu
JE Aten
JF Ayroles
JJ Faith
JL Björkegren
JS Liu
K Basso
K Qu
KG Ardlie
L Wu
LA Hindorff
LH Hartwell
LS Chen
M Ashburner
M Civelek
M Georges
M Gerstein
M Medvedovic
M Schmidt
M Scutari
MA Schaub
MB Eisen
MD Ritchie
ME Goddard
MEJ Newman
MEJ Newman
MV Rockman
MV Rockman
N Friedman
N Friedman
N Friedman
N Laird
O Stegle
P Langfelder
P Langfelder
P Langfelder
P Lu
R Sharan
R Sharan
RB Brem
RW Williams
S Lee
S Roy
S Tavazoie
SI Lee
SM Waszak
SS Rao
T Lappalainen
T Michoel
TA Manolio
TF Mackay
The ENCODE
TS Furey
VG Cheung
W Cookson
W Zhang
Y Chen
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2016
Field of study

Genetic differences between individuals associated to quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated to differences in expression levels of nearby genes (they are "expression quantitative trait loci" or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and softwares to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico.Comment: minor revision with typos corrected; review article; 24 pages, 2 figure

arXiv.org e-Print Archive

Crossref

Integrative inference of gene-regulatory networks in Escherichia coli using information theoretic concepts and sequence analysis

Author: A Rao
AA Herbert
AA Margolin
AJ Butte
Anna Göhler
ARFD Henestrosa
BK Cho
C Yanisch-Perron
Christoph Kaleta
CO Daub
EL Murray
GD Stormo
GEGE Forsythe
GJ McKenzie
H Ogasawara
J Massey
J van Helden
JJ Faith
JT Wade
K Basso
K Yamamoto
Knut Jahreis
L McCue
LA McCue
M Hecker
MD Bradley
NM Kredich
O Bembom
PE Meyer
R Schneider
Reinhard Guthke
SG Sedgwick
Stefan Schuster
Swetlana Nikolajewa
T Zeppenfeld
TB Morrison
YI Moon
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background Although <it>Escherichia coli </it>is one of the best studied model organisms, a comprehensive understanding of its gene regulation is not yet achieved. There exist many approaches to reconstruct regulatory interaction networks from gene expression experiments. Mutual information based approaches are most useful for large-scale network inference. Results We used a three-step approach in which we combined gene regulatory network inference based on directed information (DTI) and sequence analysis. DTI values were calculated on a set of gene expression profiles from 19 time course experiments extracted from the Many Microbes Microarray Database. Focusing on influences between pairs of genes in which one partner encodes a transcription factor (TF) we derived a network which contains 878 TF - gene interactions of which 166 are known according to RegulonDB. Afterward, we selected a subset of 109 interactions that could be confirmed by the presence of a phylogenetically conserved binding site of the respective regulator. By this second step, the fraction of known interactions increased from 19% to 60%. In the last step, we checked the 44 of the 109 interactions not yet included in RegulonDB for functional relationships between the regulator and the target and, thus, obtained ten TF - target gene interactions. Five of them concern the regulator LexA and have already been reported in the literature. The remaining five influences describe regulations by Fis (with two novel targets), PhdR, PhoP, and KdgR. For the validation of our approach, one of them, the regulation of lipoate synthase (LipA) by the pyruvate-sensing pyruvate dehydrogenate repressor (PdhR), was experimentally checked and confirmed. Conclusions We predicted a set of five novel TF - target gene interactions in <it>E. coli</it>. One of them, the regulation of <it>lipA </it>by the transcriptional regulator PdhR was validated experimentally. Furthermore, we developed DTInfer, a new R-package for the inference of gene-regulatory networks from microarrays using directed information.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Inter-laboratory reproducibility of fast gas chromatography–electron impact–time of flight mass spectrometry (GC–EI–TOF/MS) based plant metabolomics

Author: A Erban
A Lommen
A Lüdemann
A Stepansky
AI Saeed
AI Saeed
Alexander Erban
Alexander Luedemann
AR Fernie
Arjen Lommen
B Biais
CHR Vos de
CO Daub
EC Horning
G Noctor
GS Catchpole
H Jenkins
H Lu
H Suzuki
J Kopka
J Lisec
J. William Allwood
JJ Jansen
Joachim Kopka
JV Stone
KC Verhoeckx
KH Karstensen
L Pauling
Lorraine Kay
LW Sumner
M Beckmann
M Scholz
M Scholz
NW Hardy
O Fiehn
O Fiehn
O Fiehn
O Fiehn
O Fiehn
R Goodacre
RA Dixon
Ralf Löscher
RD Hall
RD Hall
RD Hall
Royston Goodacre
S O’Hagan
Sjaak de Koning
U Roessner
W Pennie
W Pongsuwan
Warwick B. Dunn
WH Heijne
WM Heijne
Z Gao
Publication venue: Springer US
Publication date: 01/01/2009
Field of study

The application of gas chromatography–mass spectrometry (GC–MS) to the ‘global’ analysis of metabolites in complex samples (i.e. metabolomics) has now become routine. The generation of these data-rich profiles demands new strategies in data mining and standardisation of experimental and reporting aspects across laboratories. As part of the META-PHOR project’s (METAbolomics for Plants Health and OutReach: http://www.meta-phor.eu/) priorities towards robust technology development, a GC–MS ring experiment based upon three complex matrices (melon, broccoli and rice) was launched. All sample preparation, data processing, multivariate analyses and comparisons of major metabolite features followed standardised protocols, identical models of GC (Agilent 6890N) and TOF/MS (Leco Pegasus III) were also employed. In addition comprehensive GC×GC–TOF/MS was compared with 1 dimensional GC–TOF/MS. Comparisons of the paired data from the various laboratories were made with a single data processing and analysis method providing an unbiased assessment of analytical method variants and inter-laboratory reproducibility. A range of processing and statistical methods were also assessed with a single exemplary dataset revealing near equal performance between them. Further investigations of long-term reproducibility are required, though the future generation of global and valid metabolomics databases offers much promise

Crossref

Springer - Publisher Connector

University of Birmingham Research Portal

PubMed Central

Wageningen University & Research Publications

The University of Manchester - Institutional Repository

MPG.PuRe

Network Inference Algorithms Elucidate Nrf2 Regulation of Mouse Lung Oxidative Stress

Author: A Jacquier
A Otomo
AA Margolin
AA Margolin
AK Jaiswal
AK Jaiswal
B Biteau
C-C Chang
CJ Reed
CM Clements
CO Daub
D Giustarini
Deepti Malhotra
DJ Moore
EY Park
George Acquaah-Mensah
GK Acquaah-Mensah
H Cai
H Ohkawa
I Nagano
I Priness
I Rahman
IH Witten
J Choi
JJ Faith
K Basso
K Itoh
K Iwasaki
M Kanehisa
M Matsuoka
M Singhal
MM Gallogly
Mudita Singhal
N Christianni
N Slonim
N Watanabe
P Shannon
PL Whitney
R Venugopal
R Venugopal
RA Irizarry
RC Taylor
RC Taylor
RG Will
RK Thimmulappa
Ronald C. Taylor
Ruth Nussinov
S Hadano
S Mead
SE Keene
Shyam Biswal
T Rangasamy
TM Cover
U Alon
U Alon
V Bonifati
VJ Findlay
W Droge
W Zhou
WW Wasserman
XL Chen
Y El-Manzalawy
Y Katoh
Y Li
Publication venue: Public Library of Science
Publication date: 01/08/2008
Field of study

A variety of cardiovascular, neurological, and neoplastic conditions have been associated with oxidative stress, i.e., conditions under which levels of reactive oxygen species (ROS) are elevated over significant periods. Nuclear factor erythroid 2-related factor (Nrf2) regulates the transcription of several gene products involved in the protective response to oxidative stress. The transcriptional regulatory and signaling relationships linking gene products involved in the response to oxidative stress are, currently, only partially resolved. Microarray data constitute RNA abundance measures representing gene expression patterns. In some cases, these patterns can identify the molecular interactions of gene products. They can be, in effect, proxies for protein–protein and protein–DNA interactions. Traditional techniques used for clustering coregulated genes on high-throughput gene arrays are rarely capable of distinguishing between direct transcriptional regulatory interactions and indirect ones. In this study, newly developed information-theoretic algorithms that employ the concept of mutual information were used: the Algorithm for the Reconstruction of Accurate Cellular Networks (ARACNE), and Context Likelihood of Relatedness (CLR). These algorithms captured dependencies in the gene expression profiles of the mouse lung, allowing the regulatory effect of Nrf2 in response to oxidative stress to be determined more precisely. In addition, a characterization of promoter sequences of Nrf2 regulatory targets was conducted using a Support Vector Machine classification algorithm to corroborate ARACNE and CLR predictions. Inferred networks were analyzed, compared, and integrated using the Collective Analysis of Biological Interaction Networks (CABIN) plug-in of Cytoscape. Using the two network inference algorithms and one machine learning algorithm, a number of both previously known and novel targets of Nrf2 transcriptional activation were identified. Genes predicted as novel Nrf2 targets include Atf1, Srxn1, Prnp, Sod2, Als2, Nfkbib, and Ppp1r15b. Furthermore, microarray and quantitative RT-PCR experiments following cigarette-smoke-induced oxidative stress in Nrf2+/+ and Nrf2−/− mouse lung affirmed many of the predictions made. Several new potential feed-forward regulatory loops involving Nrf2, Nqo1, Srxn1, Prdx1, Als2, Atf1, Sod1, and Park7 were predicted. This work shows the promise of network inference algorithms operating on high-throughput gene expression data in identifying transcriptional regulatory and other signaling relationships implicated in mammalian disease

Crossref

Directory of Open Access Journals

PubMed Central

Towards a System Level Understanding of Non-Model Organisms Sampled from the Environment: A Network Biology Approach

Author: A Conesa
A Koehler
A Pistocchi
AD Southam
AI Saeed
AM Diab
Amer M. Diab
AP Davis
B Bierie
B Fricke
B Santiago-Josefat
B Sun
BP Lyons
BP Lyons
Brett P. Lyons
Carolynn Mackenzie
Christian von Mering
CO Daub
EG Bligh
F Falciani
FJ Warner
Francesco Falciani
G Dennis Jr
G Stentiford
G Van Aggelen
GD Stentiford
Grant D. Stentiford
H Wu
H Yoshiji
HJ Small
Huifeng Wu
I Katsiadaki
Ioanna Katsiadaki
J Felsenstein
JA Roling
JM Herbert
John B. Taggart
John M. Herbert
Joseph K. Abraham
JR Jonsson
K Abraham
K Basso
K Kiersch
Katie L. Bartie
Kevin J. Chipman
M Fernandez
M Raymond
MA Fisher
Mark R. Viant
ME Baker
MF Kirby
Michael J. Leaver
MJ Leaver
MJ Leaver
MJ Leaver
N Bluthgen
N Tijet
Nil Turan
Olga Hrydziuszko
P Shannon
P Thangavel
PA Farazi
PF Larsen
PF Larsen
PM Blumberg
PW Moran
R Yazawa
S Golotvin
S Gotz
SB Wiseman
SE Hook
SG George
Stephen G. George
SW Feist
T Andoh
T Maass
TD Williams
TD Williams
TD Williams
Tim D. Williams
V Trevino
W Huang da
WE Johnson
X Huang
Y Benjamini
Y Yamamoto
Publication venue: Public Library of Science
Publication date: 01/08/2011
Field of study

The acquisition and analysis of datasets including multi-level omics and physiology from non-model species, sampled from field populations, is a formidable challenge, which so far has prevented the application of systems biology approaches. If successful, these could contribute enormously to improving our understanding of how populations of living organisms adapt to environmental stressors relating to, for example, pollution and climate. Here we describe the first application of a network inference approach integrating transcriptional, metabolic and phenotypic information representative of wild populations of the European flounder fish, sampled at seven estuarine locations in northern Europe with different degrees and profiles of chemical contaminants. We identified network modules, whose activity was predictive of environmental exposure and represented a link between molecular and morphometric indices. These sub-networks represented both known and candidate novel adverse outcome pathways representative of several aspects of human liver pathophysiology such as liver hyperplasia, fibrosis, and hepatocellular carcinoma. At the molecular level these pathways were linked to TNF alpha, TGF beta, PDGF, AGT and VEGF signalling. More generally, this pioneering study has important implications as it can be applied to model molecular mechanisms of compensatory adaptation to a wide range of scenarios in wild populations

Crossref

Stirling Online Research Repository (RIOXX)

University of Birmingham Research Portal

Directory of Open Access Journals

PubMed Central

Institutional Repository of Yantai Institute of Coastal Zone Research, CAS

Stirling Online Research Repository

Comparative Genomics of Cell Envelope Components in Mycobacteria

Author: A Alahari
A Treumann
AK Arakaki
AK Azad
AK Azad
AR Flores
C Delmas
C Vignal
CC Huang
CM Sassetti
CM Sassetti
CO Daub
D Barkan
D Barkan
D Kaur
D Kaur
DA Benson
DE Minnikin
DJ Beste
E Dubnau
F Ripoll
G Huet
H Gebhardt
H Målen
H Nikaido
H Rachman
H Scherman
H Zheng
J Buglino
J Felsenstein
J Liu
JC Camus
K Raman
K Raman
K Takayama
K Tamura
KB Arnvig
KC Onwueme
L Guenin-Macé
L Li
Li Kuo-Bin
LR Camacho
LS Meena
M Daffe
M Jackson
M Jackson
M Joe
M Monot
M Pellegrini
M Seki
MA Behr
MS Glickman
MS Glickman
MS Glickman
OA Trivedi
OA Trivedi
Olivier Neyrolles
P Dinadayala
Pankaj Vats
PD Karp
PJ Woodruff
PR Marri
R Brosch
R Caspi
R Jothi
R Siméone
Rajendra Joshi
RD Fleischmann
RM Goldstone
RP Morris
Ruma Banerjee
S Agarwal
S Hasan
SK Parker
SL Kinnings
Sonal Dahale
ST Cole
Sunitha Manjari Kasibhatla
SV Date
SV Date
T Dos Vultos
T Garnier
TB Reddy
TJ Erb
TP Stinear
TP Stinear
V Bhowruth
V Rao
V Vissa
VD Vissa
WB Turnbull
WR Pearson
Y Guerardel
Y Guérardel
Y Yuan
Y Yuan
Y Yuan
Y Zuo
Publication venue: Public Library of Science
Publication date: 01/05/2011
Field of study

Mycobacterial cell envelope components have been a major focus of research due to their unique features that confer intrinsic resistance to antibiotics and chemicals apart from serving as a low-permeability barrier. The complex lipids secreted by Mycobacteria are known to evoke/repress host-immune response and thus contribute to its pathogenicity. This study focuses on the comparative genomics of the biosynthetic machinery of cell wall components across 21-mycobacterial genomes available in GenBank release 179.0. An insight into survival in varied environments could be attributed to its variation in the biosynthetic machinery. Gene-specific motifs like ‘DLLAQPTPAW’ of ufaA1 gene, novel functional linkages such as involvement of Rv0227c in mycolate biosynthesis; Rv2613c in LAM biosynthesis and Rv1209 in arabinogalactan peptidoglycan biosynthesis were detected in this study. These predictions correlate well with the available mutant and coexpression data from TBDB. It also helped to arrive at a minimal functional gene set for these biosynthetic pathways that complements findings using TraSH

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Endothelial progenitor cells inhibit platelet function in a P-selectin-dependent manner

Author: Ahmed Hachem
AK Ranjan
C Kalka
C Urbich
C Urbich
CS Rinder
D Kong
D Yacoub
Daniel Yacoub
DP Griese
E Chavakis
EI Lev
G Davi
GA Kunz
GB Nash
H Abou-Saleh
H Langer
Haissam Abou-Saleh
HC Boer de
J Aoki
J George
J Rehman
J Yamaguchi
K Daub
K Larsen
K Stellos
K Stellos
M Co
M Hristov
M Miglionico
Marc-Antoine Gillis
ML Henry
O Raz
PE Stenberg
S Kaushal
S Konstantinides
S Massberg
S Wassmann
S Yokoyama
SJ Goldenberg
T Asahara
T Kinnaird
T Kinnaird
T Shirota
T Shirota
TF Luscher
U Mayr
W Feng
XQ Li
Y Jang
Y Ozeki
Yahye Merhi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref