Search CORE

17 research outputs found

Accounting for Redundancy when Integrating Gene Interaction Databases

Author: A Beyer
A Ruepp
AC Gavin
AJ Walhout
Andreas Beyer
Antigoni Elefsinioti
B Snel
D Rhodes
DR Rhodes
DR Rhodes
EM Marcotte
F Ramirez
GD Bader
H Yu
I Lee
I Lee
J Albert
J Friedman
J McDermott
J Rual
K Venkatesan
L Giot
LJ Jensen
LJ Lu
LR Matthews
M Guerquin
MA Harris
Marit Ackermann
MPH Stumpf
NJ Mulder
P Bork
P Uetz
R Ewing
R Jansen
R Jansen
Raya Khanin
RE Schapire
S Kerrien
S Li
T Ito
U Stelzl
Y Ho
Publication venue: Public Library of Science
Publication date: 22/10/2009
Field of study

During the last years gene interaction networks are increasingly being used for the assessment and interpretation of biological measurements. Knowledge of the interaction partners of an unknown protein allows scientists to understand the complex relationships between genetic products, helps to reveal unknown biological functions and pathways, and get a more detailed picture of an organism's complexity. Being able to measure all protein interactions under all relevant conditions is virtually impossible. Hence, computational methods integrating different datasets for predicting gene interactions are needed. However, when integrating different sources one has to account for the fact that some parts of the information may be redundant, which may lead to an overestimation of the true likelihood of an interaction. Our method integrates information derived from three different databases (Bioverse, HiMAP and STRING) for predicting human gene interactions. A Bayesian approach was implemented in order to integrate the different data sources on a common quantitative scale. An important assumption of the Bayesian integration is independence of the input data (features). Our study shows that the conditional dependency cannot be ignored when combining gene interaction databases that rely on partially overlapping input data. In addition, we show how the correlation structure between the databases can be detected and we propose a linear model to correct for this bias. Benchmarking the results against two independent reference data sets shows that the integrated model outperforms the individual datasets. Our method provides an intuitive strategy for weighting the different features while accounting for their conditional dependencies

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Bayesian Inference for Genomic Data Integration Reduces Misclassification Rate in Predicting Protein-Protein Interactions

Author: A Elefsinioti
A Valencia
AJ Enright
AK Ramani
AL Hopkins
BA Shoemaker
C von Mering
C von Mering
CC Wu
Christos A. Ouzounis
Chuanhua Xing
CS Goh
David B. Dunson
DB Dunson
DR Rhodes
EC Butcher
EM Marcotte
F Browne
F Pazos
GT Hart
H Huang
H Ishwaran
H Yu
I Lee
IW Taylor
J Saric
J Sun
JS Bader
L Hakes
L Hood
L Lu
LJ Jensen
LJ Lu
LV Zhang
M Huang
M Persico
MA Yildirim
MP Brown
MS Scott
N Lin
OG Troyanskaya
P Aloy
P Bork
P Pagel
P Sham
R Chowdhary
R Jansen
R Malik
R Mrowka
S Dolma
S Kim
S Tsoka
SV Date
Y Qi
Y Qi
Publication venue: Public Library of Science
Publication date: 01/07/2011
Field of study

Protein-protein interactions (PPIs) are essential to most fundamental cellular processes. There has been increasing interest in reconstructing PPIs networks. However, several critical difficulties exist in obtaining reliable predictions. Noticeably, false positive rates can be as high as >80%. Error correction from each generating source can be both time-consuming and inefficient due to the difficulty of covering the errors from multiple levels of data processing procedures within a single test. We propose a novel Bayesian integration method, deemed nonparametric Bayes ensemble learning (NBEL), to lower the misclassification rate (both false positives and negatives) through automatically up-weighting data sources that are most informative, while down-weighting less informative and biased sources. Extensive studies indicate that NBEL is significantly more robust than the classic naïve Bayes to unreliable, error-prone and contaminated data. On a large human data set our NBEL approach predicts many more PPIs than naïve Bayes. This suggests that previous studies may have large numbers of not only false positives but also false negatives. The validation on two human PPIs datasets having high quality supports our observations. Our experiments demonstrate that it is feasible to predict high-throughput PPIs computationally with substantially reduced false positives and false negatives. The ability of predicting large numbers of PPIs both reliably and automatically may inspire people to use computational approaches to correct data errors in general, and may speed up PPIs prediction with high quality. Such a reliable prediction may provide a solid platform to other studies such as protein functions prediction and roles of PPIs in disease susceptibility

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Combination of novel and public RNA-seq datasets to generate an mRNA expression atlas for the domestic chicken

Author: A Alexa
A Aziz
A Balic
A Celada
A Chatr-Aryamontri
A Chatr-aryamontri
A Conesa
A Diaz-Perales
A Elefsinioti
A Esteve-Codina
A Joshi
A Kranis
A Psifidi
A So
A Theocharidis
A Van Goor
AC Long
AJ Vilella
Amanda J. MacCallum
Androniki Psifidi
AR Forrest
B Glick
B Strasser
BO Fabriek
BR Johnson
C Furusawa
C Garcia-Morales
C Wasmeier
C Wu
C-F Le
CD Stern
Chunlei Wu
CJ Langouet-Astrie
CP Zeferino
CW Resnyk
Cyrus Afrasiabi
D Brawand
D Günzel
D Han
D Risso
D-D Wu
DA Hume
DA Hume
David A. Hume
DJ Lynn
DJ Lynn
DR Rhodes
E Arner
E Eising
E González
EL Clark
EL Gautier
EL van Dijk
EM Pritchett
ET Richardson
F Bangs
F He
F Wang
G Frühbeck
G Zhu
GA Pavlopoulos
GD Bader
GD Plowman
H Hermjakob
HA Eckelhoefer
HH Cheng
I Gallego Romero
I Galvan
J Lopes Ricardo
J Merkin
J Smith
J Zhou
Jacqueline Smith
Jenny O’Dell
JF Reiter
JY Han
JY Kim
K Muret
K Pazdrak
K Piórkowska
KD Hansen
KD Pruitt
Kim M. Summers
KM Summers
KR Brown
L Alibardi
L Huminiecki
L Opitz
L Salwinski
L Taylor
L X-d
LM Quinn
Lucy Freem
M Kotlyar
M Kotlyar
M Lizio
M Stauber
M Sultan
M Takeda
MA Quail
Mark P. Stevens
ME Woodcock
MK Chang
MM Song
NA Mabbott
NA O'Leary
NC Johnson
NL Bray
P Kovarik
P Wu
P-F Roux
PH Sudmant
PJ Balwierz
PJ Seear
QC Zhang
R Andersson
R Deviatiiarov
R Feng
R Jansen
R Kapetanovic
R Kist
R Rodriguez-Manzanet
R Sinha
RI Kuo
RJ Kinsella
RS Holmes
S Chhangawala
S Epelman
S Intarapat
S Li
S Lin
S Oliver
S Roosing
S Tarazona
S Tornow
S van Dongen
S Zhao
S-A Lee
SF Altschul
SJ Bush
SM Carpanini
Stephen J. Bush
T Lu
TC Freeman
TC Freeman
TN Doig
TP van Gurp
TS Keshava Prasad
TX Jiang
U Coppola
V Curwen
V Garceau
X Adiconis
X Li
X Shen
X Su
Y Kodama
Y Liu
Y Wang
Y Yin
Z Bar-Joseph
Z Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2018
Field of study

Background: The domestic chicken (Gallus gallus) is widely used as a model in developmental biology and is also an important livestock species. We describe a novel approach to data integration to generate an mRNA expression atlas for the chicken spanning major tissue types and developmental stages, using a diverse range of publicly-archived RNA-seq datasets and new data derived from immune cells and tissues. Results: Randomly down-sampling RNA-seq datasets to a common depth and quantifying expression against a reference transcriptome using the mRNA quantitation tool Kallisto ensured that disparate datasets explored comparable transcriptomic space. The network analysis tool Graphia was used to extract clusters of co-expressed genes from the resulting expression atlas, many of which were tissue or cell-type restricted, contained transcription factors that have previously been implicated in their regulation, or were otherwise associated with biological processes, such as the cell cycle. The atlas provides a resource for the functional annotation of genes that currently have only a locus ID. We cross-referenced the RNA-seq atlas to a publicly available embryonic Cap Analysis of Gene Expression (CAGE) dataset to infer the developmental time course of organ systems, and to identify a signature of the expansion of tissue macrophage populations during development. Conclusion: Expression profiles obtained from public RNA-seq datasets - despite being generated by different laboratories using different methodologies - can be made comparable to each other. This meta-analytic approach to RNA-seq can be extended with new datasets from novel tissues, and is applicable to any species

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer

Oxford University Research Archive

University of Queensland eSpace

Efficient prediction of human protein-protein interactions at a global scale

Author: A Ben-Hur
A Bithell
A Chambers
A De Siervi
A Elefsinioti
A Gursoy
A Ozgur
A Stein
AB Mak
AC Gavin
AL Benko
Alex Wong
Andrew Schoenrock
Ashkan Golshani
Bahram Samanfar
C Bron
C Chica
C Sanchez
C von Mering
Charles A Phillips
CY Yu
D Dorsett
D Shenton
D Zheng
DF Easton
E Petsalaki
E Revenkova
EJ Chesler
F Hans
Frank Dehne
Fredrik Barrenäs
G Lucchini
H Jeong
H Wang
H Wang
H Yu
H Yu
Hui Wang
I Feldman
I Stansfield
J Bousquet
J Bousquet
J Chen
J Wu
JA Kim
James R Green
JD Eblen
JJ Garcia-Gomez
JM Hall
JM Peters
Katayoun Omidi
Ke Jin
KI Goh
L Schenk
M Alamgir
M Benson
M Carmena
M Fabbro
M Girvan
M Jessulat
M Jessulat
M Ouchi
M Shrivastav
MA Langston
Md Alamgir
MD McDowall
Michael A Langston
Mikael Benson
MJ Vogel
Mohan Babu
Mohsen Hooshyar
MW Pfaffl
N Yosef
NJ Krogan
NJ Krogan
P Uetz
PS Rao
QC Zhang
QC Zhang
R Albert
R Margueron
R Zagozdzon
RA Young
RI Yarden
RK Nibbe
S Lievens
S Pitre
S Pitre
S Pitre
S Pitre
S Ren
S Ryser
S Yu
Sadhna Phanse
SH Khan
Sylvain Pitre
T Aas
T Ito
T Kislinger
V Neduva
V Neduva
X Yu
X Zhang
Y Houvras
Y Park
Y Park
Y Qi
Yuan Gui
Z Dezso
Z Ni
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Large-scale De Novo Prediction of Physical Protein-Protein Association

Author: Beyer A.
Elefsinioti A.
Hegele A.
Hubner N. C.
Hyman A.
Mann M.
Plake C.
Poser I.
Sarac O. S.
Sarov M.
Schroeder M.
Stelzl U.
Publication venue: AMER SOC BIOCHEMISTRY MOLECULAR BIOLOGY INC
Publication date: 01/01/2011
Field of study

Information about the physical association of proteins is extensively used for studying cellular processes and disease mechanisms. However, complete experimental mapping of the human interactome will remain prohibitively difficult in the near future. Here we present a map of predicted human protein interactions that distinguishes functional association from physical binding. Our network classifies more than 5 million protein pairs predicting 94,009 new interactions with high confidence. We experimentally tested a subset of these predictions using yeast two-hybrid analysis and affinity purification followed by quantitative mass spectrometry. Thus we identified 462 new protein-protein interactions and confirmed the predictive power of the network. These independent experiments address potential issues of circular reasoning and are a distinctive feature of this work. Analysis of the physical interactome unravels subnetworks mediating between different functional and physical subunits of the cell. Finally, we demonstrate the utility of the network for the analysis of molecular mechanisms of complex diseases by applying it to genome-wide association studies of neurodegenerative diseases. This analysis provides new evidence implying TOMM40 as a factor involved in Alzheimer's disease. The network provides a high-quality resource for the analysis of genomic data sets and genetic association studies in particular. Our interactome is available via the hPRINT web server at: www.print-db.org

PubMed Central

MDC Repository

MPG.PuRe

Circular RNAs are a large class of animal RNAs with regulatory potency

Author: Elefsinioti A.
Gregersen L.H.
Jens M.
Kocks C.
Krueger J.
Landthaler M.
le Noble F.
Loewer A.
Mackowiak S.D.
Maier L.
Memczak S.
Munschauer M.
Rajewsky N.
Rybak A.
Torti F.
Ziebold U.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Circular RNAs (circRNAs) in animals are an enigmatic class of RNA with unknown function. To explore circRNAs systematically, we sequenced and computationally analysed human, mouse and nematode RNA. We detected thousands of well-expressed, stable circRNAs, often showing tissue/developmental-stage-specific expression. Sequence analysis indicated important regulatory functions for circRNAs. We found that a human circRNA, antisense to the cerebellar degeneration-related protein 1 transcript (CDR1as), is densely bound by microRNA (miRNA) effector complexes and harbours 63 conserved binding sites for the ancient miRNA miR-7. Further analyses indicated that CDR1as functions to bind miR-7 in neuronal tissues. Human CDR1as expression in zebrafish impaired midbrain development, similar to knocking down miR-7, suggesting that CDR1as is a miRNA antagonist with a miRNA-binding capacity ten times higher than any other known transcript. Together, our data provide evidence that circRNAs form a large class of post-transcriptional regulators. Numerous circRNAs form by head-to-tail splicing of exons, suggesting previously unrecognized regulatory potential of coding sequences

TUbiblio

Copenhagen University Research Information System

MDC Repository

Physicochemical properties of amino acid sequences of G-proteins for understanding GPCR-G-protein coupling

Author: A. L. Elefsinioti P. G. Bagos, I.
B. R. Conklin and H. R. Boume
Fumitsugu Akazawa
Ganga D. Ghimire
J. Drews
J. Wess
Kenichiro Imai
Masashi Sonoyama
N.G.. Sgourakis P. G.. Bagos and S
S. Mitaku and T. Hirokawa
Shigeki Mitaku
Toshiyuki Tsuji
V.P. Jaakoka J. Prilusky, J. L. Su
Y. Yabuki T. Muramatsu, T. Hirokaw
Publication venue: 'Chem-Bio Informatics Society'
Publication date: 01/01/2006
Field of study

Crossref

GPCRs, G-proteins, effectors and their interactions: human-gpDB, a database employing visualization tools and data integration techniques

Author: Attwood
Berman
C. K. Stampolakis
Cabrera-Vera
Cochrane
Elefsinioti
G. A. Pavlopoulos
Gene Ontology Consortium
Hamosh
Horn
Kanehisa
Kolakowski
Kristiansen
Lopez
M. C. Theodoropoulou
Maglott
McCudden
N. C. Papandreou
Oldham
P. G. Bagos
Pierce
R. Schneider
S. J. Hamodrakas
Sander
Schafferhans
Schuler
Smedley
The InterPro Consortium
V. P. Satagopam
Wishart
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2010
Field of study

G-protein coupled receptors (GPCRs) are a major family of membrane receptors in eukaryotic cells. They play a crucial role in the communication of a cell with the environment. Ligands bind to GPCRs on the outside of the cell, activating them by causing a conformational change, and allowing them to bind to G-proteins. Through their interaction with G-proteins, several effector molecules are activated leading to many kinds of cellular and physiological responses. The great importance of GPCRs and their corresponding signal transduction pathways is indicated by the fact that they take part in many diverse disease processes and that a large part of efforts towards drug development today is focused on them. We present Human-gpDB, a database which currently holds information about 713 human GPCRs, 36 human G-proteins and 99 human effectors. The collection of information about the interactions between these molecules was done manually and the current version of Human-gpDB holds information for about 1663 connections between GPCRs and G-proteins and 1618 connections between G-proteins and effectors. Major advantages of Human-gpDB are the integration of several external data sources and the support of advanced visualization techniques. Human-gpDB is a simple, yet a powerful tool for researchers in the life sciences field as it integrates an up-to-date, carefully curated collection of human GPCRs, G-proteins, effectors and their interactions. The database may be a reference guide for medical and pharmaceutical research, especially in the areas of understanding human diseases and chemical and drug discovery. Database URLs: http://schneider.embl.de/human_gpdb; http://bioinformatics.biol.uoa.gr/human_gpdb

Crossref

PubMed Central

Open Repository and Bibliography - Luxembourg

University of Thessaly Institutional Repository

Development and validation of whole genome-wide and genic microsatellite markers in oil palm (Elaeis guineensis Jacq.): First microsatellite database (OpSatdb)

Author: A Bairoch
A Hayati
A Riju
AL Elefsinioti
BK Babu
BK Babu
BK Babu
C Bakoumé
DJ Murphy
E Barcelos
FM Kanehisa
G Pandey
H Arabnezhad
H Sonah
JK Yu
JL Sussman
K Liu
L Zane
LS Mathew
M Krzywinski
M Sharma
MAS Iquebal
MG Murray
NC Ting
NM Zaki
R Singh
R Yasodha
S Gotz
S Jeennor
SS Deepika
T Thiel
WS Martins
Y Xiao
Z Tanya
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Deblender: a semi−/unsupervised multi-operational computational method for complete deconvolution of expression data from heterogeneous samples

Author: A Elefsinioti
A Janecek
A Kuhn
A Mortazavi
AG Vrahatis
AM Newman
AR Abbas
D Aran
D Repsilber
D Venet
DA Liebner
Elisabeth Wik
F Avila Cobos
G Quon
H Dueck
HS Park
Inge Jonassen
J Ahn
J Clarke
J Quackenbush
K Dimitrakopoulou
K Yoshihara
KE Parsopoulos
Konstantina Dimitrakopoulou
L Oesper
Lars A. Akslen
N Wang
N Wang
O Wolkenhauer
R Gaujoux
R Gaujoux
RA Moffitt
RO Stuart
SL Carter
SS Shen-Orr
T Erkkilä
T Gong
T Gong
V Kulasingam
V Onuchic
VK Yadav
W Qiao
X Dai
Y Kluger
Y Zhong
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref