Search CORE

508 research outputs found

A Knowledge Graph for Industry 4.0

Author: A Bassi
C Parent
JA Saucedo-Martínez
L Halilaj
LD Xu
M Färber
S Andreev
SR Bader
Publication venue: Cham : Springer
Publication date: 01/01/2020
Field of study

One of the most crucial tasks for today’s knowledge workers is to get and retain a thorough overview on the latest state of the art. Especially in dynamic and evolving domains, the amount of relevant sources is constantly increasing, updating and overruling previous methods and approaches. For instance, the digital transformation of manufacturing systems, called Industry 4.0, currently faces an overwhelming amount of standardization efforts and reference initiatives, resulting in a sophisticated information environment. We propose a structured dataset in the form of a semantically annotated knowledge graph for Industry 4.0 related standards, norms and reference frameworks. The graph provides a Linked Data-conform collection of annotated, classified reference guidelines supporting newcomers and experts alike in understanding how to implement Industry 4.0 systems. We illustrate the suitability of the graph for various use cases, its already existing applications, present the maintenance process and evaluate its quality

Crossref

Repositorium für Naturwissenschaften und Technik

MCL-CAw: A refinement of MCL for detecting yeast complexes from weighted PPI networks by incorporating core-attachment structure

Author: A Eberharter
A Mitrofanova
AC Gavin
AC Gavin
AD King
AJ Enright
B Breitkreutz
B Zhang
C Fridel
C Friedel
C von Mering
DF Seals
E Zotenko
EA Winzeler
G Giaever
G Hart
G Liu
G Liu
G Rigaut
GD Bader
H Cheng
H Chua
H Jeong
H Leung
H Wang
Hon Wai Leong
HW Mewes
J Hurwitz
J Zhao
JC Mellor
JD Han
JM Cherry
JS Luz
K Voevodski
Kang Ning
M Ashburner
M Wu
N Batada
NJ Krogan
P Aloy
P Carvalho
P Shannon
P Uetz
PA Grant
PA Grant
S Brohee
S Dongen
S Pu
S Pu
S Srihari
S Srihari
SR Collins
Sriganesh Srihari
T Ito
T Miller
X Zhou
Y Araki
Y Ho
Y Ozawa
Y Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Abstract Background The reconstruction of protein complexes from the physical interactome of organisms serves as a building block towards understanding the higher level organization of the cell. Over the past few years, several independent high-throughput experiments have helped to catalogue enormous amount of physical protein interaction data from organisms such as yeast. However, these individual datasets show lack of correlation with each other and also contain substantial number of false positives (noise). Over these years, several affinity scoring schemes have also been devised to improve the qualities of these datasets. Therefore, the challenge now is to detect meaningful as well as novel complexes from protein interaction (PPI) networks derived by combining datasets from multiple sources and by making use of these affinity scoring schemes. In the attempt towards tackling this challenge, the Markov Clustering algorithm (MCL) has proved to be a popular and reasonably successful method, mainly due to its scalability, robustness, and ability to work on scored (weighted) networks. However, MCL produces many noisy clusters, which either do not match known complexes or have additional proteins that reduce the accuracies of correctly predicted complexes. Results Inspired by recent experimental observations by Gavin and colleagues on the modularity structure in yeast complexes and the distinctive properties of "core" and "attachment" proteins, we develop a core-attachment based refinement method coupled to MCL for reconstruction of yeast complexes from scored (weighted) PPI networks. We combine physical interactions from two recent "pull-down" experiments to generate an unscored PPI network. We then score this network using available affinity scoring schemes to generate multiple scored PPI networks. The evaluation of our method (called MCL-CAw) on these networks shows that: (i) MCL-CAw derives larger number of yeast complexes and with better accuracies than MCL, particularly in the presence of natural noise; (ii) Affinity scoring can effectively reduce the impact of noise on MCL-CAw and thereby improve the quality (precision and recall) of its predicted complexes; (iii) MCL-CAw responds well to most available scoring schemes. We discuss several instances where MCL-CAw was successful in deriving meaningful complexes, and where it missed a few proteins or whole complexes due to affinity scoring of the networks. We compare MCL-CAw with several recent complex detection algorithms on unscored and scored networks, and assess the relative performance of the algorithms on these networks. Further, we study the impact of augmenting physical datasets with computationally inferred interactions for complex detection. Finally, we analyse the essentiality of proteins within predicted complexes to understand a possible correlation between protein essentiality and their ability to form complexes. Conclusions We demonstrate that core-attachment based refinement in MCL-CAw improves the predictions of MCL on yeast PPI networks. We show that affinity scoring improves the performance of MCL-CAw.http://deepblue.lib.umich.edu/bitstream/2027.42/78256/1/1471-2105-11-504.xmlhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/2/1471-2105-11-504-S1.PDFhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/3/1471-2105-11-504-S2.ZIPhttp://deepblue.lib.umich.edu/bitstream/2027.42/78256/4/1471-2105-11-504.pdfPeer Reviewe

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Deep Blue Documents at the University of Michigan

ScholarBank@NUS

Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences

Markov clustering versus affinity propagation for the partitioning of protein interaction graphs

Author: AC Gavin
AC Gavin
AK Jain
B Alberts
BJ Frey
BJ Frey
C Stark
E Pieroni
GD Bader
H Chipman
H Yu
J MacQueen
J Vlasblom
James Vlasblom
M Blatt
ME Cusick
MJ Brusco
N Johnsson
NJ Krogan
P Shannon
R Sharan
S Bader
S Brohee
S Charbonnier
S Fields
S Lloyd
S Pu
S Pu
S van Dongen
SH Yook
Shoshana J Wodak
SR Collins
T Formosa
T Hastie
TE Ideker
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Genome scale data on protein interactions are generally represented as large networks, or graphs, where hundreds or thousands of proteins are linked to one another. Since proteins tend to function in groups, or complexes, an important goal has been to reliably identify protein complexes from these graphs. This task is commonly executed using clustering procedures, which aim at detecting densely connected regions within the interaction graphs. There exists a wealth of clustering algorithms, some of which have been applied to this problem. One of the most successful clustering procedures in this context has been the Markov Cluster algorithm (MCL), which was recently shown to outperform a number of other procedures, some of which were specifically designed for partitioning protein interactions graphs. A novel promising clustering procedure termed Affinity Propagation (AP) was recently shown to be particularly effective, and much faster than other methods for a variety of problems, but has not yet been applied to partition protein interaction graphs. Results In this work we compare the performance of the Affinity Propagation (AP) and Markov Clustering (MCL) procedures. To this end we derive an unweighted network of protein-protein interactions from a set of 408 protein complexes from <it>S. cervisiae </it>hand curated in-house, and evaluate the performance of the two clustering algorithms in recalling the annotated complexes. In doing so the parameter space of each algorithm is sampled in order to select optimal values for these parameters, and the robustness of the algorithms is assessed by quantifying the level of complex recall as interactions are randomly added or removed to the network to simulate noise. To evaluate the performance on a weighted protein interaction graph, we also apply the two algorithms to the consolidated protein interaction network of <it>S. cerevisiae</it>, derived from genome scale purification experiments and to versions of this network in which varying proportions of the links have been randomly shuffled. Conclusion Our analysis shows that the MCL procedure is significantly more tolerant to noise and behaves more robustly than the AP algorithm. The advantage of MCL over AP is dramatic for unweighted protein interaction graphs, as AP displays severe convergence problems on the majority of the unweighted graph versions that we tested, whereas MCL continues to identify meaningful clusters, albeit fewer of them, as the level of noise in the graph increases. MCL thus remains the method of choice for identifying protein complexes from binary interaction networks.</p

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Predicting Quantitative Genetic Interactions by Means of Sequential Matrix Approximation

Author: A Beyer
A Hintze
AH Tong
Aki P. Järvinen
B Lehner
BL Drees
C Boone
D Segrè
ER DeLong
GD Bader
I Ulitsky
I Ulitsky
J De Leeuw
JL Badano
JL Hartman
Jukka Hiissa
L Decourty
Laura L. Elo
M Schuldiner
O Dror
P Pudil
P Ye
R Kelley
R Mani
RJ Taylor
RP St Onge
S Axler
S Bandyopadhyay
Shin-Han Shiu
SL Ooi
SL Wong
SR Collins
SR Collins
Tero Aittokallio
X Pan
Publication venue: Public Library of Science
Publication date: 26/09/2008
Field of study

Despite the emerging experimental techniques for perturbing multiple genes and measuring their quantitative phenotypic effects, genetic interactions have remained extremely difficult to predict on a large scale. Using a recent high-resolution screen of genetic interactions in yeast as a case study, we investigated whether the extraction of pertinent information encoded in the quantitative phenotypic measurements could be improved by computational means. By taking advantage of the observation that most gene pairs in the genetic interaction screens have no significant interactions with each other, we developed a sequential approximation procedure which ranks the mutation pairs in order of evidence for a genetic interaction. The sequential approximations can efficiently remove background variation in the double-mutation screens and give increasingly accurate estimates of the single-mutant fitness measurements. Interestingly, these estimates not only provide predictions for genetic interactions which are consistent with those obtained using the measured fitness, but they can even significantly improve the accuracy with which one can distinguish functionally-related gene pairs from the non-interacting pairs. The computational approach, in general, enables an efficient exploration and classification of genetic interactions in other studies and systems as well

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Resolving the structure of interactomes with hierarchical agglomerative clustering

Author: A Clauset
A Clauset
A Clauset
AM Cuervo
B Ravikumar
C He
C Stark
CG Rivera
DS Goldberg
E Airoldi
G Palla
GD Bader
H Huang
H Yu
H Zhang
J Qiu
JDJ Han
JM Hofman
Joel S Bader
K Heller
K Henderson
L Royer
M Costanzo
MEJ Newman
MEJ Newman
N Mizushima
P Ye
R Kelley
RE Kass
S Fortunato
SR Pfeffer
UV Luxburg
V Spirin
WW Zachary
X Pan
Y Park
Y Qi
Yongjin Park
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Discovery and Expansion of Gene Modules by Seeking Isolated Groups in a Random Graph Process

Author: A Beyer
C Schluter
E Hartuv
EA Winzeler
Elizabeth Conibear
Eshel Ben-Jacob
G Giaever
G Milligan
GD Bader
Jennifer Bryan
Jochen Brumm
L Kiemer
MA Wong
O Rinner
P Shannon
R Tibshirani
RF Ling
RO Duda
S Brohee
S van Dongen
SR Collins
TI Lee
W Huber
W Lee
W Stuetzle
W Stuetzle
Wyeth W. Wasserman
Publication venue: Public Library of Science
Publication date: 09/10/2008
Field of study

BACKGROUND: A central problem in systems biology research is the identification and extension of biological modules-groups of genes or proteins participating in a common cellular process or physical complex. As a result, there is a persistent need for practical, principled methods to infer the modular organization of genes from genome-scale data. RESULTS: We introduce a novel approach for the identification of modules based on the persistence of isolated gene groups within an evolving graph process. First, the underlying genomic data is summarized in the form of ranked gene-gene relationships, thereby accommodating studies that quantify the relevant biological relationship directly or indirectly. Then, the observed gene-gene relationship ranks are viewed as the outcome of a random graph process and candidate modules are given by the identifiable subgraphs that arise during this process. An isolation index is computed for each module, which quantifies the statistical significance of its survival time. CONCLUSIONS: The Miso (module isolation) method predicts gene modules from genomic data and the associated isolation index provides a module-specific measure of confidence. Improving on existing alternative, such as graph clustering and the global pruning of dendrograms, this index offers two intuitively appealing features: (1) the score is module-specific; and (2) different choices of threshold correlate logically with the resulting performance, i.e. a stringent cutoff yields high quality predictions, but low sensitivity. Through the analysis of yeast phenotype data, the Miso method is shown to outperform existing alternatives, in terms of the specificity and sensitivity of its predictions

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Integrating diverse biological and computational sources for reliable protein-protein interactions

Author: A Ben-Hur
A Ben-Hur
A Gavin
A Grigoriev
A Patil
A Stein
B Raghavachari
C Deane
C Stark
C von Mering
Chee-Keong Kwoh
D Goldberg
G Liu
GD Bader
GD Bader
GT Hart
H Huang
HN Chua
HN Chua
Hon Nian Chua
I Donaldson
I Ispolatov
J Chen
J Wang
JS Bader
K Tarassov
L Salwinski
Min Wu
N Krogan
N Zaki
P Pei
PY Chen
R Gentleman
R Jansen
R Kelley
R Saito
R Saito
RD Finn
See-Kiong Ng
SR Collins
T Joachims
VN Vapnik
Xiaoli Li
XL Li
XL Li
XL Li
XL Li
Publication venue: BioMed Central
Publication date
Field of study

Crossref

PubMed Central

Identifying efficient solutions via simulation: myopic multi-objective budget allocation for the bi-objective case

Author: B Shahriari
CH Chen
CH Chen
D He
E Zitzler
E Zitzler
G Feldman
I Ryzhov
J Bader
J Branke
J Butler
J Gittins
Juergen Branke
LH Lee
LH Lee
M Birattari
M DeGroot
M Fu
N Beume
P Frazier
P Frazier
R Pasupathy
RL Keeney
S Andradottir
S Chick
S Chick
S Chick
S Hunter
S Teng
SR Hunter
T Zhang
V Mattila
Wen Zhang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Simulation optimisation offers great opportunities in the design and optimisation of complex systems. In the presence of multiple objectives, there is usually no single solution that performs best on all objectives. Instead, there are several Pareto-optimal (efficient) solutions with different trade-offs which cannot be improved in any objective without sacrificing performance in another objective. For the case where alternatives are evaluated on multiple stochastic criteria, and the performance of an alternative can only be estimated via simulation, we consider the problem of efficiently identifying the Pareto-optimal designs out of a (small) given set of alternatives. We present a simple myopic budget allocation algorithm for multi-objective problems and propose several variants for different settings. In particular, this myopic method only allocates one simulation sample to one alternative in each iteration. This paper shows how the algorithm works in bi-objective problems under different settings. Empirical tests show that our algorithm can significantly reduce the necessary simulation budget

University of Essex Research Repository

Crossref

Warwick Research Archives Portal Repository

Explore Bristol Research

Maximal Extraction of Biological Information from Genetic Interaction Data

Author: AH Tong
AM Dudley
BL Drees
D Segre
David J. Galas
DJ Galas
DR Shook
Gregory W. Carter
GW Carter
H Sinha
HD Madhani
JM Gancedo
Joel S. Bader
KB Lengeler
KD Entian
L Avery
LM Steinmetz
LV Zhang
M Ashburner
M Li
M Schuldiner
O Carlborg
PD Grunwald
R Kelley
R Milo
RJ Taylor
RP Onge
S Jana
SR Collins
T Ideker
Timothy Galitski
W Zhong
Publication venue: Public Library of Science
Publication date: 01/01/2009
Field of study

Targeted genetic perturbation is a powerful tool for inferring gene function in model organisms. Functional relationships between genes can be inferred by observing the effects of multiple genetic perturbations in a single strain. The study of these relationships, generally referred to as genetic interactions, is a classic technique for ordering genes in pathways, thereby revealing genetic organization and gene-to-gene information flow. Genetic interaction screens are now being carried out in high-throughput experiments involving tens or hundreds of genes. These data sets have the potential to reveal genetic organization on a large scale, and require computational techniques that best reveal this organization. In this paper, we use a complexity metric based in information theory to determine the maximally informative network given a set of genetic interaction data. We find that networks with high complexity scores yield the most biological information in terms of (i) specific associations between genes and biological functions, and (ii) mapping modules of co-functional genes. This information-based approach is an automated, unsupervised classification of the biological rules underlying observed genetic interactions. It might have particular potential in genetic studies in which interactions are complex and prior gene annotation data are sparse

Crossref

The Jackson Laboratory: The Mouseion at the JAXlibrary

Directory of Open Access Journals

PubMed Central

An Integrative Multi-Network and Multi-Classifier Approach to Predict Genetic Interactions

Author: Aaron N. Chang
AH Tong
AH Tong
AP Gasch
AP Jarvinen
Bin Zhang
C Boone
C Rodrigues-Pousada
Chad L. Myers
CL Myers
D Lin
DS McNabb
EA Winzeler
Eric E. Schadt
G Pandey
G Weiss
Gary D. Bader
Gaurav Pandey
IH Witten
J Zhu
JS Edwards
Jun Zhu
K Tan
KC Chipman
L Fernandes
L Royer
M Costanzo
M Kanehisa
PT Spellman
PW Lord
R Kelley
RB Brem
RO Duda
S Mnaimneh
S Onami
SF Altschul
SL Wong
SR Collins
SR Paladugu
SV Date
T Mitchell
T Nevitt
TG Dietterich
TR Hughes
Vipin Kumar
W Zhong
Y Qi
Y Tao
Z Hu
Publication venue: Public Library of Science
Publication date: 09/09/2010
Field of study

Genetic interactions occur when a combination of mutations results in a surprising phenotype. These interactions capture functional redundancy, and thus are important for predicting function, dissecting protein complexes into functional pathways, and exploring the mechanistic underpinnings of common human diseases. Synthetic sickness and lethality are the most studied types of genetic interactions in yeast. However, even in yeast, only a small proportion of gene pairs have been tested for genetic interactions due to the large number of possible combinations of gene pairs. To expand the set of known synthetic lethal (SL) interactions, we have devised an integrative, multi-network approach for predicting these interactions that significantly improves upon the existing approaches. First, we defined a large number of features for characterizing the relationships between pairs of genes from various data sources. In particular, these features are independent of the known SL interactions, in contrast to some previous approaches. Using these features, we developed a non-parametric multi-classifier system for predicting SL interactions that enabled the simultaneous use of multiple classification procedures. Several comprehensive experiments demonstrated that the SL-independent features in conjunction with the advanced classification scheme led to an improved performance when compared to the current state of the art method. Using this approach, we derived the first yeast transcription factor genetic interaction network, part of which was well supported by literature. We also used this approach to predict SL interactions between all non-essential gene pairs in yeast (http://sage.fhcrc.org/downloads/downloads/predicted_yeast_genetic_interactions.zip). This integrative approach is expected to be more effective and robust in uncovering new genetic interactions from the tens of millions of unknown gene pairs in yeast and from the hundreds of millions of gene pairs in higher organisms like mouse and human, in which very few genetic interactions have been identified to date

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central