
Boston University Institutional Repository (OpenBU)

NeAT: a toolbox for the analysis of biological networks, clusters, classes and pathways

Author: Babu
Bebek
Brohee
Croes
Deville
Enright
Fukuda
G. Lima-Mendez
G. Vanderstocken
Gagneur
Han
J. van Helden
Janky
Jeong
K. Faust
Krogan
Letovsky
Luscombe
Mewes
Milenkovic
O. Sand
Pereira-Leal
R. Janky
S. Brohee
Samuel Lattimore
Scott
Shannon
Sharan
Sprinzak
Uetz
von Mering
Y. Deville
Yook
Publication venue: Oxford University Press
Publication date: 01/01/2008

COMBREX: a project to accelerate the functional annotation of prokaryotic genomes

Author: B. P. Anton
Brenner
C. Delisi
D. Segre
D. Vitkup
Fleischmann
G. Housman
Galperin
Green
H.-P. Choi
Heurgue-Hamard
Hsiao
J. Guleria
J. N. Rachlin
K. R. Tao
L. L. Faller
L. Osmani
M. G. McGettrick
M. Steffen
N. Klitgord
R. J. Roberts
R. M. Pokrzywa
R. Swaminathan
Roberts
S. Kasif
S. L. Salzberg
S. Letovsky
V. Mazumdar
Y.-C. Chang
Z. Hu
Publication venue: Oxford University Press (OUP)
Publication date: 01/10/2010

DSpace@MIT

Boston University Institutional Repository (OpenBU)

Interaction site prediction by structural similarity to neighboring clusters in protein-protein interaction networks

Author: A Koike
A Porollo
A Sacan
A Thomas
A Vazquez
AJ Bordner
B Huang
B Schwikowski
DW Ritchie
GD Bader
H Hishigaki
Hiroyuki Monji
HX Zhou
I Res
I Xenarios
J Dundas
JR Bradford
K Kinoshita
L Salwinski
M Deng
P Fariselli
P Uetz
RA Laskowski
S Jones
S Letovsky
S Peri
Satoshi Koizumi
T Ito
Takenao Ohkawa
Tomonobu Ozaki
Y Kaneta
Y Ofran
Publication venue: BioMed Central
Publication date: 01/01/2011

Abstract. Background: Revealing the function of proteins using protein-protein interaction (PPI) networks has recently come to be regarded as one of the important issues in bioinformatics. With the development of experimental methods such as the yeast two-hybrid method, the volume of protein interaction data has grown rapidly. Many databases that handle these data comprehensively have been constructed and applied to the analysis of PPI networks. However, little research has explored predicting interaction sites by using PPI networks and 3D protein structures in a complementary way. Results: We propose a method for predicting interaction sites in proteins of unknown function by using both PPI networks and protein structures. For a target protein of unknown function, several clusters are extracted from its neighboring proteins based on their structural similarity. Interaction sites are then predicted by extracting similar sites from the group formed by a protein cluster and the target protein. Moreover, the proposed method can improve prediction accuracy by introducing a repetitive prediction process. Conclusions: The proposed method was applied to a small-scale dataset, and its effectiveness was confirmed. The challenge now is to apply the method to large-scale datasets.

Springer - Publisher Connector

Cape Town University OpenUCT

Scoring Protein Relationships in Functional Interaction Networks Predicted from Sequence Data

Author: A Vazquez
B Schwikowski
C von Mering
CE Shannon
Christophe Herman
CL Myers
D Devos
E Nabieva
G Subramanian
Gaston K. Mazandu
GRG Lanckriet
HN Chua
J Krawczyk
J Xiong
JCD Mackay
K Raman
K Tsuda
LJ Jensen
M Deng
M Li
MA Mahdavi
Nicola J. Mulder
NJ Mulder
O Bastian
OG Troyanskaya
P Baldi
PG Aaron
RVL Hartley
S Hunter
S Letovsky
S Yellaboina
SF Altschul
TM Murali
WR Pearson
X Mao
Y Chen
Y-R Cho
Publication venue: Public Library of Science
Publication date: 01/01/2010

The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functional connections between protein pairs, thus enabling functional inferences for uncharacterized proteins. These biological data are primarily in the form of sequences, which determine functions, although functional properties of a protein can often be predicted from just the domains it contains. Thus, protein sequences and domains can be used to predict protein pair-wise functional relationships, and thus contribute to the function prediction process of uncharacterized proteins in order to ensure that knowledge is gained from sequencing efforts. In this work, we introduce information-theoretic based approaches to score protein-protein functional interaction pairs predicted from protein sequence similarity and conserved protein signature matches. The proposed schemes are effective for data-driven scoring of connections between protein pairs. We applied these schemes to the Mycobacterium tuberculosis proteome to produce a homology-based functional network of the organism with a high confidence and coverage. We use the network for predicting functions of uncharacterised proteins

CiteSeerX

Public Library of Science (PLOS)

Bayesian Markov Random Field Analysis for Protein Function Prediction Based on Network Data

Author: A Kuzniar
A Vazquez
Aalt D. J. van Dijk
AJ Enright
C Moler
Cajo J. F. ter Braak
CJF Ter Braak
CM Federovitch
DJC MacKay
GD Bader
GR Lanckriet
H Lee
I Kosmidis
I Ulitsky
Iddo Friedberg
IM Cheeseman
J Besag
JA Hanley
L Milligan
L Peña Castillo
M Ashburner
M Deng
M Punta
Marco C. A. M. Bink
N Nariai
NJ Mulder
P McCullagh
R Sharan
RI Kondor
Roeland C. H. J. van Ham
S Ferré
S Geman
S Letovsky
S Mostafavi
SF Altschul
SR Collins
SZ Li
T Gabaldon
U Karaoz
V Vethantham
XL Chen
Y Chen
Y Guan
Yiannis A. I. Kourmpetis
Z Barutcuoglu
Z Wei
Publication venue: Public Library of Science
Publication date: 01/01/2010

Inference of protein functions is one of the most important aims of modern biology. To fully exploit the large volumes of genomic data typically produced in modern-day genomic experiments, automated computational methods for protein function prediction are urgently needed. Established methods use sequence or structure similarity to infer functions but those types of data do not suffice to determine the biological context in which proteins act. Current high-throughput biological experiments produce large amounts of data on the interactions between proteins. Such data can be used to infer interaction networks and to predict the biological process that the protein is involved in. Here, we develop a probabilistic approach for protein function prediction using network data, such as protein-protein interaction measurements. We take a Bayesian approach to an existing Markov Random Field method by performing simultaneous estimation of the model parameters and prediction of protein functions. We use an adaptive Markov Chain Monte Carlo algorithm that leads to more accurate parameter estimates and consequently to improved prediction performance compared to the standard Markov Random Fields method. We tested our method using a high quality S. cerevisiae validation network with 1622 proteins against 90 Gene Ontology terms of different levels of abstraction. Compared to three other protein function prediction methods, our approach shows very good prediction performance. Our method can be directly applied to protein-protein interaction or coexpression networks, but also can be extended to use multiple data sources. We apply our method to physical protein interaction data from S. cerevisiae and provide novel predictions, using 340 Gene Ontology terms, for 1170 unannotated proteins and we evaluate the predictions using the available literature

Wageningen University & Research Publications

Improving protein function prediction methods with integrated literature data

Author: A Karimpour-Fard
A Vazquez
A Vinayagam
Aaron P Gabow
AK Ramani
B Schwikowski
BTF Alako
C Brun
C von Mering
Debra S Goldberg
E Nabieva
HW Mewes
I Xenarios
J Rual
K Tsuda
L Hunter
L Tanabe
Lawrence E Hunter
M Ashburner
M Aubry
M Chagoyen
M Huynen
M Krallinger
M Pelligri
M Yetisgen-Yildiz
OG Troyanskaya
P Srinivasan
PM Bowers
R Cilibrasi
R Hoffmann
S Letovsky
S Raychaudhuri
Sonia M Leach
T Schlitt
T Tanabe
TK Jenssen
U Karaoz
William A Baumgartner
Y Ofran
Publication venue: BioMed Central
Publication date: 01/01/2008

Abstract. Background: Determining the function of uncharacterized proteins is a major challenge in the post-genomic era due to the problem's complexity and scale. Identifying a protein's function contributes to an understanding of its role in the involved pathways, its suitability as a drug target, and its potential for protein modifications. Several graph-theoretic approaches predict unidentified functions of proteins by using the functional annotations of better-characterized proteins in protein-protein interaction networks. We systematically consider the use of literature co-occurrence data, introduce a new method for quantifying the reliability of co-occurrence and test how performance differs across species. We also quantify changes in performance as the prediction algorithms annotate with increased specificity. Results: We find that including information on the co-occurrence of proteins within an abstract greatly boosts performance in the Functional Flow graph-theoretic function prediction algorithm in yeast, fly and worm. This increase in performance is not simply due to the presence of additional edges since supplementing protein-protein interactions with co-occurrence data outperforms supplementing with a comparably-sized genetic interaction dataset. Through the combination of protein-protein interactions and co-occurrence data, the neighborhood around unknown proteins is quickly connected to well-characterized nodes which global prediction algorithms can exploit. Our method for quantifying co-occurrence reliability shows superior performance to the other methods, particularly at threshold values around 10% which yield the best trade off between coverage and accuracy. In contrast, the traditional way of asserting co-occurrence when at least one abstract mentions both proteins proves to be the worst method for generating co-occurrence data, introducing too many false positives. Annotating the functions with greater specificity is harder, but co-occurrence data still proves beneficial. Conclusion: Co-occurrence data is a valuable supplemental source for graph-theoretic function prediction algorithms. A rapidly growing literature corpus ensures that co-occurrence data is a readily-available resource for nearly every studied organism, particularly those with small protein interaction databases. Though arguably biased toward known genes, co-occurrence data provides critical additional links to well-studied regions in the interaction network that graph-theoretic function prediction algorithms can exploit.

Springer - Publisher Connector

Public Library of Science (PLOS)

Protein Function Assignment through Mining Cross-Species Protein-Protein Interactions

Author: A Bateman
A Schlicker
A Vazquez
A Zanzoni
AJ Enright
B Schwikowski
C Brun
C Stark
EM Marcotte
F Ramirez
GD Bader
H Hishigaki
H Holzl
H Lee
HW Jacobs
HW Mewes
J McDermott
J Wojcik
JB Pereira-Leal
JZ Wang
K Tschop
KP O'Brien
L Salwinski
M Ashburner
M Deng
M Pellegrini
MA Crosby
MC Costanzo
Mei Liu
MO Lee
MP Brown
MY Galperin
N Nariai
OG Troyanskaya
P Gallant
P Resnik
PJ Ellis
R Apweiler
R Kraut
Robert Ward
S Letovsky
S Li
S Peri
SF Altschul
Sudhindra Gadagkar
T Pawson
V Spirin
W Poller
WR Pearson
Xue-wen Chen
XW Chen
Y Chen
Publication venue: Public Library of Science
Publication date: 01/01/2007

Background: As we move into the post genome-sequencing era, an immediate challenge is how to make best use of the large amount of high-throughput experimental data to assign functions to currently uncharacterized proteins. We here describe CSIDOP, a new method for protein function assignment based on shared interacting domain patterns extracted from cross-species protein-protein interaction data. Methodology/Principal Findings: The proposed method is assessed both biologically and statistically over the genome of H. sapiens. The CSIDOP method is capable of making protein function prediction with accuracy of 95.42 % using 2,972 gene ontology (GO) functional categories. In addition, we are able to assign novel functional annotations for 181 previously uncharacterized proteins in H. sapiens. Furthermore, we demonstrate that for proteins that are characterized by GO, the CSIDOP may predict extra functions. This is attractive as a protein normally executes a variety of functions in different processes and its current GO annotation may be incomplete. Conclusions/Significance: It can be shown through experimental results that the CSIDOP method is reliable and practical in use. The method will continue to improve as more high quality interaction data becomes available and is readily scalable t

CiteSeerX

KU ScholarWorks

UEL Research Repository at University of East London

Understanding the behaviour of hackers while performing attack tasks in a professional setting and in a public challenge

Author: A Strauss
A von Mayrhauser
B Barak
Bart Coppens
BG Glaser
Bjorn De Sutter
Cataldo Basile
DC Littman
I Sutherland
J Burkhardt
M Ceccato
Marco Torchiano
Mariano Ceccato
N Pennington
Paolo Falcarin
Paolo Tonella
PC Oorschot van
S Letovsky
U Flick
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2018

When critical assets or functionalities are included in a piece of software accessible to the end users, code protections are used to hinder or delay the extraction or manipulation of such critical assets. The process and strategy followed by hackers to understand and tamper with protected software might differ from program understanding for benign purposes. Knowledge of the actual hacker behaviours while performing real attack tasks can inform better ways to protect the software and can provide more realistic assumptions to the developers, evaluators, and users of software protections. Within Aspire, a software protection research project funded by the EU under framework programme FP7, we have conducted three industrial case studies with the involvement of professional penetration testers and a public challenge consisting of eight attack tasks with open participation. We have applied a systematic qualitative analysis methodology to the hackers’ reports relative to the industrial case studies and the public challenge. The qualitative analysis resulted in 459 and 265 annotations added respectively to the industrial and to the public challenge reports. Based on these annotations we built a taxonomy consisting of 169 concepts. They address the hacker activities related to (i) understanding code; (ii) defining the attack strategy; (iii) selecting and customizing the tools; and (iv) defeating the protections. While there are many commonalities between professional hackers and practitioners, we could spot many fundamental differences. For instance, while industrial professional hackers aim at elaborating automated and reproducible deterministic attacks, practitioners prefer to minimize the effort and try many different manual tasks. This analysis allowed us to distill a number of new research directions and potential improvements for protection techniques. In particular, considering the critical role of analysis tools, protection techniques should explicitly attack them, by exploiting analysis problems and complexity aspects that available automated techniques are bad at addressing