Search CORE

36 research outputs found

All-paths graph kernel for protein-protein interaction extraction with evaluation of cross-corpus learning

Author: Airola A
Bjorne J
Ginter F
Pahikkala T
Pyysalo S
Salakoski T
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/10/2022
Field of study

Background Automated extraction of protein-protein interactions (PPI) is an important and widely studied task in biomedical text mining. We propose a graph kernel based approach for this task. In contrast to earlier approaches to PPI extraction, the introduced all-paths graph kernel has the capability to make use of full, general dependency graphs representing the sentence structure. Results We evaluate the proposed method on five publicly available PPI corpora, providing the most comprehensive evaluation done for a machine learning based PPI-extraction system. We additionally perform a detailed evaluation of the effects of training and testing on different resources, providing insight into the challenges involved in applying a system beyond the data it was trained on. Our method is shown to achieve state-of-the-art performance with respect to comparable evaluations, with 56.4 F-score and 84.8 AUC on the AImed corpus. Conclusion We show that the graph kernel approach performs on state-of-the-art level in PPI extraction, and note the possible extension to the task of extracting complex interactions. Cross-corpus results provide further insight into how the learning generalizes beyond individual corpora. Further, we identify several pitfalls that can make evaluations of PPI-extraction systems incomparable, or even invalid. These include incorrect cross-validation strategies and problems related to comparing F-score results achieved on different evaluation resources. Recommendations for avoiding these pitfalls are provided. </div

UTUPub

Comparative analysis of five protein-protein interaction corpora

Author: Airola A
Bjorne J
Ginter F
Heimonen J
Pyysalo S
Salakoski T
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 28/10/2022
Field of study

Conclusions: Our comparative analysis uncovers key similarities and differences between the diverse PPI corpora, thus taking an important step towards standardization. In the course of this study we have created a major practical contribution in converting the corpora into a shared format. The conversion software is freely available at http://mars.cs.utu.fi/PPICorpora.</p

UTUPub

PESCADOR, a web-based tool to assist text-mining of biointeractions extracted from PubMed queries

Author: A Bairoch
A Barbosa-Silva
A Herbst
A Kashyap
A Renner
Adriano Barbosa-Silva
AJ Perez
AR Aronson
C Blaschke
C Perez-Iratxeta
C Plake
CE Moussa
CJ Gottardi
D Maglott
DA Benson
Elisa R Donnard
EM Marcotte
EW Sayers
Fernanda Stussi
FT Kolligs
H Shatkay
H Xie
I Iliopoulos
IF Tsigelny
J Bjorne
J Hur
J Miguel Ortega
Jean-Fred Fontaine
JF Fontaine
JF Fontaine
JM Olson
L Guo
M Chagoyen
M Miwa
MG Spillantini
Miguel A Andrade-Navarro
R Bunescu
R Hoffmann
R Leaman
S Dihlmann
S Matos
S Mika
SD Hooper
SK Halder
Z Lu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

BACKGROUND: Biological function is greatly dependent on the interactions of proteins with other proteins and genes. Abstracts from the biomedical literature stored in the NCBI's PubMed database can be used for the derivation of interactions between genes and proteins by identifying the co-occurrences of their terms. Often, the amount of interactions obtained through such an approach is large and may mix processes occurring in different contexts. Current tools do not allow studying these data with a focus on concepts of relevance to a user, for example, interactions related to a disease or to a biological mechanism such as protein aggregation. RESULTS: To help the concept-oriented exploration of such data we developed PESCADOR, a web tool that extracts a network of interactions from a set of PubMed abstracts given by a user, and allows filtering the interaction network according to user-defined concepts. We illustrate its use in exploring protein aggregation in neurodegenerative disease and in the expansion of pathways associated to colon cancer. CONCLUSIONS: PESCADOR is a platform independent web resource available at: http://cbdm.mdc-berlin.de/tools/pescador

Crossref

Springer - Publisher Connector

PubMed Central

MDC Repository

Open Repository and Bibliography - Luxembourg

Hearing difficulties, ear-related diagnoses and sickness absence or disability pension - a systematic literature review

Author: A Bjorne
A Liberati
AK Skooien
B Bjerlemo
C Ide
CD Mathers
CW Ide
D Hasson
Emilie Friberg
G Andersson
GA Gates
HK Neuhauser
K Alexanderson
K Alexanderson
K Alexanderson
K Gustafsson
Klas Gustafsson
KM Holgers
Kristina Alexanderson
M Hagberg
M Marmot
M Sorri
MA Joore
MI Wallhagen
N Chau
PI Carlsson
R Rudin
RK Sewell
S Kochkin
SE Kramer
U Rosenhall
Y Agrawal
Z Starzynski
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Integrated Bio-Entity Network: A System for Biological Knowledge Discovery

Author: A Ceol
A Chatr-aryamontri
A Coulet
A Grote
A Koike
A Mottaz
A Rzhetsky
A Yuryev
B Aranda
C Alfarano
C Blaschke
C Friedman
C Stark
CB Giles
CF Schaefer
D Barrell
D Hristovski
D Maglott
D Maglott
D Tikk
DR Swanson
EW Dijkstra
F Leitner
G Gonzalez
GR Mishra
H Liu
I Iossifov
I Vastrik
J Bjorne
JD Wren
Jinfeng Zhang
JO Korbel
Jun S. Liu
K Du
K Han
KD Pruitt
L Gong
L Salwinski
Lindsey Bell
LJ Jensen
LS Wong
M Ashburner
M Castagna
M Devignes
M Huang
M Kanehisa
M Krallinger
M Krallinger
M Kuhn
M Kuhn
M Yetisgen-Yildiz
MG Kann
N Daraselia
N Sierro
OL Griffith
P Pagel
P Shahi
P Srinivasan
QC Bui
QC Bui
R Apweiler
R Chowdhary
R Crnich
R Frijters
R Hoffmann
R Hoffmann
R Saetre
Rajesh Chowdhary
S Gama-Castro
S Mathivanan
S Naidu
S Yilmaz
T Beuming
TH Cormen
TS Keshava Prasad
V Matys
Xufeng Niu
Y Li
Y Wang
Ying Xu
Z Gao
Z Huang
Publication venue: Public Library of Science
Publication date: 27/06/2011
Field of study

A significant part of our biological knowledge is centered on relationships between biological entities (bio-entities) such as proteins, genes, small molecules, pathways, gene ontology (GO) terms and diseases. Accumulated at an increasing speed, the information on bio-entity relationships is archived in different forms at scattered places. Most of such information is buried in scientific literature as unstructured text. Organizing heterogeneous information in a structured form not only facilitates study of biological systems using integrative approaches, but also allows discovery of new knowledge in an automatic and systematic way. In this study, we performed a large scale integration of bio-entity relationship information from both databases containing manually annotated, structured information and automatic information extraction of unstructured text in scientific literature. The relationship information we integrated in this study includes protein–protein interactions, protein/gene regulations, protein–small molecule interactions, protein–GO relationships, protein–pathway relationships, and pathway–disease relationships. The relationship information is organized in a graph data structure, named integrated bio-entity network (IBN), where the vertices are the bio-entities and edges represent their relationships. Under this framework, graph theoretic algorithms can be designed to perform various knowledge discovery tasks. We designed breadth-first search with pruning (BFSP) and most probable path (MPP) algorithms to automatically generate hypotheses—the indirect relationships with high probabilities in the network. We show that IBN can be used to generate plausible hypotheses, which not only help to better understand the complex interactions in biological systems, but also provide guidance for experimental designs

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Author: Alborzi S. Z.
Altenhoff A.
Amezola M.
Antczak M.
Aridhi S.
Asgari E.
Atalay V.
Babbitt P. C.
Barot M.
Ben-Hur A.
Benso A.
Bergquist T. R.
Berselli M.
Bhat P.
Bjorne J.
Black G. S.
Boecker F.
Bonneau R.
Borukhov I.
Bosco G.
Boudellioua I.
Brackenridge D. A.
Brenner S. E.
Cao R.
Carraro M.
Casadio R.
Cetin Atalay R.
Chandler C.
Chang J. -M.
Cheng J.
Chi P. -H.
Cozzetto D.
Crocker A. W.
Dai S.
Dalklran A.
Das S.
Davidovic R. S.
Davis L.
Dayton J. B.
Dessimoz C.
Devignes M. -D.
Di Carlo S.
Dogan T.
Dzeroski S.
Fa R.
Fabris F.
Falda M.
Fang H.
Fernandez J. M.
Fontana P.
Frank Y.
Frasca M.
Freddolino P. L.
Freitas A. A.
Friedberg I.
Gemovic B.
Georghiou G.
Ginter F.
Gligorijevic V.
Goldberg T.
Gough J.
Greene C. S.
Grossi G.
Hakala K.
Hamid M. N.
Hoehndorf R.
Hogan D. A.
Holm L.
Hou J.
Hurto R. L.
Jain A.
Jeffery C. J.
Jiang Y.
Jo D.
Johnson D.
Jones D. T.
Kacsoh B. Z.
Kaewphan S.
Kahanda I.
Kihara D.
Koo D. C. E.
Kulmanov M.
Larsen D. J.
Lavezzo E.
Lee A. J.
Lees J. G.
Lewis K. A.
Liao W. -H.
Lichtarge O.
Linial M.
Liu Y. -W.
Mao Q.
Martelli P. L.
Martin M. J.
McGuffin L. J.
McHardy A. C.
Medlar A. J.
Mehryary F.
Mesiti M.
Moen H.
Mofrad M. R. K.
Mooney S. D.
Nguyen H. N.
Notaro M.
Novikov I.
O'Donovan C.
Omdahl A. R.
Orengo C. A.
Paccanaro A.
Pascarelli S.
Perovic V. R.
Petrini A.
Piovesan D.
Politano G.
Profiti G.
Radivojac P.
Re M.
Reeb J.
Renaux A.
Rifaioglu A. S.
Ritchie D. W.
Roche D. B.
Rodriguez J. M.
Romero A. E.
Rose P. W.
Rost B.
Saidi R.
Salakoski T.
Savojardo C.
Schoof H.
Sillitoe I.
Smuc T.
Suh E.
Sumonja N.
Supek F.
Thurlby N.
Tian W.
Tolvanen M. E. E.
Toppo S.
Toronen P.
Torres M.
Tosatto S. C. E.
Tress M. L.
Tseng W. -C.
Ur Rehman H.
Valentini G.
Veljkovic N.
Vidulin V.
Vucetic S.
Wan C.
Wang Z.
Warwick Vesztrocy A.
Wass M. N.
Wilkins A.
Yang H.
Yao S.
You R.
Yunes J. M.
Zhang C.
Zhang F.
Zhang S.
Zhang Y.
Zhang Z.
Zhao C.
Zhou N.
Zhu S.
Zosa E.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Background: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Results: Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole genome mutation screening in Candida albicans and aeruginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster, which we suspected of being involved in long-term memory. Conclusion: We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging; although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster, it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Nitric Oxide Antagonizes the Acid Tolerance Response that Protects Salmonella against Innate Gastric Defenses

Author: A Fite
A Llama-Palacios
AJ Baumler
Alejandro Aballay
Andrés Vázquez-Torres
AW van der Velden
B D'Autreaux
B Spiegelhalder
BD McCollister
BD McCollister
BL Bearson
C Duncan
CD Ellermeier
CJ Dorman
CR Hung
CS Butler
D Zhou
D Zhou
EA Groisman
FC Soncini
FC Soncini
GC Cook
GJ Leyer
H Bjorne
H Hori
H Reynaert
Harry Ischiropoulos
HJ Vogel
HK Hall
I Stojiljkovic
I Zwir
IS Bang
IS Lee
J Deiwick
J Xu
JB Dressman
JH Miller
JJ Bijlsma
JW Foster
JW Foster
JW Foster
JW Foster
K Eichelberg
KA Datsenko
KE Klose
L Puzniak
LR Prost
M Husain
Michael McClelland
MJ Worley
N Benjamin
OL Wijburg
P Adams
P Holt
P Monsieurs
P Small
PC Oyston
PI Fields
PJ Buchin
PP Cherepanov
R Cunningham
RA Giannella
RB Canani
RF Wang
RS Dykhuizen
RT Bacon
Rui Zhao
RW Stockbruegger
S Porwollik
S Uzzau
SI Miller
SR Tannenbaum
Steffen Porwollik
T Aiso
Todd Greco
Travis J. Bourret
VG Tusher
YA Golubeva
YH Lee
Publication venue: Public Library of Science
Publication date: 01/03/2008
Field of study

Reactive nitrogen species (RNS) derived from dietary and salivary inorganic nitrogen oxides foment innate host defenses associated with the acidity of the stomach. The mechanisms by which these reactive species exert antimicrobial activity in the gastric lumen are, however, poorly understood.The genetically tractable acid tolerance response (ATR) that enables enteropathogens to survive harsh acidity was screened for signaling pathways responsive to RNS. The nitric oxide (NO) donor spermine NONOate derepressed the Fur regulon that controls secondary lines of resistance against organic acids. Despite inducing a Fur-mediated adaptive response, acidified RNS largely repressed oral virulence as demonstrated by the fact that Salmonella bacteria exposed to NO donors during mildly acidic conditions were shed in low amounts in feces and exhibited ameliorated oral virulence. NO prevented Salmonella from mounting a de novo ATR, but was unable to suppress an already functional protective response, suggesting that RNS target regulatory cascades but not their effectors. Transcriptional and translational analyses revealed that the PhoPQ signaling cascade is a critical ATR target of NO in rapidly growing Salmonella. Inhibition of PhoPQ signaling appears to contribute to most of the NO-mediated abrogation of the ATR in log phase bacteria, because the augmented acid sensitivity of phoQ-deficient Salmonella was not further enhanced after RNS treatment.Since PhoPQ-regulated acid resistance is widespread in enteric pathogens, the RNS-mediated inhibition of the Salmonella ATR described herein may represent a common component of innate host defenses

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California