Search CORE

28 research outputs found

Evaluation of linguistic features useful in extraction of interactions from PubMed; Application to annotating known, high-throughput and predicted interactions in I2D

Author: Bader
Barrios-Rodiles
BioCreAtIve
BioCreAtIvE
Brown
Brown
Brown
Bunescu
Bunescu
Collins
David Otasek
Donaldson
Erkan
Fundel
Fundel
Gavin
Giot
Haddow
Hakenberg
Hao
Ho
Hoffmann
Huang
Igor Jurisica
Ingham
Ito
Jang
Joachims
Jones
Kerrien
Krallinger
Krallinger
Leitner
Li
Lin
LLL
Mering
Mewes
Mitsumori
Nielsen
Niu
Otasek
Peri
Plake
Ponzielli
Ramani
Romano
Rual
Stelzl
Temkin
Tsuruoka
Xenarios
Yakushiji
Yun Niu
Zanzoni
Zhou
Zhou
Publication venue: Oxford University Press
Publication date
Field of study

Motivation: Identification and characterization of protein–protein interactions (PPIs) is one of the key aims in biological research. While previous research in text mining has made substantial progress in automatic PPI detection from literature, the need to improve the precision and recall of the process remains. More accurate PPI detection will also improve the ability to extract experimental data related to PPIs and provide multiple evidence for each interaction

Crossref

PubMed Central

Scalable, Integrative Analysis and Visualization of Protein Interactions

Author: Chiara Pastrello
David Otasek
Igor Jurisica
Publication venue: 'IntechOpen'
Publication date: 30/03/2012
Field of study

IntechOpen

STRING v10: protein-protein interaction networks, integrated over the tree of life

Author: Bork P.
Forslund K.
Franceschini A.
Heller D.
Huerta-Cepas J.
Jensen L.J.
Kuhn M.
Roth A.
Santos A.
Simonovic M.
Szklarczyk D.
Tsafou K.P.
von Mering C.
Wyder S.
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

The many functional partnerships and interactions that occur between proteins are at the core of cellular processing and their systematic characterization helps to provide context in molecular systems biology. However, known and predicted interactions are scattered over multiple resources, and the available data exhibit notable differences in terms of quality and completeness. The STRING database (http://string-db.org) aims to provide a critical assessment and integration of protein-protein interactions, including direct (physical) as well as indirect (functional) associations. The new version 10.0 of STRING covers more than 2000 organisms, which has necessitated novel, scalable algorithms for transferring interaction information between organisms. For this purpose, we have introduced hierarchical and self-consistent orthology annotations for all interacting proteins, grouping the proteins into families at various levels of phylogenetic resolution. Further improvements in version 10.0 include a completely redesigned prediction pipeline for inferring protein-protein associations from co-expression data, an API interface for the R computing environment and improved statistical analysis for enrichment tests in user-provided networks

MDC Repository

Integrative computational biology for cancer research

Author: Fortney Kristen
Jurisica Igor
Publication venue: Springer-Verlag
Publication date: 01/01/2011
Field of study

Over the past two decades, high-throughput (HTP) technologies such as microarrays and mass spectrometry have fundamentally changed clinical cancer research. They have revealed novel molecular markers of cancer subtypes, metastasis, and drug sensitivity and resistance. Some have been translated into the clinic as tools for early disease diagnosis, prognosis, and individualized treatment and response monitoring. Despite these successes, many challenges remain: HTP platforms are often noisy and suffer from false positives and false negatives; optimal analysis and successful validation require complex workflows; and great volumes of data are accumulating at a rapid pace. Here we discuss these challenges, and show how integrative computational biology can help diminish them by creating new software tools, analytical methods, and data standards

Springer - Publisher Connector

PubMed Central

Comparative interactomics with Funcoup 2.0

Author: A. Alexeyenko
A. Tjarnberg
Ashburner
Costanzo
D. Guala
E. L. L. Sonnhammer
Kuchaiev
O. Frings
Prieto
Skjolberg
T. Schmitt
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

FunCoup (http://FunCoup.sbc.su.se) is a database that maintains and visualizes global gene/protein networks of functional coupling that have been constructed by Bayesian integration of diverse high-throughput data. FunCoup achieves high coverage by orthology-based integration of data sources from different model organisms and from different platforms. We here present release 2.0 in which the data sources have been updated and the methodology has been refined. It contains a new data type Genetic Interaction, and three new species: chicken, dog and zebra fish. As FunCoup extensively transfers functional coupling information between species, the new input datasets have considerably improved both coverage and quality of the networks. The number of high-confidence network links has increased dramatically. For instance, the human network has more than eight times as many links above confidence 0.5 as the previous release. FunCoup provides facilities for analysing the conservation of subnetworks in multiple species. We here explain how to do comparative interactomics on the FunCoup website

Crossref

Publikationer från Stockholms universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

The IntAct molecular interaction database in 2012

Author: A. Bridge
A. Raghunath
Aranda
B. Aranda
B. Roechert
C. Chen
C. Jandrasits
Demir
E. Pfeiffenberger
F. Broackes-Carter
H. Hermjakob
I. Pedruzzi
J. Khadake
Kerrien
L. Breuza
Lynn
M. Duesbury
M. Dumousseau
M. Feuermann
Orchard
P. Masson
P. Porras
R. C. Jimenez
S. Kerrien
S. Orchard
U. Hinz
U. Mahadevan
Publication venue: Oxford University Press
Publication date
Field of study

IntAct is an open-source, open data molecular interaction database populated by data either curated from the literature or from direct data depositions. Two levels of curation are now available within the database, with both IMEx-level annotation and less detailed MIMIx-compatible entries currently supported. As from September 2011, IntAct contains approximately 275 000 curated binary interaction evidences from over 5000 publications. The IntAct website has been improved to enhance the search process and in particular the graphical display of the results. New data download formats are also available, which will facilitate the inclusion of IntAct's data in the Semantic Web. IntAct is an active contributor to the IMEx consortium (http://www.imexconsortium.org). IntAct source code and data are freely available at http://www.ebi.ac.uk/intact

Crossref

PubMed Central

Classifying protein-protein interaction articles using word and syntactic features

Author: A Ceol
B Aranda
B Settles
C Blaschke
D Rebholz-Schuhmann
E Buyko
GD Bader
GEAPA Batista
H Jang
HJ Lowe
J Björne
JR Curran
K Sugiyama
L Salwinski
L Tanabe
LH Smith
M Huang
M Krallinger
M Krallinger
M Kubat
MF Porter
P Baldi
RK Ando
S Kim
S Kim
S Nash
Sun Kim
T Mitsumort
T Zhang
VN Vapnik
W John Wilbur
Y Miyao
Y Niu
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Crossref

Springer - Publisher Connector

PubMed Central

Automatic extraction of biomolecular interactions: an empirical approach

Author: Berleant Daniel
Ding Jing
Wurtele Eve S.
Zhang Lifeng
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2013
Field of study

Background We describe a method for extracting data about how biomolecule pairs interact from texts. This method relies on empirically determined characteristics of sentences. The characteristics are efficient to compute, making this approach to extraction of biomolecular interactions scalable. The results of such interaction mining can support interaction network annotation, question answering, database construction, and other applications. Results We constructed a software system to search MEDLINE for sentences likely to describe interactions between given biomolecules. The system extracts a list of the interaction-indicating terms appearing in those sentences, then ranks those terms based on their likelihood of correctly characterizing how the biomolecules interact. The ranking process uses a tf-idf (term frequency-inverse document frequency) based technique using empirically derived knowledge about sentences, and was applied to the MEDLINE literature collection. Software was developed as part of the MetNet toolkit (http://www.metnetdb.org). Conclusions Specific, efficiently computable characteristics of sentences about biomolecular interactions were analyzed to better understand how to use these characteristics to extract how biomolecules interact. The text empirics method that was investigated, though arising from a classical tradition, has yet to be fully explored for the task of extracting biomolecular interactions from the literature. The conclusions we reach about the sentence characteristics investigated in this work, as well as the technique itself, could be used by other systems to provide evidence about putative interactions, thus supporting efforts to maximize the ability of hybrid systems to support such tasks as annotating and constructing interaction networks

Digital Repository @ Iowa State University (ISU)

Crossref

Springer - Publisher Connector

PubMed Central

What the papers say: Text mining for genomics and systems biology

Author
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Crossref

The STRING database in 2011: functional interaction networks of proteins, globally integrated and scored

Author: Bork Peer
Doerks Tobias
Franceschini Andrea
Jensen Lars J.
Kuhn Michael
Minguez Pablo
Muller Jean
Roth Alexander
Simonovic Milan
Stark Manuel
Szklarczyk Damian
von Mering Christian
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

An essential prerequisite for any systems-level understanding of cellular functions is to correctly uncover and annotate all functional interactions among proteins in the cell. Toward this goal, remarkable progress has been made in recent years, both in terms of experimental measurements and computational prediction techniques. However, public efforts to collect and present protein interaction information have struggled to keep up with the pace of interaction discovery, partly because protein–protein interaction information can be error-prone and require considerable effort to annotate. Here, we present an update on the online database resource Search Tool for the Retrieval of Interacting Genes (STRING); it provides uniquely comprehensive coverage and ease of access to both experimental as well as predicted interaction information. Interactions in STRING are provided with a confidence score, and accessory information such as protein domains and 3D structures is made available, all within a stable and consistent identifier space. New features in STRING include an interactive network viewer that can cluster networks on demand, updated on-screen previews of structural information including homology models, extensive data updates and strongly improved connectivity and integration with third-party resources. Version 9.0 of STRING covers more than 1100 completely sequenced organisms; the resource can be reached at http://string-db.org

CiteSeerX

PubMed Central

Copenhagen University Research Information System

ZORA

MDC Repository