
531 research outputs found

The Ontology Lookup Service: more data and better tools for controlled vocabulary queries

Author: Cote
H. Hermjakob
Hull
L. Martens
Orchard
P. Jones
R. Apweiler
R. G. Cote
Smith
Publication venue: Oxford University Press
Publication date: 01/01/2008

The Ontology Lookup Service (OLS) (http://www.ebi.ac.uk/ols) provides interactive and programmatic interfaces to query, browse and navigate an ever-increasing number of biomedical ontologies and controlled vocabularies. The volume of data available for querying has more than quadrupled since it went into production and OLS functionality has been integrated into several high-usage databases and data entry tools. Improvements have been made to both OLS query interfaces, based on user feedback and requirements, to improve usability and service interoperability and provide novel ways to perform queries.

CiteSeerX

Crossref

Ghent University Academic Bibliography

PubMed Central

Recommended from our members

Cellular resolution models for even skipped regulation in the entire Drosophila embryo

Author: Apweiler Rolf
DePace Angela H
Fisher Jasmin
Ilsley Garth R
Luscombe Nicholas M
Publication venue: eLife Sciences Publications, Ltd
Publication date: 01/03/2014

Transcriptional control ensures genes are expressed in the right amounts at the correct times and locations. Understanding quantitatively how regulatory systems convert input signals to appropriate outputs remains a challenge. For the first time, we successfully model even skipped (eve) stripes 2 and 3+7 across the entire fly embryo at cellular resolution. A straightforward statistical relationship explains how transcription factor (TF) concentrations define eve’s complex spatial expression, without the need for pairwise interactions or cross-regulatory dynamics. Simulating thousands of TF combinations, we recover known regulators and suggest new candidates. Finally, we accurately predict the intricate effects of perturbations including TF mutations and misexpression. Our approach imposes minimal assumptions about regulatory function; instead we infer underlying mechanisms from models that best fit the data, like the lack of TF-specific thresholds and the positional value of homotypic interactions. Our study provides a general and quantitative method for elucidating the regulation of diverse biological systems. DOI: http://dx.doi.org/10.7554/eLife.00522.00

Harvard University - DASH

Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads

Author: Bentley
Iafrate
K. Ye
Kidd
Levy
M. H. Schulz
Ning
Q. Long
R. Apweiler
Schulz
Sebat
Wheeler
Z. Ning
Publication venue: Oxford University Press
Publication date: 26/06/2009

Motivation: There is a strong demand in the genomic community to develop effective algorithms to reliably identify genomic variants. Indel detection using next-gen data is difficult and identification of long structural variations is extremely challenging.

Crossref

PubMed Central

MPG.PuRe

Generation of asparagine-linked glycan structure databases and their use

Author: D. H. Dube
J. M. Ren
J. T. Prince
K. Ohtsubo
M. M. Fuster
N. H. Packer
R. Apweiler
R. Kornfeld
S. E. Stein
U. M. Demelbauer
Publication venue: Elsevier BV

Crossref

gViz, a novel tool for the visualization of co-expression networks

Author: B Boeckmann
Bertrand De Meulder
CF Chen
D Jupiter
DJ Watts
E Bareke
Eric Bareke
Eric Depiereux
H Ogata
H Yu
J Rashbass
J Schafer
JC Guillaume
M Ashburner
M Girvan
M Suderman
Michael Pierre
N Salomonis
Naji Habra
P JU
P Shannon
PE Meyer
R Apweiler
R Castelo
Raphaël Helaers
Sophie Depiereux
T Hubbard
U Brandes
Z Wu
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: The quantity of microarray data available on the Internet has grown dramatically over the past years and now represents millions of Euros worth of underused information. One way to use this data is through co-expression analysis. To avoid a certain amount of bias, such data must often be analyzed at the genome scale, for example by network representation. The identification of co-expression networks is an important means to unravel gene to gene interactions and the underlying functional relationship between them. However, it is very difficult to explore and analyze a network of such dimensions. Several programs (Cytoscape, yEd) have already been developed for network analysis; however, to our knowledge, there are no available GraphML compatible programs. Findings: We designed and developed gViz, a GraphML network visualization and exploration tool. gViz is built on clustering coefficient-based algorithms and is a novel tool to visualize and manipulate networks of co-expression interactions among a selection of probesets (each representing a single gene or transcript), based on a set of microarray co-expression data stored as an adjacency matrix. Conclusions: We present here gViz, a software tool designed to visualize and explore large GraphML networks, combining network theory, biological annotation data, microarray data analysis and advanced graphical features.
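
The abstract above mentions clustering coefficient-based algorithms operating on co-expression data stored as an adjacency matrix. As a rough illustration only (this is not gViz's code, and the function name and the four-probeset matrix are hypothetical), a local clustering coefficient for an undirected, unweighted network might be sketched like this:

```python
from itertools import combinations


def local_clustering(adjacency, node):
    """Fraction of a node's neighbour pairs that are themselves linked.

    `adjacency` is assumed to be a symmetric 0/1 matrix (list of lists).
    """
    neighbours = [j for j, linked in enumerate(adjacency[node])
                  if linked and j != node]
    k = len(neighbours)
    if k < 2:
        # The coefficient is undefined for fewer than two neighbours;
        # returning 0.0 is a convention chosen for this sketch.
        return 0.0
    links = sum(1 for a, b in combinations(neighbours, 2) if adjacency[a][b])
    return 2 * links / (k * (k - 1))


# Hypothetical network of four probesets with edges 0-1, 0-2, 1-2, 2-3.
adjacency = [
    [0, 1, 1, 0],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [0, 0, 1, 0],
]
coefficients = [local_clustering(adjacency, n) for n in range(len(adjacency))]
```

In this toy network, probesets 0 and 1 sit in a closed triangle, probeset 2 has three neighbours of which only one pair is linked, and probeset 3 has a single neighbour. Real co-expression matrices are usually weighted, so a threshold or a weighted variant of the coefficient would be needed in practice.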

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

DIAL UCLouvain

Repository of the University of Namur

Using optimized collision energies and high resolution, high accuracy fragment ion selection to improve glycopeptide detection by precursor ion scanning

Author: A. C. Rigby
A. Shevchenko
A.-S. Petersson
D. E. Kalume
G. W. Hart
H. Steen
J. Gobom
K. F. Medzihradszky
M. A. Ritchie
M. J. Huddleston
M. Ploug
M. Wilm
R. Apweiler
S. Carr
Publication venue: Elsevier BV

Crossref

IntAct—open source resource for molecular interaction data

Author: Alam-Faruque Y.
Apweiler R.
Aranda B.
Bancarz I.
Bridge A.
Derow C.
Dimmer E.
Feuermann M.
Friedrichsen A.
Hermjakob H.
Huntley R.
Kerrien S.
Khadake J.
Kohler C.
Leroy C.
Liban A.
Lieftink C.
Montecchi-Palazzi L.
Orchard S.
Risse J.
Robbe K.
Roechert B.
Thorneycroft D.
Zhang Y.
Publication venue: Oxford University Press
Publication date: 01/12/2006

IntAct is an open source database and software suite for modeling, storing and analyzing molecular interaction data. The data available in the database originates entirely from published literature and is manually annotated by expert biologists to a high level of detail, including experimental methods, conditions and interacting domains. The database features over 126 000 binary interactions extracted from over 2100 scientific publications and makes extensive use of controlled vocabularies. The web site provides tools allowing users to search, visualize and download data from the repository. IntAct supports and encourages local installations as well as direct data submission and curation collaborations. IntAct source code and data are freely available from

FLORA: a novel method to predict protein function from structure in diverse superfamilies

Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2–3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (α, β, αβ) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

UCL Discovery

PubMed Central

ProteomeXchange provides globally coordinated proteomics data submission and dissemination.

Author: Alex Campos
Andrew R Jones
Attila Csordas
Daniel Ríos
Eric W Deutsch
EW Deutsch
Florian Reisinger
Gerhard Mayer
Gilbert S Omenn
H Hahne
H Hermjakob
Hans-Joachim Kraus
Henning Hermjakob
I Matic
Ioannis Xenarios
José A Dianes
Juan A Vizcaíno
Juan Pablo Albar
JV Olsen
L Martens
Laurent Gatto
Lennart Martens
M Vaudel
Martin Eisenacher
Nuno Bandeira
Pierre-Alain Binz
R Craig
Robert J Chalkley
Rolf Apweiler
Rui Wang
Salvador Martinez-Bartolomé
T Farrah
Terry Farrah
Zhi Sun
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

Crossref

Serveur académique lausannois

Ghent University Academic Bibliography

PubMed Central

A Systematic Survey of Mini-Proteins in Bacteria and Archaea

Author: A de la Pena-Moctezuma
B Imperiali
DJ Lipman
DT Krieger
Fengyu Wang
Guoqiang Zhang
H Amiri
H Seligmann
J Ebedes
JC Hotopp
Jingfa Xiao
Josh Bongard
JP Kastenmayer
Jun Yu
KA Brayton
L Brocchieri
L Martin
Linlin Pan
Ming Yang
P Setlow
R Apweiler
RL Tatusov
S Gribaldo
S Kumar
SH Gellman
Shouguang Jin
SJ Wheelan
UH Ha
W Wu
Publication venue: Public Library of Science

BACKGROUND: Mini-proteins, defined as polypeptides containing no more than 100 amino acids, are ubiquitous in prokaryotes and eukaryotes. They play significant roles in various biological processes, and their regulatory functions are gradually attracting the attention of scientists. However, the functions of most mini-proteins remain largely unknown owing to the constraints of experimental methods and bioinformatic analysis. METHODOLOGY/PRINCIPAL FINDINGS: In this article, we extracted a total of 180,879 mini-proteins from the annotations of 532 sequenced genomes, including 491 strains of Bacteria and 41 strains of Archaea. The average proportion of mini-proteins among all genomic proteins is approximately 10.99%, but the proportion fluctuates markedly between strains. These mini-proteins display two notable characteristics. First, the majority are species-specific proteins, with an average proportion of 58.79% among six representative phyla. Second, an even larger proportion (70.03% among all strains) are hypothetical proteins. However, a fraction of highly conserved hypothetical proteins potentially play crucial roles in organisms. Among mini-proteins with known functions, regulatory and metabolic proteins appear to be more abundant than essential structural proteins. Furthermore, domains in mini-proteins seem to have greater distributions in Bacteria than Eukarya. Analysis of the evolutionary progression of these domains reveals that they have diverged to new patterns from a single ancestor. CONCLUSIONS/SIGNIFICANCE: Mini-proteins are ubiquitous in bacterial and archaeal species and play significant roles in various functions. The number of mini-proteins in each genome displays remarkable fluctuation, likely resulting from the differential selective pressures that reflect the respective life-styles of the organisms. The answers to many questions surrounding mini-proteins remain elusive and need to be resolved experimentally.
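
The abstract defines mini-proteins by a length cutoff (no more than 100 amino acids) and reports their proportion among all annotated proteins of a genome. The following is a minimal sketch of that arithmetic under stated assumptions: the function name, the input format (a dict of identifier to sequence) and the toy genome are hypothetical and are not taken from the paper's pipeline.

```python
# Threshold taken from the abstract: no more than 100 amino acids.
MAX_MINI_PROTEIN_LENGTH = 100


def mini_protein_fraction(proteins):
    """Return (mini_protein_ids, fraction) for a dict of id -> sequence."""
    if not proteins:
        return [], 0.0
    minis = [pid for pid, seq in proteins.items()
             if len(seq) <= MAX_MINI_PROTEIN_LENGTH]
    return minis, len(minis) / len(proteins)


# Hypothetical four-protein "genome"; sequences are placeholders.
toy_genome = {
    "protA": "M" * 60,
    "protB": "M" * 100,   # exactly at the cutoff, so still a mini-protein
    "protC": "M" * 101,
    "protD": "M" * 350,
}
minis, fraction = mini_protein_fraction(toy_genome)
```

Here two of the four placeholder proteins fall at or under the cutoff, giving a fraction of 0.5. Applied per genome to real annotation files, the same ratio would correspond to the per-strain proportions the abstract summarises.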

Crossref

Directory of Open Access Journals

PubMed Central