Search CORE

387 research outputs found

InterProScan: protein domains identifier

Author: Apweiler R.
Harte N.
Lopez R.
Mulder N.
Pillai S.
Quevillon E.
Silventoinen V.
Publication venue: Oxford University Press
Publication date: 27/06/2005
Field of study

InterProScan [E. M. Zdobnov and R. Apweiler (2001) Bioinformatics, 17, 847–848] is a tool that combines different protein signature recognition methods from the InterPro [N. J. Mulder, R. Apweiler, T. K. Attwood, A. Bairoch, A. Bateman, D. Binns, P. Bradley, P. Bork, P. Bucher, L. Cerutti et al. (2005) Nucleic Acids Res., 33, D201–D205] consortium member databases into one resource. At the time of writing there are 10 distinct publicly available databases in the application. Protein as well as DNA sequences can be analysed. A web-based version is accessible for academic and commercial organizations from the EBI (). In addition, a standalone Perl version and a SOAP Web Service [J. Snell, D. Tidwell and P. Kulchenko (2001) Programming Web Services with SOAP, 1st edn. O'Reilly Publishers, Sebastopol, CA, ] are also available to the users. Various output formats are supported and include text tables, XML documents, as well as various graphs to help interpret the results

Crossref

PubMed Central

A new bioinformatics analysis tools framework at EMBL–EBI

Author: Edgar
F. Valentin
H. McWilliam
Hull
J. Paern
Katoh
Lassmann
Lopez
M. Goujon
Notredame
Pearson
Quevillon
R. Lopez
S. Squizzato
W. Li
Publication venue: Oxford University Press
Publication date
Field of study

The EMBL-EBI provides access to various mainstream sequence analysis applications. These include sequence similarity search services such as BLAST, FASTA, InterProScan and multiple sequence alignment tools such as ClustalW, T-Coffee and MUSCLE. Through the sequence similarity search services, the users can search mainstream sequence databases such as EMBL-Bank and UniProt, and more than 2000 completed genomes and proteomes. We present here a new framework aimed at both novice as well as expert users that exposes novel methods of obtaining annotations and visualizing sequence analysis results through one uniform and consistent interface. These services are available over the web and via Web Services interfaces for users who require systematic access or want to interface with customized pipe-lines and workflows using common programming languages. The framework features novel result visualizations and integration of domain and functional predictions for protein database searches. It is available at http://www.ebi.ac.uk/Tools/sss for sequence similarity searches and at http://www.ebi.ac.uk/Tools/msa for multiple sequence alignments

Crossref

PubMed Central

Direct detection of Higgs-portal dark matter at the LHC

Author: A. Alves
A. Djouadi
A. Djouadi
A. Djouadi
A. Djouadi
A. Djouadi
A. Drozd
Abdelhak Djouadi
Adam Falkowski
C. Anastasiou
C. Englert
C. Englert
C.P. Burgess
D. Carmi
D. Choudhury
E. Aprile
E. Bagnaschi
E. Komatsu
G. Aad
H. Davoudiasl
H. Davoudiasl
H. Georgi
J. Alwall
J. Baglio
J. Kopp
J. McDonald
Jérémie Quevillon
M. Drees
M. Montull
M. Raidal
M. Spira
O. Eboli
O. Lebedev
R. Godbole
R. Harlander
R.K. Ellis
S. Alioli
S. Andreas
S. Catani
S. Chatrchyan
S. Dawson
S. Kanemura
T. Hambye
T. Sjostrand
V. Ravindran
V. Silveira
X.-G. He
Y. Mambrini
Yann Mambrini
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 14/05/2012
Field of study

We consider the process in which a Higgs particle is produced in association with jets and show that monojet searches at the LHC already provide interesting constraints on the invisible decays of a 125 GeV Higgs boson. Using the existing monojet searches performed by CMS and ATLAS, we show the 95% confidence level limit on the invisible Higgs decay rate is of the order of the total Higgs production rate in the Standard Model. This limit could be significantly improved when more data at higher center of mass energies are collected, provided systematic errors on the Standard Model contribution to the monojet background can be reduced. We also compare these direct constraints on the invisible rate with indirect ones based on measuring the Higgs rates in visible channels. In the context of Higgs portal models of dark matter, we then discuss how the LHC limits on the invisible Higgs branching fraction impose strong constraints on the dark matter scattering cross section on nucleons probed in direct detection experiments.Comment: 6 pages, 3 figures; v2: references added; v3: monojet and Higgs data updated, version published in EPJ

arXiv.org e-Print Archive

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

Springer - Publisher Connector

HAL Descartes

CERN Document Server

Hal-Diderot

Manual GO annotation of predictive protein signatures: the InterPro approach to GO curation

Author: A. Mitchell
A. Sangrador-Vegas
Ashburner
C. McAnulla
Camon
D. Lonsdale
Deegan
E. Kelly
Harms
Jensen
N. Mulder
P. Mutowo-Muellenet
Quevillon
S. Burge
S. Hunter
S.-Y. Yong
Vousden
Publication venue: Oxford University Press
Publication date
Field of study

InterPro amalgamates predictive protein signatures from a number of well-known partner databases into a single resource. To aid with interpretation of results, InterPro entries are manually annotated with terms from the Gene Ontology (GO). The InterPro2GO mappings are comprised of the cross-references between these two resources and are the largest source of GO annotation predictions for proteins. Here, we describe the protocol by which InterPro curators integrate GO terms into the InterPro database. We discuss the unique challenges involved in integrating specific GO terms with entries that may describe a diverse set of proteins, and we illustrate, with examples, how InterPro hierarchies reflect GO terms of increasing specificity. We describe a revised protocol for GO mapping that enables us to assign GO terms to domains based on the function of the individual domain, rather than the function of the families in which the domain is found. We also discuss how taxonomic constraints are dealt with and those cases where we are unable to add any appropriate GO terms. Expert manual annotation of InterPro entries with GO terms enables users to infer function, process or subcellular information for uncharacterized sequences based on sequence matches to predictive models

Crossref

PubMed Central

POGs/PlantRBP: a resource for comparative genomics in plants

Author: Alice Barkan
Altschul
Barkan
Bateman
Belostotsky
Brown
Chiu
Cokol
Edgar
Emanuelsson
Gough
Heazlewood
Horan
Lorkovic
Lurin
Needleman
Nicholas Stiffler
Nigel S. Walker
Quevillon
Rice
Small
Stajich
Sun
Vermel
Wang
Publication venue: Oxford University Press
Publication date: 01/01/2006
Field of study

POGs/PlantRBP () is a relational database that integrates data from rice, Arabidopsis, and maize by placing the complete Arabidopsis and rice proteomes and available maize sequences into ‘putative orthologous groups’ (POGs). Annotation efforts will focus on predicted RNA binding proteins (RBPs): i.e. those with known RNA binding domains or otherwise implicated in RNA function. POGs form the heart of the database, and were assigned using a mutual-best-hit-strategy after performing BLAST comparisons of the predicted Arabidopsis and rice proteomes. Each POG entry includes orthologs in Arabidopsis and rice, annotated with domain organization, gene models, phylogenetic trees, and multiple intracellular targeting predictions. A graphical display maps maize sequences on to their most similar rice gene model. The database can be queried using any combination of gene name, accession, domain, and predicted intracellular location, or using BLAST. Useful features of the database include the ability to search for proteins with both a specified domain content and intracellular location, the concurrent display of mutual best hits and phylogenetic trees which facilitates evaluation of POG assignments, the association of maize sequences with POGs, and the display of targeting predictions and domain organization for all POG members, which reveals consistency, or lack thereof, of those predictions

CiteSeerX

Crossref

PubMed Central

d-Omix: a mixer of generic protein domain analysis tools

Author: Alako
Apic
Bashton
Bashton
Bj rklund
Bork
D. Wichadakul
Enright
Geer
Han
Hegyi
Hiraguri
Marcotte
Mott
Murzin
Novatchkova
Orengo
Quevillon
S. Ingsriswang
S. Numnark
Thompson
Vogel
Watts
Wuchty
Ye
Publication venue: Oxford University Press
Publication date
Field of study

Domain combination provides important clues to the roles of protein domains in protein function, interaction and evolution. We have developed a web server d-Omix (a Mixer of Protein Domain Analysis Tools) aiming as a unified platform to analyze, compare and visualize protein data sets in various aspects of protein domain combinations. With InterProScan files for protein sets of interest provided by users, the server incorporates four services for domain analyses. First, it constructs protein phylogenetic tree based on a distance matrix calculated from protein domain architectures (DAs), allowing the comparison with a sequence-based tree. Second, it calculates and visualizes the versatility, abundance and co-presence of protein domains via a domain graph. Third, it compares the similarity of proteins based on DA alignment. Fourth, it builds a putative protein network derived from domain–domain interactions from DOMINE. Users may select a variety of input data files and flexibly choose domain search tools (e.g. hmmpfam, superfamily) for a specific analysis. Results from the d-Omix could be interactively explored and exported into various formats such as SVG, JPG, BMP and CSV. Users with only protein sequences could prepare an InterProScan file using a service provided by the server as well. The d-Omix web server is freely available at http://www.biotec.or.th/isl/Domix

Crossref

PubMed Central

FungiDB: an integrated functional genomics database for fungi

Author: Altschul
B. P. Brunk
Bendtsen
Boguski
C. J. Stoeckert
Cho
Cuomo
D. F. Pinney
D. S. Roos
Dean
Fedorova
Galagan
Galagan
J. Brestelli
J. C. Kissinger
J. E. Stajich
Kim
Li
Ma
Ma
Mewes
Nierman
O. S. Harb
Quevillon
S. Fischer
Spellman
Stein
T. Harris
V. Nayak
W. Li
Publication venue: Oxford University Press
Publication date: 01/01/2011
Field of study

FungiDB (http://FungiDB.org) is a functional genomic resource for pan-fungal genomes that was developed in partnership with the Eukaryotic Pathogen Bioinformatic resource center (http://EuPathDB.org). FungiDB uses the same infrastructure and user interface as EuPathDB, which allows for sophisticated and integrated searches to be performed using an intuitive graphical system. The current release of FungiDB contains genome sequence and annotation from 18 species spanning several fungal classes, including the Ascomycota classes, Eurotiomycetes, Sordariomycetes, Saccharomycetes and the Basidiomycota orders, Pucciniomycetes and Tremellomycetes, and the basal ‘Zygomycete’ lineage Mucormycotina. Additionally, FungiDB contains cell cycle microarray data, hyphal growth RNA-sequence data and yeast two hybrid interaction data. The underlying genomic sequence and annotation combined with functional data, additional data from the FungiDB standard analysis pipeline and the ability to leverage orthology provides a powerful resource for in silico experimentation

CiteSeerX

Crossref

PubMed Central

eScholarship - University of California

Recommended from our members

Purification and functional characterisation of rhiminopeptidase A, a novel aminopeptidase from the venom of Bitis gabonica rhinoceros

This study describes the discovery and characterisation of a novel aminopeptidase A from the venom of B. g. rhinoceros and highlights its potential biological importance. Similar to mammalian aminopeptidases, rhiminopeptidase A might be capable of playing roles in altering the blood pressure and brain function of victims. Furthermore, it could have additional effects on the biological functions of other host proteins by cleaving their N-terminal amino acids. This study points towards the importance of complete analysis of individual components of snake venom in order to develop effective therapies for snake bites

Central Archive at the University of Reading

Public Library of Science (PLOS)

LSTM Online Archive

Crossref

Directory of Open Access Journals

PubMed Central

Exploiting Amino Acid Composition for Predicting Protein-Protein Interactions

Author: A Ben-Hur
A Jaimovich
AK McCallum
AP Gasch
B Raghavachari
BA Shoemaker
BA Shoemaker
C Stark
Dafydd Jones
Diego Martinez
E Quevillon
E Quevillon
E Sprinzak
H Huang
H Yu
Harriett Platero
I Lee
IH Witten
J Qiu
J Sun
JH Fong
K Nigam
M Ashburner
M Mahdavi
M Werner-Washburne
Margaret Werner-Washburne
MG Kann
N Simonis
NJ Krogan
P Langley
P Uetz
R Jansen
R Jothi
R Li
S Li
S Martin
SM Gomez
Sushmita Roy
T Ito
T Joachims
TA Lasko
Terran Lane
Y Liu
Y Ofran
Publication venue: Public Library of Science
Publication date: 01/11/2009
Field of study

Computational prediction of protein interactions typically use protein domains as classifier features because they capture conserved information of interaction surfaces. However, approaches relying on domains as features cannot be applied to proteins without any domain information. In this paper, we explore the contribution of pure amino acid composition (AAC) for protein interaction prediction. This simple feature, which is based on normalized counts of single or pairs of amino acids, is applicable to proteins from any sequenced organism and can be used to compensate for the lack of domain information.AAC performed at par with protein interaction prediction based on domains on three yeast protein interaction datasets. Similar behavior was obtained using different classifiers, indicating that our results are a function of features and not of classifiers. In addition to yeast datasets, AAC performed comparably on worm and fly datasets. Prediction of interactions for the entire yeast proteome identified a large number of novel interactions, the majority of which co-localized or participated in the same processes. Our high confidence interaction network included both well-studied and uncharacterized proteins. Proteins with known function were involved in actin assembly and cell budding. Uncharacterized proteins interacted with proteins involved in reproduction and cell budding, thus providing putative biological roles for the uncharacterized proteins.AAC is a simple, yet powerful feature for predicting protein interactions, and can be used alone or in conjunction with protein domains to predict new and validate existing interactions. More importantly, AAC alone performs at par with existing, but more complex, features indicating the presence of sequence-level information that is predictive of interaction, but which is not necessarily restricted to domains

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central