Search CORE

17 research outputs found

New in protein structure and function annotation: Hotspots, single nucleotide polymorphisms and the 'Deep Web'

Author: Bromberg Yana
Ofran Yanay
Rost Burkhard
Schneider Reinhard
Yachdav Guy
Publication venue
Publication date: 01/01/2009
Field of study

The rapidly increasing quantity of protein sequence data continues to widen the gap between available sequences and annotations. Comparative modeling suggests some aspects of the 3D structures of approximately half of all known proteins; homology- and network-based inferences annotate some aspect of function for a similar fraction of the proteome. For most known protein sequences, however, there is detailed knowledge about neither their function nor their structure. Comprehensive efforts towards the expert curation of sequence annotations have failed to meet the demand of the rapidly increasing number of available sequences. Only the automated prediction of protein function in the absence of homology can close the gap between available sequences and annotations in the foreseeable future. This review focuses on two novel methods for automated annotation, and briefly presents an outlook on how modern web software may revolutionize the field of protein sequence annotation. First, predictions of protein binding sites and functional hotspots, and the evolution of these into the most successful type of prediction of protein function from sequence will be discussed. Second, a new tool, comprehensive in silico mutagenesis, which contributes important novel predictions of function and at the same time prepares for the onset of the next sequencing revolution, will be described. While these two new sub-fields of protein prediction represent the breakthroughs that have been achieved methodologically, it will then be argued that a different development might further change the way biomedical researchers benefit from annotations: modern web software can connect the worldwide web in any browser with the 'Deep Web' (ie, proprietary data resources). The availability of this direct connection, and the resulting access to a wealth of data, may impact drug discovery and development more than any existing method that contributes to protein annotation

Open Repository and Bibliography - Luxembourg

MSAViewer:interactive JavaScript visualization of multiple sequence alignments

Author: Benedikt Rauscher
Burkhard Rost
Corpas
Guy Yachdav
Ian Sillitoe
James Procter
Kultys
Martin
Robert Sheridan
Sebastian Wilzbach
Suzanna E. Lewis
Tatyana Goldberg
Publication venue: 'Oxford University Press (OUP)'
Publication date: 13/07/2016
Field of study

Summary: The MSAViewer is a quick and easy visualization and analysis JavaScript component for Multiple Sequence Alignment data of any size. Core features include interactive navigation through the alignment, application of popular color schemes, sorting, selecting and filtering. The MSAViewer is ‘web ready’: written entirely in JavaScript, compatible with modern web browsers and does not require any specialized software. The MSAViewer is part of the BioJS collection of components. Availability and Implementation: The MSAViewer is released as open source software under the Boost Software License 1.0. Documentation, source code and the viewer are available at http://msa.biojs.net/. Supplementary information: Supplementary data are available at Bioinformatics online. Contact: [email protected]

Crossref

Harvard University - DASH

PubMed Central

UCL Discovery

eScholarship - University of California

University of Dundee Online Publications

LocTree3 prediction of localization

Author: Alberts
Aleksandr Sorokoumov
Alexander Betz
Alice Meier
Altschul
Altschul
Bairoch
Berman
Briesemeister
Burkhard Rost
Dimmer
Goldberg
Guy Yachdav
Hamp
Hassan Nasir
Henrik Nielsen
Horton
Huh
Ilira Troshani
Imai
Jonas Reeb
Jonas Zierer
Julia Gerke
Kajan
Katharina Hembach
Kieu Trinh Do
Kinga Balasz
Koonin
Kuang
Laura Cizmadija
Lee
Maria Kalemanov
Max Herzog
Maximilian Hastreiter
Maximilian Hecht
Michael Bernhofer
Michael Kluge
Mika
Mooney
Nadeem Ahmed
Philipp Angerer
Przybylski
Radivojac
Robert Greil
Rost
Rost
Sander
Simpson
Sonja Ansorge
Sonja Waldraff
Susann Vorberg
Tatyana Goldberg
Timothy Karl
Tobias Hamp
Ulrich Neumaier
Uwe Altermann
Vadim Joerdens
Verena Prade
Yachdav
Yu
Yu
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2014
Field of study

The prediction of protein sub-cellular localization is an important step toward elucidating protein function. For each query protein sequence, LocTree2 applies machine learning (profile kernel SVM) to predict the native sub-cellular localization in 18 classes for eukaryotes, in six for bacteria and in three for archaea. The method outputs a score that reflects the reliability of each prediction. LocTree2 has performed on par with or better than any other state-of-the-art method. Here, we report the availability of LocTree3 as a public web server. The server includes the machine learning-based LocTree2 and improves over it through the addition of homology-based inference. Assessed on sequence-unique data, LocTree3 reached an 18-state accuracy Q18 = 80 ± 3% for eukaryotes and a six-state accuracy Q6 = 89 ± 4% for bacteria. The server accepts submissions ranging from single protein sequences to entire proteomes. Response time of the unloaded server is about 90 s for a 300-residue eukaryotic protein and a few hours for an entire eukaryotic proteome not considering the generation of the alignments. For over 1000 entirely sequenced organisms, the predictions are directly available as downloads. The web server is available at http://www.rostlab.org/services/loctree3

Crossref

PubMed Central

Online Research Database In Technology

Tools and data services registry: a community effort to document bioinformatics resources

Author: Anthon Christian
Beard Niall
Berka Karel
Bolser Dan
Booth Tim
Bretaudeau Anthony
Brezovsky Jan
Brunak Søren
Casadio Rita
Cesareni Gianni
Chmura Piotr
Coppens Frederik
Cornell Michael
Cuccuru Gianmauro
Davidsen Kristian
de la Torre Victor
Dogan Tunca
Doppelt-Azeroual Olivia
Emery Laura
Friborg Rune Møllegaard
Gasteiger Elisabeth
Gatter Thomas
Goldberg Tatyana
Grosjean Marie
Grüning Björn
Helmer-Citterich Manuela
Ienasescu Hans
Ioannidis Vassilios
Ison Jon
Jespersen Martin Closter
Jimenez Rafael
Juty Nick
Juvan Peter
Kalaš Matúš
Koch Maximilian
Laibe Camille
Li Jing-Woei
Licata Luana
Løngreen Peter
Mareuil Fabien
Mičetić Ivan
Moretti Sebastien
Morris Chris
Ménager Hervé
Möller Steffen
Nenadic Aleksandra
Parkinson Helen
Peterson Hedi
Profiti Giuseppe
Rapacki Kristoffer
Rice Peter
Romano Paolo
Roncaglia Paola
Rost Burkhard
Rydza Emil
Saidi Rabie
Schafferhans Andrea
Schwämmle Veit
Smith Callum
Sperotto Maria Maddalena
Stockinger Heinz
Tosatto Silvio C.E.
Uva Paolo
Vařeková Radka Svobodová
Vedova Gianluca Della
Via Allegra
Vriend Gert
Yachdav Guy
Zambelli Federico
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

Life sciences are yielding huge data sets that underpin scientific discoveries fundamental to improvement in human health, agriculture and the environment. In support of these discoveries, a plethora of databases and tools are deployed, in technically complex and diverse implementations, across a spectrum of scientific disciplines. The corpus of documentation of these resources is fragmented across the Web, with much redundancy, and has lacked a common standard of information. The outcome is that scientists must often struggle to find, understand, compare and use the best resources for the task at hand. Here we present a community-driven curation effort, supported by ELIXIR—the European infrastructure for biological information—that aspires to a comprehensive and consistent registry of information about bioinformatics resources. The sustainable upkeep of this Tools and Data Services Registry is assured by a curation effort driven by and tailored to local needs, and shared amongst a network of engaged partners. As of November 2015, the registry includes 1785 resources, with depositions from 126 individual registrations including 52 institutional providers and 74 individuals. With community support, the registry can become a standard for dissemination of information about bioinformatics resources: we welcome everyone to join us in this common endeavour. The registry is freely available at https://bio.tools

HAL Descartes

Online Research Database In Technology

Hal-Diderot

Archivio istituzionale della ricerca - Università di Padova

NERC Open Research Archive

Crossref

Ghent University Academic Bibliography

Copenhagen University Research Information System

PubMed Central

Archivsystem Ask23

Archivio istituzionale della ricerca - Alma Mater Studiorum Università di Bologna

University of Southern Denmark Research Output

Archivio della ricerca- Università di Roma La Sapienza

HAL-Rennes 1

MSAViewer: interactive JavaScript visualization of multiple sequence alignments.

Author: Yachdav Guy,
Publication venue
Publication date: 19/06/2018
Field of study

Ezid

The PredictProtein server

Author: Liu Jinfeng
Rost Burkhard
Yachdav Guy
Publication venue: Oxford University Press
Publication date: 01/01/2004
Field of study

PredictProtein (http://www.predictprotein.org) is an Internet service for sequence analysis and the prediction of protein structure and function. Users submit protein sequences or alignments; PredictProtein returns multiple sequence alignments, PROSITE sequence motifs, low-complexity regions (SEG), nuclear localization signals, regions lacking regular structure (NORS) and predictions of secondary structure, solvent accessibility, globular regions, transmembrane helices, coiled-coil regions, structural switch regions, disulfide-bonds, sub-cellular localization and functional annotations. Upon request fold recognition by prediction-based threading, CHOP domain assignments, predictions of transmembrane strands and inter-residue contacts are also available. For all services, users can submit their query either by electronic mail or interactively via the World Wide Web

CiteSeerX

Crossref

PubMed Central

Epitome: database of structure-inferred antigenic epitopes” Nucl

Author: Avner Schlessinger
Burkhard Rost
Guy Yachdav
Yanay Ofran
Publication venue
Publication date
Field of study

Immunoglobulin molecules specifically recognize particular areas on the surface of proteins. These areas are commonly dubbed B-cell epitopes. The identification of epitopes in proteins is important both for the design of experiments and vaccines. Additionally, the interactions between epitopes and antibodies have often served as a model for protein– protein interactions. One of the main obstacles in creating a database of antigen–antibody interactions is the difficulty in distinguishing between antigenic and non-antigenic interactions. Antigenic interactions involve specific recognition sites on the antibody’s surface, while non-antigenic interactions are between a protein and any other site on the antibody. To solve this problem, we performed a comparative analysis of all protein–antibody complexes for which structures have been experimentally determined. Additionally, we developed a semi-automated tool that identified the antigenic interactions within the known antigen–antibody complex structures. We compiled those interactions into Epitome, a database of structure-inferred antigenic residues in proteins. Epitome consists of all known antigen/ antibody complex structures, a detailed description of the residues that are involved in the interactions, and their sequence/structure environments. Interactions can be visualized using an interface to Jmol. The database is available a

CiteSeerX

Improved disorder prediction by combination of orthogonal approaches

Author: Avner Schlessinger
Burkhard Rost
Guy Yachdav
Laszlo Kajan
Marco Punta
Publication venue
Publication date: 01/01/2009
Field of study

Disordered proteins are highly abundant in regulatory processes such as transcription and cell-signaling. Different methods have been developed to predict protein disorder often focusing on different types of disordered regions. Here, we present MD, a novel META-Disorder prediction method that molds various sources of information predominantly obtained from orthogonal prediction methods, to significantly improve in performance over its constituents. In sustained cross-validation, MD not only outperforms its origins, but it also compares favorably to other state-of-the-art prediction methods in a variet

CiteSeerX

Directory of Open Access Journals

PubMed Central