Search CORE

76 research outputs found

The RCSB PDB information portal for structural genomics

Author: Berman Helen M.
Bourne Philip E.
Chen Li
de la Cruz Joanna
Kouranov Andrei
Westbrook John
Xie Lei
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

The RCSB Protein Data Bank (PDB) offers online tools, summary reports and target information related to the worldwide structural genomics initiatives from its portal at . There are currently three components to this site: Structural Genomics Initiatives contains information and links on each structural genomics site, including progress reports, target lists, target status, targets in the PDB and level of sequence redundancy; Targets provides combined target information, protocols and other data associated with protein structure determination; and Structures offers an assessment of the progress of structural genomics based on the functional coverage of the human genome by PDB structures, structural genomics targets and homology models. Functional coverage can be examined according to enzyme classification, gene ontology (biological process, cell component and molecular function) and disease

Crossref

PubMed Central

REBASE—a database for DNA restriction and modification: enzymes, genes and genomes

Author: Benson
Dana Macelis
Finn
Janos Posfai
Kouranov
Pruitt
Richard J. Roberts
Roberts
Roberts
Tamas Vincze
The UniProt Consortium
Venter
Zheng
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

REBASE is a comprehensive database of information about restriction enzymes, DNA methyltransferases and related proteins involved in the biological process of restriction–modification (R–M). It contains fully referenced information about recognition and cleavage sites, isoschizomers, neoschizomers, commercial availability, methylation sensitivity, crystal and sequence data. Experimentally characterized homing endonucleases are also included. The fastest growing segment of REBASE contains the putative R–M systems found in the sequence databases. Comprehensive descriptions of the R–M content of all fully sequenced genomes are available including summary schematics. The contents of REBASE may be browsed from the web (http://rebase.neb.com) and selected compilations can be downloaded by ftp (ftp.neb.com). Additionally, monthly updates can be requested via email

CiteSeerX

Crossref

PubMed Central

SchistoDB: a Schistosoma mansoni genome resource

Author: A. J. Dominitini
A. Zerlotini
Bahl
Chen
Chitsulo
El-Sayed
G. Oliveira
Gene Ontology Consortium
H. Wang
Haas
J. C. Kissinger
J. C. Ruiz
Karp
Kouranov
Le
M. Heiges
McCarter
Mulder
Pica-Mattoccia
R. L. V. Moraes
Simpson
Stein
Verjovski-Almeida
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

SchistoDB (http://schistoDB.net/) is a genomic database for the parasitic organism Schistosoma mansoni, one of the major causative agents of schistosomiasis worldwide. It currently incorporates sequences and annotation for S. mansoni in a single user-friendly database. Several genomic scale analyses are available as well as ESTs, oligonucleotides, metabolic pathways and drugs. In this article, we describe the data sets and its analyses, how to query the database and tools available in the website

CiteSeerX

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

PubMed Central

The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data

Author: Berman Helen
Henrick Kim
Markley John L.
Nakamura Haruki
Publication venue: Oxford University Press
Publication date: 16/11/2006
Field of study

The worldwide Protein Data Bank (wwPDB) is the international collaboration that manages the deposition, processing and distribution of the PDB archive. The online PDB archive is a repository for the coordinates and related information for more than 38 000 structures, including proteins, nucleic acids and large macromolecular complexes that have been determined using X-ray crystallography, NMR and electron microscopy techniques. The founding members of the wwPDB are RCSB PDB (USA), MSD-EBI (Europe) and PDBj (Japan) [H.M. Berman, K. Henrick and H. Nakamura (2003) Nature Struct. Biol., 10, 980]. The BMRB group (USA) joined the wwPDB in 2006. The mission of the wwPDB is to maintain a single archive of macromolecular structural data that are freely and publicly available to the global community. Additionally, the wwPDB provides a variety of services to a broad community of users. The wwPDB website at provides information about services provided by the individual member organizations and about projects undertaken by the wwPDB

Crossref

PubMed Central

The protein structure initiative structural genomics knowledgebase

Author: A. Kouranov
Ashburner
Benson
Berman
Berman
Berman
Berman
Corpet
F. Kiefer
H. M. Berman
Haft
J. D. Westbrook
J. Kopp
J. L. Baer
K. Arnold
Kopp
Kouranov
L. Bordoli
L. G. Carter
Lo Conte
M. J. Gabanyi
M. Podvinec
Orengo
P. D. Adams
Pieper
R. Nair
R. Shah
Sonnhammer
T. Schwede
W. Minor
W. Tao
Wu
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

The Protein Structure Initiative Structural Genomics Knowledgebase (PSI SGKB, http://kb.psi-structuralgenomics.org) has been created to turn the products of the PSI structural genomics effort into knowledge that can be used by the biological research community to understand living systems and disease. This resource provides central access to structures in the Protein Data Bank (PDB), along with functional annotations, associated homology models, worldwide protein target tracking information, available protocols and the potential to obtain DNA materials for many of the targets. It also offers the ability to search all of the structural and methodological publications and the innovative technologies that were catalyzed by the PSI's high-throughput research efforts. In collaboration with the Nature Publishing Group, the PSI SGKB provides a research library, editorials about new research advances, news and an events calendar to present a broader view of structural biology and structural genomics. By making these resources freely available, the PSI SGKB serves as a bridge to connect the structural biology and the greater biomedical communities

Expresso: automatic incorporation of structural information in multiple sequence alignments using 3D-Coffee

Author: Armougom Fabrice
Audic Stéphane
Dumas Pierre
Keduas Vladimir
Moretti Sébastien
Notredame Cedric
Poirot Olivier
Schaeli Basile
Publication venue: Oxford University Press
Publication date: 14/07/2006
Field of study

Expresso is a multiple sequence alignment server that aligns sequences using structural information. The user only needs to provide sequences. The server runs BLAST to identify close homologues of the sequences within the PDB database. These PDB structures are used as templates to guide the alignment of the original sequences using structure-based sequence alignment methods like SAP or Fugue. The final result is a multiple sequence alignment of the original sequences based on the structural information of the templates. An advanced mode makes it possible to either upload private structures or specify which PDB templates should be used to model each sequence. Providing the suitable structural information is available, Expresso delivers sequence alignments with accuracy comparable with structure-based alignments. The server is available on

Infoscience - École polytechnique fédérale de Lausanne

Crossref

PubMed Central

Influenza Virus Database (IVDB): an integrated information resource and analysis platform for influenza virus research

Author: Chang Suhua
Feng Tao
Gao George F.
Liao Xiaoyun
Wang Dahai
Wang Jian
Wang Jing
Yang Huanming
Yu Jun
Zhang Jiajie
Zhu Baoli
Zhu Jiang
Zhu Xinxing
Publication venue: Oxford University Press
Publication date: 25/10/2006
Field of study

Frequent outbreaks of highly pathogenic avian influenza and the increasing data available for comparative analysis require a central database specialized in influenza viruses (IVs). We have established the Influenza Virus Database (IVDB) to integrate information and create an analysis platform for genetic, genomic, and phylogenetic studies of the virus. IVDB hosts complete genome sequences of influenza A virus generated by Beijing Institute of Genomics (BIG) and curates all other published IV sequences after expert annotation. Our Q-Filter system classifies and ranks all nucleotide sequences into seven categories according to sequence content and integrity. IVDB provides a series of tools and viewers for comparative analysis of the viral genomes, genes, genetic polymorphisms and phylogenetic relationships. A search system has been developed for users to retrieve a combination of different data types by setting search options. To facilitate analysis of global viral transmission and evolution, the IV Sequence Distribution Tool (IVDT) has been developed to display the worldwide geographic distribution of chosen viral genotypes and to couple genomic data with epidemiological data. The BLAST, multiple sequence alignment and phylogenetic analysis tools were integrated for online data analysis. Furthermore, IVDB offers instant access to pre-computed alignments and polymorphisms of IV genes and proteins, and presents the results as SNP distribution plots and minor allele distributions. IVDB is publicly available a

CiteSeerX

Crossref

PubMed Central

The EMBRACE web service collection

Author: A. B. Clegg
A. Liaquat
Altschul
Ashburner
C. Blanchet
Curcin
D. G. Pisano
D. Thorne
E. Bartaseviciute
E. Bongcam-Rudloff
G. Cameron
G. Vriend
H. Stockinger
Hull
I. Jonassen
I.- Partners
J. Ison
J. M. Fernandez
J. M. Rodriguez
J. Salzemann
K. Rapacki
Kouranov
M. Hekkelman
M. Kalas
M. Uludag
O. Sand
P. McDermott
P. Rice
Pillai
S. Pettifer
Smith
T. K. Attwood
V. Breton
Vriend
Wolstencroft
Publication venue: Oxford University Press
Publication date: 01/01/2010
Field of study

The EMBRACE (European Model for Bioinformatics Research and Community Education) web service collection is the culmination of a 5-year project that set out to investigate issues involved in developing and deploying web services for use in the life sciences. The project concluded that in order for web services to achieve widespread adoption, standards must be defined for the choice of web service technology, for semantically annotating both service function and the data exchanged, and a mechanism for discovering services must be provided. Building on this, the project developed: EDAM, an ontology for describing life science web services; BioXSD, a schema for exchanging data between services; and a centralized registry (http://www.embraceregistry.net) that collects together around 1000 services developed by the consortium partners. This article presents the current status of the collection and its associated recommendations and standards definitions

University of Bergen

HAL-IN2P3

Crossref

HAL Clermont Université

PubMed Central

UCL Discovery

DI-fusion

The University of Manchester - Institutional Repository

NORA - Norwegian Open Research Archives

Online Research Database In Technology

Protein Binding Site Prediction by Combining Hidden Markov Support Vector Machine and Profile-Based Propensities

Author: Bin Liu
Bingquan Liu
Fule Liu
Xiaolong Wang
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2014
Field of study

Crossref

fDETECT webserver: fast predictor of propensity for protein production, purification, and crystallization

Author: Kurgan Lukasz
Meng Fanchi
Wang Chen
Publication venue: VCU Scholars Compass
Publication date: 01/01/2017
Field of study

Background: Development of predictors of propensity of protein sequences for successful crystallization has been actively pursued for over a decade. A few novel methods that expanded the scope of these predictions to address additional steps of protein production and structure determination pipelines were released in recent years. The predictive performance of the current methods is modest. This is because the only input that they use is the protein sequence and since the experimental annotations of these data might be inconsistent given that they were collected across many laboratories and centers. However, even these modest levels of predictive quality are still practical compared to the reported low success rates of crystallization, which are below 10%. We focus on another important aspect related to a high computational cost of running the predictors that offer the expanded scope. Results: We introduce a novel fDETECT webserver that provides very fast and modestly accurate predictions of the success of protein production, purification, crystallization, and structure determination. Empirical tests on two datasets demonstrate that fDETECT is more accurate than the only other similarly fast method, and similarly accurate and three orders of magnitude faster than the currently most accurate predictors. Our method predicts a single protein in about 120 milliseconds and needs less than an hour to generate the four predictions for an entire human proteome. Moreover, we empirically show that fDETECT secures similar levels of predictive performance when compared with four representative methods that only predict success of crystallization, while it also provides the other three predictions. A webserver that implements fDETECT is available at http://biomine.cs.vcu.edu/servers/ fDETECT/. Conclusions: fDETECT is a computational tool that supports target selection for protein production and X-ray crystallography-based structure determination. It offers predictive quality that matches or exceeds other state-ofthe-art tools and is especially suitable for the analysis of large protein sets

VCU Scholars Compass