Search CORE

6,107 research outputs found

SIFTER search: a web server for accurate phylogeny-based protein function prediction.

Author: Brenner Steven E
Luo Kevin R
Sahraeian Sayed M
Publication venue: eScholarship, University of California
Publication date: 01/01/2015
Field of study

We are awash in proteins discovered through high-throughput sequencing projects. As only a minuscule fraction of these have been experimentally characterized, computational methods are widely used for automated annotation. Here, we introduce a user-friendly web interface for accurate protein function prediction using the SIFTER algorithm. SIFTER is a state-of-the-art sequence-based gene molecular function prediction algorithm that uses a statistical model of function evolution to incorporate annotations throughout the phylogenetic tree. Due to the resources needed by the SIFTER algorithm, running SIFTER locally is not trivial for most users, especially for large-scale problems. The SIFTER web server thus provides access to precomputed predictions on 16 863 537 proteins from 232 403 species. Users can explore SIFTER predictions with queries for proteins, species, functions, and homologs of sequences not in the precomputed prediction set. The SIFTER web server is accessible at http://sifter.berkeley.edu/ and the source code can be downloaded

CiteSeerX

PubMed Central

eScholarship - University of California

WormBase 2007

Author: A. Petcherski
A. Rogers
Ashburner
C. Bastiani
C. Nakamura
D. Blasiar
D. Wang
Deplancke
E. M. Schwarz
G. Schindelman
G. Williams
H.-M. Muller
Hillier
Husson
I. Antoshechkin
J. Chan
J. Fernandes
J. Spieth
K. Van Auken
K. Yook
Kirienko
L. D. Stein
Li
Li
M. A. Tuli
M. Han
Matera
Meyer
M ller
O'Brien
P. Canaran
P. Davis
P. Ozersky
P. W. Sternberg
Potter
R. Durbin
R. Kishore
R. Lee
Ruby
S. McKay
T. Bieri
T. J. Fiedler
T. W. Harris
Tatusov
W. J. Chen
W. Spooner
Wachi
Walhout
X. Wang
Zemann
Zhang
Zhong
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2008
Field of study

WormBase (www.wormbase.org) is the major publicly available database of information about Caenorhabditis elegans, an important system for basic biological and biomedical research. Derived from the initial ACeDB database of C. elegans genetic and sequence information, WormBase now includes the genomic, anatomical and functional information about C. elegans, other Caenorhabditis species and other nematodes. As such, it is a crucial resource not only for C. elegans biologists but the larger biomedical and bioinformatics communities. Coverage of core areas of C. elegans biology will allow the biomedical community to make full use of the results of intensive molecular genetic analysis and functional genomic studies of this organism. Improved search and display tools, wider cross-species comparisons and extended ontologies are some of the features that will help scientists extend their research and take advantage of other nematode species genome sequences

Crossref

Cold Spring Harbor Laboratory Institutional Repository

PubMed Central

Caltech Authors

An analysis of the Sargasso Sea resource and the consequences for database composition

Author: Cozzetto D
Tramontano A
Tress ML
Valencia A
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2006
Field of study

Background: The environmental sequencing of the Sargasso Sea has introduced a huge new resource of genomic information. Unlike the protein sequences held in the current searchable databases, the Sargasso Sea sequences originate from a single marine environment and have been sequenced from species that are not easily obtainable by laboratory cultivation. The resource also contains very many fragments of whole protein sequences, a side effect of the shotgun sequencing method.These sequences form a significant addendum to the current searchable databases but also present us with some intrinsic difficulties. While it is important to know whether it is possible to assign function to these sequences with the current methods and whether they will increase our capacity to explore sequence space, it is also interesting to know how current bioinformatics techniques will deal with the new sequences in the resource.Results: The Sargasso Sea sequences seem to introduce a bias that decreases the potential of current methods to propose structure and function for new proteins. In particular the high proportion of sequence fragments in the resource seems to result in poor quality multiple alignments.Conclusion: These observations suggest that the new sequences should be used with care, especially if the information is to be used in large scale analyses. On a positive note, the results may just spark improvements in computational and experimental methods to take into account the fragments generated by environmental sequencing techniques

Springer - Publisher Connector

Directory of Open Access Journals

UCL Discovery

PubMed Central

Digital.CSIC

Phytome: a platform for plant comparative genomics

Author: Hartmann Stefanie
Lu Dihui
Phillips Jason
Vision Todd J.
Publication venue: Oxford University Press
Publication date: 28/12/2005
Field of study

Phytome is an online comparative genomics resource that can be applied to functional plant genomics, molecular breeding and evolutionary studies. It contains predicted protein sequences, protein family assignments, multiple sequence alignments, phylogenies and functional annotations for proteins from a large, phylogenetically diverse set of plant taxa. Phytome serves as a glue between disparate plant gene databases both by identifying the evolutionary relationships among orthologous and paralogous protein sequences from different species and by enabling cross-references between different versions of the same gene curated independently by different database groups. The web interface enables sophisticated queries on lineage-specific patterns of gene/protein family proliferation and loss. This rich dataset is serving as a platform for the unification of sequence-anchored comparative maps across taxonomic families of plants. The Phytome web interface can be accessed at the following URL: . Batch homology searches and bulk downloads are available upon free registration

CiteSeerX

Crossref

PubMed Central

Carolina Digital Repository