31,155 research outputs found

A biophysical approach to large-scale protein-DNA binding data

Author: Manke T.
Roider H.
Vingron M.
Publication date: 01/01/2008

Cutting-edge genome analysis methods from leading bioinformaticians. Research from the BioSapiens Network of Excellence presents an accurate description of current scientific developments in bioinformatics and its computational implementation. Bioinformatics is essential for annotating the structure and function of genes and proteins, for analysing complete genomes, and for molecular biology and biochemistry. Included is an overview of bioinformatics and the full spectrum of genome annotation approaches: genome analysis and gene prediction, gene regulation analysis and expression, genome variation and QTL analysis, large-scale protein annotation of function and structure, annotation and prediction of protein interactions, and the organization and annotation of molecular networks and biochemical pathways. Also covered are a technical framework to organize and represent genome data using the DAS technology, and work on the annotation of two large genomic sets: HIV/HCV viral genomes and splicing alternatives potentially encoded in 1% of the human genome.

Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis.

Author: Du Yushen
Gong Danyang
Jiang Lin
Shu Sara
Sun Ren
Wu Nicholas C
Wu Ting-Ting
Zhang Tianhao
Publication venue: eScholarship, University of California
Publication date: 01/11/2016

Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available.

Importance: To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is usually limited by sampling size. Sequence conservation-based methods are further confounded by structural constraints and multifunctionality of proteins. Here we present a method that can systematically identify and annotate functional residues of a given protein. We used a high-throughput functional profiling platform to identify essential residues. Coupling it with homologous-structure comparison, we were able to annotate multiple functions of proteins. We demonstrated the method with the PB1 protein of influenza A virus and identified novel functional residues in addition to its canonical function as an RNA-dependent RNA polymerase. Not limited to virology, this method is generally applicable to other proteins that can be functionally selected and about which homologous-structure information is available.

Directory of Open Access Journals

eScholarship - University of California

SIFTER search: a web server for accurate phylogeny-based protein function prediction.

Author: Brenner Steven E
Luo Kevin R
Sahraeian Sayed M
Publication venue: eScholarship, University of California
Publication date: 01/01/2015

We are awash in proteins discovered through high-throughput sequencing projects. As only a minuscule fraction of these have been experimentally characterized, computational methods are widely used for automated annotation. Here, we introduce a user-friendly web interface for accurate protein function prediction using the SIFTER algorithm. SIFTER is a state-of-the-art sequence-based gene molecular function prediction algorithm that uses a statistical model of function evolution to incorporate annotations throughout the phylogenetic tree. Due to the resources needed by the SIFTER algorithm, running SIFTER locally is not trivial for most users, especially for large-scale problems. The SIFTER web server thus provides access to precomputed predictions on 16 863 537 proteins from 232 403 species. Users can explore SIFTER predictions with queries for proteins, species, functions, and homologs of sequences not in the precomputed prediction set. The SIFTER web server is accessible at http://sifter.berkeley.edu/ and the source code can be downloaded

CiteSeerX

eScholarship - University of California

The Toxoplasma gondii plastid replication and repair enzyme complex, PREX

Author: Barrett M P
Chen C-Y
Doerig C
Henriquez F L
Mukhopadhyay A
Roberts C W
Roberts Craig
Publication venue: Cambridge University Press (CUP)
Publication date: 30/04/2009

A plastid-like organelle, the apicoplast, is essential to the majority of medically and veterinary important apicomplexan protozoa including Toxoplasma gondii and Plasmodium. The apicoplast contains multiple copies of a 35 kb genome, the replication of which is dependent upon nuclear-encoded proteins that are imported into the organelle. In P. falciparum an unusual multi-functional gene, pfprex, was previously identified and inferred to encode a protein with DNA primase, DNA helicase and DNA polymerase activities. Herein, we report the presence of a prex orthologue in T. gondii. The protein is predicted to have a bi-partite apicoplast targeting sequence similar to that demonstrated on the PfPREX polypeptide, capable of delivering marker proteins to the apicoplast. Unlike the P. falciparum gene that is devoid of introns, the T. gondii prex gene carries 19 introns, which are spliced to produce a contiguous mRNA. Bacterial expression of the polymerase domain reveals the protein to be active. Consistent with the reported absence of a plastid in Cryptosporidium species, in silico analysis of their genomes failed to demonstrate an orthologue of prex. These studies indicate that prex is conserved across the plastid-bearing apicomplexans and may play an important role in the replication of the plastid genome

Enlighten