Search CORE

4 research outputs found

Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently "orphan" viral proteins

Author: Chung Betty
Cook Shelley
Eisenhaber Birgit
Karlin David
Kuchibhatla Durga
Schneider Georg
Sherman Westley
Publication venue: 'American Society for Microbiology'
Publication date: 01/01/2014
Field of study

The genome sequences of new viruses often contain many "orphan" or "taxon-specific" proteins apparently lacking homologs. However, because viral proteins evolve very fast, commonly used sequence similarity detection methods such as BLAST may overlook homologs. We analyzed a data set of proteins from RNA viruses characterized as "genus specific" by BLAST. More powerful methods developed recently, such as HHblits or HHpred (available through web-based, user-friendly interfaces), could detect distant homologs of a quarter of these proteins, suggesting that these methods should be used to annotate viral genomes. In-depth manual analyses of a subset of the remaining sequences, guided by contextual information such as taxonomy, gene order, or domain cooccurrence, identified distant homologs of another third. Thus, a combination of powerful automated methods and manual analyses can uncover distant homologs of many proteins thought to be orphans. We expect these methodological results to be also applicable to cellular organisms, since they generally evolve much more slowly than RNA viruses. As an application, we reanalyzed the genome of a bee pathogen, Chronic bee paralysis virus (CBPV). We could identify homologs of most of its proteins thought to be orphans; in each case, identifying homologs provided functional clues. We discovered that CBPV encodes a domain homologous to the Alphavirus methyltransferase-guanylyltransferase; a putative membrane protein, SP24, with homologs in unrelated insect viruses and insect-transmitted plant viruses having different morphologies (cileviruses, higreviruses, blunerviruses, negeviruses); and a putative virion glycoprotein, ORF2, also found in negeviruses. SP24 and ORF2 are probably major structural components of the virionsd

PubMed Central

IST Austria: PubRep (Institute of Science and Technology)

Powerful sequence similarity search methods and in-depth manual analyses can identify remote homologs in many apparently "orphan" viral proteins.

Author: Chung Betty YW
Cook Shelley
Eisenhaber Birgit
Karlin David G
Kuchibhatla Durga B
Schneider Georg
Sherman Westley A
Publication venue: J Virol
Publication date: 01/01/2014
Field of study

PubMed Central

IST Austria: PubRep (Institute of Science and Technology)

Apollo (Cambridge)

Tachyon search speeds up retrieval of similar sequences by several orders of magnitude

Author: Altschul
Benson
Chia Yee Kwoh
Durga Kuchibhatla
Fernanda L. Sirota
Frank Eisenhaber
Georg Schneider
Joshua Tan
Katoh
Kent
Ooi
Pearson
Sayers
Sebastian Maurer-Stroh
The Universal Protein Resource (UniProt) in 2010.
Tobias Gattermayer
Waterhouse
Westley A. Sherman
Wootton
Zhao
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Summary: The usage of current sequence search tools becomes increasingly slower as databases of protein sequences continue to grow exponentially. Tachyon, a new algorithm that identifies closely related protein sequences ~200 times faster than standard BLAST, circumvents this limitation with a reduced database and oligopeptide matching heuristic

Crossref

PubMed Central

DR-NTU (Digital Repository of NTU)

HPMV: Human protein mutation viewer — relating sequence mutations to protein sequence architecture and function changes

Author: Anant JS
Bendtsen JD
Birgit Eisenhaber
Durga Bhavani Kuchibhatla
Eisenhaber F
Flicek P
Frank Eisenhaber
Schneider G
Sebastian Maurer-Stroh
Vachiranee Limviphuvadh
Veske A
Villaveces JM
Westley Arthur Sherman
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date
Field of study

Crossref