132 research outputs found
SS-Wrapper: a package of wrapper applications for similarity searches on Linux clusters
BACKGROUND: Large-scale sequence comparison is a powerful tool for biological inference in modern molecular biology. Comparing new sequences to those in annotated databases is a useful source of functional and structural information about these sequences. Using software such as the basic local alignment search tool (BLAST) or HMMPFAM to identify statistically significant matches between newly sequenced segments of genetic material and those in databases is an important task for most molecular biologists. Searching algorithms are intrinsically slow and data-intensive, especially in light of the rapid growth of biological sequence databases due to the emergence of high throughput DNA sequencing techniques. Thus, traditional bioinformatics tools are impractical on PCs and even on dedicated UNIX servers. To take advantage of larger databases and more reliable methods, high performance computation becomes necessary. RESULTS: We describe the implementation of SS-Wrapper (Similarity Search Wrapper), a package of wrapper applications that can parallelize similarity search applications on a Linux cluster. Our wrapper utilizes a query segmentation-search (QS-search) approach to parallelize sequence database search applications. It takes into consideration load balancing between each node on the cluster to maximize resource usage. QS-search is designed to wrap many different search tools, such as BLAST and HMMPFAM using the same interface. This implementation does not alter the original program, so newly obtained programs and program updates should be accommodated easily. Benchmark experiments using QS-search to optimize BLAST and HMMPFAM showed that QS-search accelerated the performance of these programs almost linearly in proportion to the number of CPUs used. We have also implemented a wrapper that utilizes a database segmentation approach (DS-BLAST) that provides a complementary solution for BLAST searches when the database is too large to fit into the memory of a single node. CONCLUSIONS: Used together, QS-search and DS-BLAST provide a flexible solution to adapt sequential similarity searching applications in high performance computing environments. Their ease of use and their ability to wrap a variety of database search programs provide an analytical architecture to assist both the seasoned bioinformaticist and the wet-bench biologist
Genomic multiple sequence alignments: refinement using a genetic algorithm
BACKGROUND: Genomic sequence data cannot be fully appreciated in isolation. Comparative genomics – the practice of comparing genomic sequences from different species – plays an increasingly important role in understanding the genotypic differences between species that result in phenotypic differences as well as in revealing patterns of evolutionary relationships. One of the major challenges in comparative genomics is producing a high-quality alignment between two or more related genomic sequences. In recent years, a number of tools have been developed for aligning large genomic sequences. Most utilize heuristic strategies to identify a series of strong sequence similarities, which are then used as anchors to align the regions between the anchor points. The resulting alignment is globally correct, but in many cases is suboptimal locally. We describe a new program, GenAlignRefine, which improves the overall quality of global multiple alignments by using a genetic algorithm to improve local regions of alignment. Regions of low quality are identified, realigned using the program T-Coffee, and then refined using a genetic algorithm. Because a better COFFEE (Consistency based Objective Function For alignmEnt Evaluation) score generally reflects greater alignment quality, the algorithm searches for an alignment that yields a better COFFEE score. To improve the intrinsic slowness of the genetic algorithm, GenAlignRefine was implemented as a parallel, cluster-based program. RESULTS: We tested the GenAlignRefine algorithm by running it on a Linux cluster to refine sequences from a simulation, as well as refine a multiple alignment of 15 Orthopoxvirus genomic sequences approximately 260,000 nucleotides in length that initially had been aligned by Multi-LAGAN. It took approximately 150 minutes for a 40-processor Linux cluster to optimize some 200 fuzzy (poorly aligned) regions of the orthopoxvirus alignment. Overall sequence identity increased only slightly; but significantly, this occurred at the same time that the overall alignment length decreased – through the removal of gaps – by approximately 200 gapped regions representing roughly 1,300 gaps. CONCLUSION: We have implemented a genetic algorithm in parallel mode to optimize multiple genomic sequence alignments initially generated by various alignment tools. Benchmarking experiments showed that the refinement algorithm improved genomic sequence alignments within a reasonable period of time
Virus taxonomy: the database of the International Committee on Taxonomy of Viruses (ICTV)
The International Committee on Taxonomy of Viruses (ICTV) is charged with the task of developing, refining, and maintaining a universal virus taxonomy. This task encompasses the classification of virus species and higher-level taxa according to the genetic and biological properties of their members; naming virus taxa; maintaining a database detailing the currently approved taxonomy; and providing the database, supporting proposals, and other virus-related information from an open-access, public web site. The ICTV web site (http://ictv.global) provides access to the current taxonomy database in online and downloadable formats, and maintains a complete history of virus taxa back to the first release in 1971. The ICTV has also published the ICTV Report on Virus Taxonomy starting in 1971. This Report provides a comprehensive description of all virus taxa covering virus structure, genome structure, biology and phylogenetics. The ninth ICTV report, published in 2012, is available as an open-access online publication from the ICTV web site. The current, 10th report (http://ictv.global/report/), is being published online, and is replacing the previous hard-copy edition with a completely open access, continuously updated publication. No other database or resource exists that provides such a comprehensive, fully annotated compendium of information on virus taxa and taxonomy
PCR Based Microbial Monitor for Analysis of Recycled Water Aboard the ISSA: Issues and Prospects
The monitoring of spacecraft life support systems for the presence of health threatening microorganisms is paramount for crew well being and successful completion of missions. Development of technology to monitor spacecraft recycled water based on detection and identification of the genetic material of contaminating microorganisms and viruses would be a substantial improvement over current NASA plans to monitor recycled water samples that call for the use of conventional microbiology techniques which are slow, insensitive, and labor intensive. The union of the molecular biology techniques of DNA probe hybridization and polymerase chain reaction (PCR) offers a powerful method for the detection, identification, and quantification of microorganisms and viruses. This technology is theoretically capable of assaying samples in as little as two hours with specificity and sensitivity unmatched by any other method. A major advance in probe-hybridization/PCR has come about in a technology called TaqMan(TM), which was invented by Perkin Elmer. Instrumentation using TaqMan concepts is evolving towards devices that could meet NASA's needs of size, low power use, and simplicity of operation. The chemistry and molecular biology needed to utilize these probe-hybridization/PCR instruments must evolve in parallel with the hardware. The following issues of chemistry and biology must be addressed in developing a monitor: Early in the development of a PCR-based microbial monitor it will be necessary to decide how many and which organisms does the system need the capacity to detect. We propose a set of 17 different tests that would detect groups of bacteria and fungus, as well as specific eukaryotic parasites and viruses; In order to use the great sensitivity of PCR it will be necessary to concentrate water samples using filtration. If a lower limit of detection of 1 microorganism per 100 ml is required then the microbes in a 100 ml sample must be concentrated into a volume that can be added to a PCR assay; There are not likely to be contaminants in ISSA recycled water that would inhibit PCR resulting in false-negative results; The TaqMan PCR product detection system is the most promising method for developing a rapid, highly automated gene-based microbial monitoring system. The method is inherently quantitative. NASA and other government agencies have invested in other technologies that, although potentially could lead to revolutionary advances, are not likely to mature in the next 5 years into working systems; PCR-based methods cannot distinguish between DNA or RNA of a viable microorganism and that of a non-viable organism. This may or may not be an important issue with reclaimed water on the ISSA. The recycling system probably damages the capacity of the genetic material of any bacteria or viruses killed during processing to serve as a template in a PCR desinged to amplify a large segment of DNA (less than 650 base pairs). If necessary, vital dye staining could be used in addition to PCR, to enumerate the viable cells in a water sample; The quality control methods have been developed to insure that PCR's are working properly, and that reactions are not contaminated with PCR carryover products which could lead to the generation of false-positive results; and The sequences of the small rRNA subunit gene for a large number of microorganisms are known, and they consititue the best database for rational development of the oligonucleotide reagents that give PCR its great specificity. From those gene sequences, sets of oligonucleotide primers for PCR and Taqman detection that could be used in a NASA microbial monitor were constructed using computer based methods. In addition to space utilization, a microbial monitior will have tremendous terrestrial applications. Analysis of patient samples for microbial pathogens, testing industrial effluent for biofouling bacteria, and detection biological warfare agents on the battlefield are but a few of the diverse potential uses for this technology. Once fully developed, gene-based microbial monitors will become the fundamental tool in every lab that tests for microbial contaminants, and serve as a powerful weapon in mankind's war with the germ world
Orthopoxvirus Genome Evolution: The Role of Gene Loss
Poxviruses are highly successful pathogens, known to infect a variety of hosts. The family Poxviridae includes Variola virus, the causative agent of smallpox, which has been eradicated as a public health threat but could potentially reemerge as a bioterrorist threat. The risk scenario includes other animal poxviruses and genetically engineered manipulations of poxviruses. Studies of orthologous gene sets have established the evolutionary relationships of members within the Poxviridae family. It is not clear, however, how variations between family members arose in the past, an important issue in understanding how these viruses may vary and possibly produce future threats. Using a newly developed poxvirus-specific tool, we predicted accurate gene sets for viruses with completely sequenced genomes in the genus Orthopoxvirus. Employing sensitive sequence comparison techniques together with comparison of syntenic gene maps, we established the relationships between all viral gene sets. These techniques allowed us to unambiguously identify the gene loss/gain events that have occurred over the course of orthopoxvirus evolution. It is clear that for all existing Orthopoxvirus species, no individual species has acquired protein-coding genes unique to that species. All existing species contain genes that are all present in members of the species Cowpox virus and that cowpox virus strains contain every gene present in any other orthopoxvirus strain. These results support a theory of reductive evolution in which the reduction in size of the core gene set of a putative ancestral virus played a critical role in speciation and confining any newly emerging virus species to a particular environmental (host or tissue) niche
Poxvirus Bioinformatics Resource Center: a comprehensive Poxviridae informational and analytical resource
The Poxvirus Bioinformatics Resource Center (PBRC) has been established to provide informational and analytical resources to the scientific community to aid research directed at providing a better understanding of the Poxviridae family of viruses. The PBRC was specifically established as the result of the concern that variola virus, the causative agent of smallpox, as well as related viruses, might be utilized as biological weapons. In addition, the PBRC supports research on poxviruses that might be considered new and emerging infectious agents such as monkeypox virus. The PBRC consists of a relational database and web application that supports the data storage, annotation, analysis and information exchange goals of the project. The current release consists of over 35 complete genomic sequences of various genera, species and strains of viruses from the Poxviridae family. Sequence and annotation information for these viruses has been obtained from sequences publicly available from GenBank as well as sequences not yet deposited in GenBank that have been obtained from ongoing sequencing projects. In addition to sequence data, the PBRC provides comprehensive annotation and curation of virus genes; analytical tools to aid in the understanding of the available sequence data, including tools for the comparative analysis of different virus isolates; and visualization tools to help better display the results of various analyses. The PBRC represents the initial development of what will become a more comprehensive Viral Bioinformatics Resource Center for Biodefense that will be one of the National Institute of Allergy and Infectious Diseases' ‘Bioinformatics Resource Centers for Biodefense and Emerging or Re-Emerging Infectious Diseases’. The PBRC website is available at http://www.poxvirus.org
Ratification vote on taxonomic proposals to the International Committee on Taxonomy of Viruses (2016)
This article lists the changes to virus taxonomy approved and ratified by the International Committee on Taxonomy of Viruses (ICTV) in April 2016.
Changes to virus taxonomy (the Universal Scheme of Virus Classification of the International Committee on Taxonomy of Viruses [ICTV]) now take place annually and are the result of a multi-stage process. In accordance with the ICTV Statutes (http://​www.​ictvonline.​org/​statutes.​asp), proposals submitted to the ICTV Executive Committee (EC) undergo a review process that involves input from the ICTV Study Groups (SGs) and Subcommittees (SCs), other interested virologists, and the EC. After final approval by the EC, proposals are then presented for ratification to the full ICTV membership by publication on an ICTV web site (http://​www.​ictvonline.​org/​) followed by an electronic vote. The latest set of proposals approved by the EC was made available on the ICTV website by January 2016 (https://​talk.​ictvonline.​org/​files/​proposals/​). A list of these proposals was then emailed on 28 March 2016 to the 148 members of ICTV, namely the EC Members, Life Members, ICTV Subcommittee Members (including the SG chairs) and ICTV National Representatives. Members were then requested to vote on whether to ratify the taxonomic proposals (voting closed on 29 April 2016)
ICTV Virus Taxonomy Profile: Bromoviridae
[EN] Bromoviridae is a family of plant viruses with tri-segmented, positive-sense, single-stranded RNA genomes of about 8 kb in total. Genomic RNAs are packaged in separate virions that may also contain subgenomic, defective or satellite RNAs. Virions are variable in morphology (spherical or bacilliform) and are transmitted between hosts mechanically, in/on the pollen and non-persistently by insect vectors. Members of the family are responsible for major disease epidemics in fruit, vegetable and fodder crops such as tomato, cucurbits, bananas, fruit trees and alfalfa. This is a summary of the International Committee on Taxonomy of Viruses (ICTV) Report on the family Bromoviridae, which is available at www.ictv.global/report/bromoviridae.Production of this summary, the online chapter and associated resources was funded by a grant from the Wellcome Trust (WT108418AIA). Members of the ICTV (10th) Report Consortium are Elliot J. Lefkowitz, Andrew J. Davison, Stuart G. Siddell, Peter Simmonds, Sead Sabanadzovic, Donald B. Smith, Richard J. Orton and F. Murilo Zerbini.Bujarski, J.; Gallitelli, D.; Garcia, F.; Pallás Benet, V.; Palukaitis, P.; Reddy, M.; Wang, A.... (2019). ICTV Virus Taxonomy Profile: Bromoviridae. Journal of General Virology. 100(8):1206-1207. https://doi.org/10.1099/jgv.0.001282S120612071008Pallas, V., Aparicio, F., Herranz, M. C., Sanchez-Navarro, J. A., & Scott, S. W. (2013). The Molecular Biology of Ilarviruses. Advances in Virus Research, 139-181. doi:10.1016/b978-0-12-407698-3.00005-3Hanssen, I. M., & Lapidot, M. (2012). Major Tomato Viruses in the Mediterranean Basin. Viruses and Virus Diseases of Vegetables in the Mediterranean Basin, 31-66. doi:10.1016/b978-0-12-394314-9.00002-6Jacquemond, M. (2012). Cucumber Mosaic Virus. Viruses and Virus Diseases of Vegetables in the Mediterranean Basin, 439-504. doi:10.1016/b978-0-12-394314-9.00013-0Pallas, V., Aparicio, F., Herranz, M. C., Amari, K., Sanchez-Pina, M. A., Myrta, A., & Sanchez-Navarro, J. A. (2012). Ilarviruses of Prunus spp.: A Continued Concern for Fruit Trees. Phytopathology®, 102(12), 1108-1120. doi:10.1094/phyto-02-12-0023-rv
Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop
Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world’s biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop
ICTV Virus Taxonomy Profile: Ophioviridae
[EN] The Ophioviridae is a family of filamentous plant viruses, with single-stranded negative, and possibly ambisense, RNA genomes of 11.3-12.5 kb divided into 3-4 segments, each encapsidated separately. Virions are naked filamentous nucleocapsids, forming kinked circles of at least two different contour lengths. The sole genus, Ophiovirus, includes seven species. Four ophioviruses are soil-transmitted and their natural hosts include trees, shrubs, vegetables and bulbous or corm-forming ornamentals, both monocots and dicots. This is a summary of the International Committee on Taxonomy of Viruses (ICTV) Report on the taxonomy of the Ophioviridae, which is available at http://www.ictv.global/report/ophioviridae.Production of this summary, the online chapter and associated resources was funded by a grant from the Wellcome Trust (WT108418AIA).Garcia, M.; Dal Bo, E.; Da Graca, JV.; Gago Zachert, SP.; Hammond, J.; Moreno, P.; Natsuaki, T.... (2017). ICTV Virus Taxonomy Profile: Ophioviridae. Journal of General Virology. 98(6):1161-1162. doi:10.1099/jgv.0.000836S1161116298
- …