Search CORE

19 research outputs found

Supplement: VIRify manuscript

Author: Martin Beracochea
Martin Hölzer
Publication venue: OSF
Publication date: 12/07/2023
Field of study

OSF Preprints

Recommended from our members

VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models.

Author: Almeida Alexandre
Beracochea Martin
Finn Robert D
Hölzer Martin
Marz Manja
Rangel-Pineros Guillermo
Reyes Muñoz Alejandro
Sakharova Ekaterina
Publication venue: PLoS Comput Biol
Publication date: 18/09/2023
Field of study

Acknowledgements: The authors thank Franziska Hufsky for improving the readability of the illustrations. The authors would also like to acknowledge Lorna Richardson for contributing to the edition of the final manuscript version.The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them

Apollo (Cambridge)

VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models.

Author: Alejandro Reyes Muñoz
Alexandre Almeida
Ekaterina Sakharova
Guillermo Rangel-Pineros
Manja Marz
Martin Beracochea
Martin Hölzer
Robert D Finn
Publication venue: Public Library of Science (PLoS)
Publication date: 01/08/2023
Field of study

The study of viral communities has revealed the enormous diversity and impact these biological entities have on various ecosystems. These observations have sparked widespread interest in developing computational strategies that support the comprehensive characterisation of viral communities based on sequencing data. Here we introduce VIRify, a new computational pipeline designed to provide a user-friendly and accurate functional and taxonomic characterisation of viral communities. VIRify identifies viral contigs and prophages from metagenomic assemblies and annotates them using a collection of viral profile hidden Markov models (HMMs). These include our manually-curated profile HMMs, which serve as specific taxonomic markers for a wide range of prokaryotic and eukaryotic viral taxa and are thus used to reliably classify viral contigs. We tested VIRify on assemblies from two microbial mock communities, a large metagenomics study, and a collection of publicly available viral genomic sequences from the human gut. The results showed that VIRify could identify sequences from both prokaryotic and eukaryotic viruses, and provided taxonomic classifications from the genus to the family rank with an average accuracy of 86.6%. In addition, VIRify allowed the detection and taxonomic classification of a range of prokaryotic and eukaryotic viruses present in 243 marine metagenomic assemblies. Finally, the use of VIRify led to a large expansion in the number of taxonomically classified human gut viral sequences and the improvement of outdated and shallow taxonomic classifications. Overall, we demonstrate that VIRify is a novel and powerful resource that offers an enhanced capability to detect a broad range of viral contigs and taxonomically classify them

Directory of Open Access Journals

Recommended from our members

A unified sequence catalogue of over 280,000 genomes obtained from the human gut microbiome

Author: Almeida Alexandre
Beracochea Martin
Boland Miguel
Finn Robert
Hugenholtz Philip
Kyrpides Nikos
Nayfach Stephen
Parks Donovan
Pollard Katherine
Segata Nicola
Shi Zhou Jason
Strozzi Francesco
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Comprehensive reference data is essential for accurate taxonomic and functional characterization of the human gut microbiome. Here we present the Unified Human Gastrointestinal Genome (UHGG) collection, a resource combining 286,997 genomes representing 4,644 prokaryotic species from the human gut. These genomes contain over 625 million protein sequences used to generate the Unified Human Gastrointestinal Protein (UHGP) catalogue, a collection that more than doubles the number of gut protein clusters over the Integrated Gene Catalogue. We find that a large portion of the human gut microbiome remains to be fully explored, with over 70% of the UHGG species lacking cultured representatives, and 40% of the UHGP missing meaningful functional annotations. Intra-species genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which were specific to individual human populations. These freely available genomic resources should greatly facilitate investigations into the human gut microbiome

eScholarship - University of California

Recommended from our members

A unified sequence catalogue of over 280,000 genomes obtained from the human gut microbiome

Author: Almeida Alexandre
Beracochea Martin
Boland Miguel
Finn Robert
Hugenholtz Philip
Kyrpides Nikos
Nayfach Stephen
Parks Donovan
Pollard Katherine
Segata Nicola
Shi Zhou Jason
Strozzi Francesco
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

eScholarship - University of California

Interactions between modafinil and cocaine during the induction of conditioned place preference and locomotor sensitization in mice: Implications for addiction

Author: Ackerman
Anagnostaras
Anagnostaras
Ballon
Bastuji
Beracochea
Beracochea
Beracochea
Bernardi
Dackis
De Vries
Denise J. Cai
Deroche-Gamonet
Ferraro
Ferraro
Garreau
Gold
Grabowski
Grabowski
Hart
Jennifer R. Sage
Karila
Korotkova
Madras
Malcolm
Martin-Iverson
Minzenberg
Murillo-Rodriguez
Myrick
Nguyen
O’Connor
Paterson
Pierce
Qu
Robinson
Robinson
Robinson
Rush
Schechter
Shearer
Shuman
Simon
Simon
Stephan G. Anagnostaras
Stoops
Tristan Shuman
Turner
van Vliet
Volkow
Vosburg
Warot
Wood
Wuo-Silva
Zolkowska
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

A unified catalog of 204,938 reference genomes from the human gut microbiome

Author: Almeida Alexandre
Beracochea Martin
Boland Miguel
Finn Robert D.
Hugenholtz Philip
Kyrpides Nikos C.
Nayfach Stephen
Parks Donovan H.
Pollard Katherine S.
Sakharova Ekaterina
Segata Nicola
Shi Zhou Jason
Strozzi Francesco
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 20/07/2020
Field of study

Comprehensive, high-quality reference genomes are required for functional characterization and taxonomic assignment of the human gut microbiota. We present the Unified Human Gastrointestinal Genome (UHGG) collection, comprising 204,938 nonredundant genomes from 4,644 gut prokaryotes. These genomes encode >170 million protein sequences, which we collated in the Unified Human Gastrointestinal Protein (UHGP) catalog. The UHGP more than doubles the number of gut proteins in comparison to those present in the Integrated Gene Catalog. More than 70% of the UHGG species lack cultured representatives, and 40% of the UHGP lack functional annotations. Intraspecies genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which are specific to individual human populations. The UHGG and UHGP collections will enable studies linking genotypes to phenotypes in the human gut microbiome

University of Queensland eSpace

Treatments with eCG and courtship behaviour in rams during the breeding and the non-breeding seasons

Author: Aguirre
Agustín Orihuela
Balthazart
Beracochea
Bodin
Courot
Croker
De Lucas-Tron
Fulkerson
Hervé
Hochereau-De Reviers
Knight
Martin
Maurel
Murphy
Neftalí Clemente
Perkins
Price
Rekkas
Rodolfo Ungerfeld
Roy
Signoret
Ungerfeld
Ungerfeld
Ungerfeld
Ungerfeld
Publication venue: 'CSIRO Publishing'
Publication date: 01/01/2019
Field of study

Crossref

MGnify: the microbiome sequence data analysis resource in 2023

Author: Allen Ben
Baldi Germana
Beracochea Martin
Bileschi Maxwell L
Burdett Tony
Burgin Josephine
Caballero-Pérez Juan
Cochrane Guy
Colwell Lucy J
Curtis Tom
Escobar-Zepeda Alejandra
Finn Robert D
Gurbich Tatiana A
Kale Varsha
Korobeynikov Anton
Raj Shriya
Richardson Lorna
Rogers Alexander B
Sakharova Ekaterina
Sanchez Santiago
Wilkinson Darren J
Publication venue: Oxford University Press
Publication date: 07/12/2022
Field of study

The MGnify platform (https://www.ebi.ac.uk/metagenomics) facilitates the assembly, analysis and archiving of microbiome-derived nucleic acid sequences. The platform provides access to taxonomic assignments and functional annotations for nearly half a million analyses covering metabarcoding, metatranscriptomic, and metagenomic datasets, which are derived from a wide range of different environments. Over the past 3 years, MGnify has not only grown in terms of the number of datasets contained but also increased the breadth of analyses provided, such as the analysis of long-read sequences. The MGnify protein database now exceeds 2.4 billion non-redundant sequences predicted from metagenomic assemblies. This collection is now organised into a relational database making it possible to understand the genomic context of the protein through navigation back to the source assembly and sample metadata, marking a major improvement. To extend beyond the functional annotations already provided in MGnify, we have applied deep learning-based annotation methods. The technology underlying MGnify's Application Programming Interface (API) and website has been upgraded, and we have enabled the ability to perform downstream analysis of the MGnify data through the introduction of a coupled Jupyter Lab environment

Durham Research Online

The psychostimulant modafinil facilitates water maze performance and augments synaptic potentiation in dentate gyrus

Crossref