Search CORE

671 research outputs found

Wochenende — modular and flexible alignment-based shotgun metagenome analysis

Author: Chhatwal Patrick
Davenport Colin F.
Friedrich Fabian C.
Hollstein Lisa
Pust Marie-Madlen
Pörtner Sophia
Rosenboom Ilona
Rosenhahn Bodo
Scheithauer Tobias
Sifakis Konstantinos
Tümmler Burkhard
Wehrbein Tom
Wiehlmann Lutz
Publication venue: London : BioMed Central
Publication date: 01/01/2022
Field of study

Background: Shotgun metagenome analysis provides a robust and verifiable method for comprehensive microbiome analysis of fungal, viral, archaeal and bacterial taxonomy, particularly with regard to visualization of read mapping location, normalization options, growth dynamics and functional gene repertoires. Current read classification tools use non-standard output formats, or do not fully show information on mapping location. As reference datasets are not perfect, portrayal of mapping information is critical for judging results effectively. Results: Our alignment-based pipeline, Wochenende, incorporates flexible quality control, trimming, mapping, various filters and normalization. Results are completely transparent and filters can be adjusted by the user. We observe stringent filtering of mismatches and use of mapping quality sharply reduces the number of false positives. Further modules allow genomic visualization and the calculation of growth rates, as well as integration and subsequent plotting of pipeline results as heatmaps or heat trees. Our novel normalization approach additionally allows calculation of absolute abundance profiles by comparison with reads assigned to the human host genome. Conclusion: Wochenende has the ability to find and filter alignments to all kingdoms of life using both short and long reads, and requires only good quality reference genomes. Wochenende automatically combines multiple available modules ranging from quality control and normalization to taxonomic visualization. Wochenende is available at https://github.com/MHH-RCUG/nf_wochenende

PubMed Central

Institutionelles Repositorium der Leibniz Universität Hannover

Challenges and opportunities in understanding microbial communities with metagenome assembly (accompanied by IPython Notebook tutorial)

Author: Chain Patrick
Howe Adina
Howe Adina
Publication venue: Iowa State University Digital Repository
Publication date: 01/01/2015
Field of study

Metagenomic investigations hold great promise for informing the genetics, physiology, and ecology of environmental microorganisms. Current challenges for metagenomic analysis are related to our ability to connect the dots between sequencing reads, their population of origin, and their encoding functions. Assembly-based methods reduce dataset size by extending overlapping reads into larger contiguous sequences (contigs), providing contextual information for genetic sequences that does not rely on existing references. These methods, however, tend to be computationally intensive and are again challenged by sequencing errors as well as by genomic repeats While numerous tools have been developed based on these methodological concepts, they present confounding choices and training requirements to metagenomic investigators. To help with accessibility to assembly tools, this review also includes an IPython Notebook metagenomic assembly tutorial. This tutorial has instructions for execution any operating system using Amazon Elastic Cloud Compute and guides users through downloading, assembly, and mapping reads to contigs of a mock microbiome metagenome. Despite its challenges, metagenomic analysis has already revealed novel insights into many environments on Earth. As software, training, and data continue to emerge, metagenomic data access and its discoveries will to grow

Digital Repository @ Iowa State University (ISU)

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

MicroScope: a platform for microbial genome annotation and comparative genomics

Author: A. Lajus
Almeida
Bairoch
Bairoch
Barbe
Bendtsen
Bocs
Bryson
C. Médigue
C. Scarpelli
Carver
Caspi
Claudel-Renard
Cruveiller
D'A
D. Mornico
D. Roche
D. Vallenet
G. Salvignol
Gardner
Gardy
Gil
Glasner
Hacker
Hubbard
Hunter
Kanehisa
Karp
Klimke
L. Fleury
Lagesen
Lima
Lowe
Marcotte
Markowitz
Markowitz
Matsumoto
Meyer
Overbeek
Overbeek
Overbeek
Pellegrini
Pelletier
Pruitt
S. Cruveiller
S. Engelen
Saier
Salzberg
Sayers
Selengut
Serres
Sonnhammer
Tatusov
Vallenet
Walter
Waterhouse
Winsor
Z. Rouy
Publication venue: Oxford University Press
Publication date: 01/01/2009
Field of study

The initial outcome of genome sequencing is the creation of long text strings written in a four letter alphabet. The role of in silico sequence analysis is to assist biologists in the act of associating biological knowledge with these sequences, allowing investigators to make inferences and predictions that can be tested experimentally. A wide variety of software is available to the scientific community, and can be used to identify genomic objects, before predicting their biological functions. However, only a limited number of biologically interesting features can be revealed from an isolated sequence. Comparative genomics tools, on the other hand, by bringing together the information contained in numerous genomes simultaneously, allow annotators to make inferences based on the idea that evolution and natural selection are central to the definition of all biological processes. We have developed the MicroScope platform in order to offer a web-based framework for the systematic and efficient revision of microbial genome annotation and comparative analysis (http://www.genoscope.cns.fr/agc/microscope). Starting with the description of the flow chart of the annotation processes implemented in the MicroScope pipeline, and the development of traditional and novel microbial annotation and comparative analysis tools, this article emphasizes the essential role of expert annotation as a complement of automatic annotation. Several examples illustrate the use of implemented tools for the review and curation of annotations of both new and publicly available microbial genomes within MicroScope’s rich integrated genome framework. The platform is used as a viewer in order to browse updated annotation information of available microbial genomes (more than 440 organisms to date), and in the context of new annotation projects (117 bacterial genomes). The human expertise gathered in the MicroScope database (about 280,000 independent annotations) contributes to improve the quality of microbial genome annotation, especially for genomes initially analyzed by automatic procedures alone

Crossref

PubMed Central

HAL-CEA

The Role of Genomics in the Identification, Prediction, and Prevention of Biological Threats

Author: A. Y Peleg
B. J Tindall
B. L Haagmans
C Franco
D Boxrud
D Field
D Wang
David A. Rasko
E Guzman
G. M Garrity
Jacques Ravel
M Zhu
M. A Marra
M. C Schatz
P Aldhous
P Gerner-Smidt
P Keim
P. J Turnbaugh
R Urwin
R. D Fleischmann
S Bambini
S Maurer-Stroh
S Srinivasan
T Rowe
T. D Read
T. T Binnewies
W. Florian Fricke
X Pourrut
Z Gao
Publication venue: Public Library of Science
Publication date: 01/10/2009
Field of study

In all likelihood, it is only a matter of time before our public health system will face a major biological threat, whether intentionally dispersed or originating from a known or newly emerging infectious disease. It is necessary not only to increase our reactive “biodefense,” but also to be proactive and increase our preparedness. To achieve this goal, it is essential that the scientific and public health communities fully embrace the genomic revolution, and that novel bioinformatic and computing tools necessary to make great strides in our understanding of these novel and emerging threats be developed. Genomics has graduated from a specialized field of science to a research tool that soon will be routine in research laboratories and clinical settings. Because the technology is becoming more affordable, genomics can and should be used proactively to build our preparedness and responsiveness to biological threats. All pieces, including major continued funding, advances in next-generation sequencing technologies, bioinformatics infrastructures, and open access to data and metadata, are being set in place for genomics to play a central role in our public health system

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Translational web robots for pathogen genome analysis

Author: A Kahvejian
AC McHardy
C Hyland
D Parks
G Mariscal
J Shon
JW Huss
M Haeussler
OG Pybus
PS Dehal
SM Leach
T Davidsen
T Oinn
V Sintchenko
V Sintchenko
VM Markowitz
Y Kano
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

4 page(s

Crossref

Springer - Publisher Connector

PubMed Central

Macquarie University ResearchOnline

Using KBase to Assemble and Annotate Prokaryotic Genomes

Author: Allen Benjamin
Drake Meghan
Harris Nomi
Sullivan Tarah
Publication venue: eScholarship, University of California
Publication date: 01/08/2017
Field of study

The DOE Systems Biology Knowledgebase (KBase, http://kbase.us/) is an open-access bioinformatics software and data platform for analyzing plants, microbes, and their communities. KBase enables scientists to create, execute, collaborate on, and share reproducible analyses of their biological data in the context of public data and private collaborator data. For microbiologists researching prokaryotes, KBase offers analysis tools for performing quality control and assessment of Next-Generation Sequencing reads, de novo assembly, genome annotation, and tools for analyzing structural and functional features of genomes. This unit demonstrates an example workflow for taking a comparative and iterative approach to assembly and annotation of prokaryotic genomes using KBase that can be used by microbiologists seeking to perform isolate analysis in a rapid and reproducible fashion. © 2017 by John Wiley & Sons, Inc

Crossref

eScholarship - University of California

Understanding the Evolutionary Relationships and Major Traits of \u3cem\u3eBacillus\u3c/em\u3e through Comparative Genomics

Author: Alcaraz Luis D.
Eguiarte Luis E.
Herrera-Estrella Luis
Moreno-Hagelsieb Gabriel
Olmedo Gabriela
Souza Valeria
Publication venue: Scholars Commons @ Laurier
Publication date: 01/05/2010
Field of study

Background: The presence of Bacillus in very diverse environments reflects the versatile metabolic capabilities of a widely distributed genus. Traditional phylogenetic analysis based on limited gene sampling is not adequate for resolving the genus evolutionary relationships. By distinguishing between core and pan-genome, we determined the evolutionary and functional relationships of known Bacillus. Results: Our analysis is based upon twenty complete and draft Bacillus genomes, including a newly sequenced Bacillus isolate from an aquatic environment that we report for the first time here. Using a core genome, we were able to determine the phylogeny of known Bacilli, including aquatic strains whose position in the phylogenetic tree could not be unambiguously determined in the past. Using the pan-genome from the sequenced Bacillus, we identified functional differences, such as carbohydrate utilization and genes involved in signal transduction, which distinguished the taxonomic groups. We also assessed the genetic architecture of the defining traits of Bacillus, such as sporulation and competence, and showed that less than one third of the B. subtilis genes are conserved across other Bacilli. Most variation was shown to occur in genes that are needed to respond to environmental cues, suggesting that Bacilli have genetically specialized to allow for the occupation of diverse habitats and niches. Conclusions: The aquatic Bacilli are defined here for the first time as a group through the phylogenetic analysis of 814 genes that comprise the core genome. Our data distinguished between genomic components, especially core vs. pan-genome to provide insight into phylogeny and function that would otherwise be difficult to achieve. A phylogeny may mask the diversity of functions, which we tried to uncover in our approach. The diversity of sporulation and competence genes across the Bacilli was unexpected based on previous studies of the B. subtilis model alone. The challenge of uncovering the novelties and variations among genes of the non-subtilis groups still remains. This task will be best accomplished by directing efforts toward understanding phylogenetic groups with similar ecological niches

Wilfrid Laurier University

Microbial taxonomy in the post-genomic era: Rebuilding from scratch?

Author: Amaral Gilda R.
Campeão Mariana
Dutilh Bas E.
Edwards Robert A.
Polz Martin F
Polz Martin F.
Sawabe Tomoo
Swings Jean
Thompson Cristiane C.
Thompson Fabiano L.
Ussery David W.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 18/08/2016
Field of study

Microbial taxonomy should provide adequate descriptions of bacterial, archaeal, and eukaryotic microbial diversity in ecological, clinical, and industrial environments. Its cornerstone, the prokaryote species has been re-evaluated twice. It is time to revisit polyphasic taxonomy, its principles, and its practice, including its underlying pragmatic species concept. Ultimately, we will be able to realize an old dream of our predecessor taxonomists and build a genomic-based microbial taxonomy, using standardized and automated curation of high-quality complete genome sequences as the new gold standard.National Science Foundation (U.S.) (NSF Grant DEB-1046413)National Science Foundation (U.S.) (NSF Grant CNS-1305112)National Science Foundation (U.S.) (NSF Grant DEB 0918333)National Science Foundation (U.S.) (NSF grant OCE 1441943)Gordon and Betty Moore FoundationUnited States. Dept. of Energy. Office of ScienceUnited States. Dept. of Energy. Office of Biological and Environmental ResearchOak Ridge National LaboratoryCarlos Chagas Filho Foundation for Research Support of the State of Rio de JaneiroBrazil. Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (grant)Conselho Nacional de Pesquisas (Brazil

DSpace@MIT

NAST: a multiple sequence alignment server for comparative analysis of 16S rRNA genes

Author: Andersen G. L.
Brodie E. L.
DeSantis T. Z.
Hugenholtz P.
Keller K.
Larsen N.
Phan R.
Piceno Y. M.
Publication venue: Oxford University Press
Publication date: 01/07/2006
Field of study

Microbiologists conducting surveys of bacterial and archaeal diversity often require comparative alignments of thousands of 16S rRNA genes collected from a sample. The computational resources and bioinformatics expertise required to construct such an alignment has inhibited high-throughput analysis. It was hypothesized that an online tool could be developed to efficiently align thousands of 16S rRNA genes via the NAST (Nearest Alignment Space Termination) algorithm for creating multiple sequence alignments (MSA). The tool was implemented with a web-interface at . Each user-submitted sequence is compared with Greengenes' ‘Core Set’, comprising ∼10 000 aligned non-chimeric sequences representative of the currently recognized diversity among bacteria and archaea. User sequences are oriented and paired with their closest match in the Core Set to serve as a template for inserting gap characters. Non-16S data (sequence from vector or surrounding genomic regions) are conveniently removed in the returned alignment. From the resulting MSA, distance matrices can be calculated for diversity estimates and organisms can be classified by taxonomy. The ability to align and categorize large sequence sets using a simple interface has enabled researchers with various experience levels to obtain bacterial and archaeal community profiles

Crossref

PubMed Central

University of Queensland eSpace

gcType : a high-quality type strain genome database for microbial phylogenetic and functional research

Author: Alexander S
Arahal DR
Cai M
Chen Z
Dénes D
Eurwilaichitr L
Evtushenko L
Fan G
Gomez-Gil B
Hazbón MH
Hideaki S
Ingsriswang S
Itoh T
Kawasaki H
Kim SG
Lee JS
Liu D
Lucena T
Ma J
Meng Z
Moriya O
Peng F
Riojas MA
Sedlacek I
Shi W
Sun Q
Sun X
Suwannachart C
Tanasupawat S
Vandamme Peter
Weir BS
Wu L
Yao S
Zhang X
Zhou Y
Zhou Y
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2021
Field of study

Taxonomic and functional research of microorganisms has increasingly relied upon genome-based data and methods. As the depository of the Global Catalogue of Microorganisms (GCM) 10K prokaryotic type strain sequencing project, Global Catalogue of Type Strain (gcType) has published 1049 type strain genomes sequenced by the GCM 10K project which are preserved in global culture collections with a valid published status. Additionally, the information provided through gcType includes >12 000 publicly available type strain genome sequences from GenBank incorporated using quality control criteria and standard data annotation pipelines to form a high-quality reference database. This database integrates type strain sequences with their phenotypic information to facilitate phenotypic and genotypic analyses. Multiple formats of cross-genome searches and interactive interfaces have allowed extensive exploration of the database's resources. In this study, we describe web-based data analysis pipelines for genomic analyses and genome-based taxonomy, which could serve as a one-stop platform for the identification of prokaryotic species. The number of type strain genomes that are published will continue to increase as the GCM 10K project increases its collaboration with culture collections worldwide. Data of this project is shared with the International Nucleotide Sequence Database Collaboration. Access to gcType is free at http://gctype.wdcm.org/

Ghent University Academic Bibliography

Archivsystem Ask23