Search CORE

22 research outputs found

GenomePeek—an online tool for prokaryotic genome and metagenome analysis

Author: Katelyn McNair
Robert A. Edwards
Publication venue: 'PeerJ'
Publication date: 01/01/2015

Crossref

Dietary prophage inducers and antimicrobials: toward landscaping the human gut microbiome.

Author: Boling Lance
Cuevas Daniel A
Grasis Juris A
Kang Han Suh
Knowles Ben
Levi Kyle
Maughan Heather
McNair Katelyn
Rohwer Forest
Rojas Maria Isabel
Sanchez Savannah E
Smurthwaite Cameron
Publication venue: eScholarship, University of California
Publication date: 01/07/2020

The approximately 10^11 viruses and microbial cells per gram of fecal matter (dry weight) in the large intestine are important to human health. The responses of three common gut bacterial species and one opportunistic pathogen to 117 commonly consumed foods, chemical additives, and plant extracts were tested. Many compounds, including Stevia rebaudiana and bee propolis extracts, exhibited species-specific growth inhibition by prophage induction. Overall, these results show that various foods may change the abundances of gut bacteria by modulating temperate phage, and they suggest a novel path for landscaping the human gut microbiome.

eScholarship - University of California

PHACTS, a computational approach to classifying the lifestyle of phages

Author: Barbara A. Bailey
Katelyn McNair
Robert A. Edwards
Publication venue: Oxford University Press

Motivation: Bacteriophages have two distinct lifestyles: virulent and temperate. The virulent lifestyle has many implications for phage therapy, genomics and microbiology. The lifestyle of a newly sequenced phage is currently determined using standard culturing techniques. Such laboratory work is not only costly and time consuming, but also cannot be applied to phage genomes constructed from environmental sequencing. Therefore, a computational method that utilizes the sequence data of phage genomes is needed.

Crossref

PubMed Central

Global phylogeography and ancient evolution of the widespread human gut virus crAssphage

Author: Aarestrup Frank M.
Ahmadov Gunduz
Alassaf Abeer
Anton Josefa
Asangba Abigail
Aziz Ramy K
Barr Jeremy J.
Bibby Kyle
Billings Emma K.
Brouns Stan J. J.
Cantu Vito Adrian
Carlton Jane M.
Cazares Adrian
Cazares Daniel
Cho Gyu-Sung
Cinek Ondrej
Condeff Tess
Cortés Pilar
Cranfield Mark
Cuevas Daniel A.
de Jonge Patrick A.
De la Iglesia Rodrigo
Decewicz Przemyslaw
Desnues Christelle
Dinsdale Elizabeth A.
Doane Michael P.
Dominy Nathaniel J.
Dutilh Bas E.
Dziewit Lukasz
Díaz Muñoz Samuel L.
Edwards Robert A.
Elwasila Bashir Mukhtar
Eren A. Murat
Fineran Peter C.
Franz Charles
Fu Jingyuan
García Aljaro Cristina
Ghedin Elodie
Gulino Kristen M.
Haggerty John M.
Head Steven R.
Hendriksen Rene S.
Hill Colin
Hyöty Heikki
Ilina Elena N.
Irwin Mitchell T.
Jeffries Thomas C.
Jofre i Torroella Joan
Junge Randall E.
Kelley Scott T.
Kowalewski Martin
Kumaresan Deepak
Kurilshikov Alexander
Lavigne Rob
Leigh Steven R.
Levi Kyle
Lipson David
Lisitsyna Eugenia S.
Llagostera Montserrat
Maritz Julia M.
Marr Linsey C.
Mazankova Karla
McCann Angela
McCarthy David T.
McNair Katelyn
Mirzaei Mohammadali Khan
Molshanski-Mor Shahar
Monteiro Silvia
Moreira-Grez Benjamin
Morris Megan
Mugisha Lawrence
Muniesa Pérez Ma. Teresa
Neve Horst
Nguyen Nam-Phuong
Nigro Olivia D.
Nilsson Anders S.
Nobrega Franklin L.
Norman Holly M.
O'Connell Taylor
Rice Gillian A. O.
Odeh Rasha
Ohaeri Maria
Oliver Andrew
Piuri Mariana
Prussin II Aaron J.
Quan Zhe-Xue
Qimron Udi
Rainetova Petra
Ramírez-Rojas Adán
Raya Raul
Reasor Kim
Reyes Muñoz Alejandro
Rossi Alessandro
Santos Ricardo
Shimashita John
Stachler Elyse N.
Stene Lars C.
Strain Ronan
Stumpf Rebecca
Tapia German
Torres Pedro J.
Trefault Nicole
Twaddle Alan
Tyakht Alexander V.
Ugochi Ibekwe MaryAnn
Vega Alejandro A.
Villagra Nicolás
Vinuesa Pablo
Wagemans Jeroen
Wandro Stephen
White Bryan
Whiteley Andy
Whiteson Katrine L.
Wijmenga Cisca
Zambrano Maria M.
Zhernakova Alexandra
Zschach Henrike
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/04/2023

Microbiomes are vast communities of microorganisms and viruses that populate all natural ecosystems. Viruses have been considered to be the most variable component of microbiomes, as supported by virome surveys and examples of high genomic mosaicism. However, recent evidence suggests that the human gut virome is remarkably stable compared with that of other environments. Here, we investigate the origin, evolution and epidemiology of crAssphage, a widespread human gut virus. Through a global collaboration, we obtained DNA sequences of crAssphage from more than one-third of the world's countries and showed that the phylogeography of crAssphage is locally clustered within countries, cities and individuals. We also found fully colinear crAssphage-like genomes in both Old-World and New-World primates, suggesting that the association of crAssphage with primates may be millions of years old. Finally, by exploiting a large cohort of more than 1,000 individuals, we tested whether crAssphage is associated with bacterial taxonomic groups of the gut microbiome, diverse human health parameters and a wide range of dietary factors. We identified strong correlations with different clades of bacteria that are related to Bacteroidetes and weak associations with several diet categories, but no significant association with health or disease. We conclude that crAssphage is a benign cosmopolitan virus that may have coevolved with the human lineage and is an integral part of the normal human gut virome

Diposit Digital de la Universitat de Barcelona

Global phylogeography and ancient evolution of the widespread human gut virus crAssphage

Author: Aarestrup Frank M
Ahmadov Gunduz
Alassaf Abeer
Anton Josefa
Asangba Abigail
Aziz Ramy K
Barr Jeremy J
Bibby Kyle
Billings Emma K
Brouns Stan J J
Cantu Vito Adrian
Carlton Jane M
Cazares Adrian
Cazares Daniel
Cho Gyu-Sung
Cinek Ondrej
Condeff Tess
Cortés Pilar
Cranfield Mike
Cuevas Daniel A
de Jonge Patrick A
De la Iglesia Rodrigo
Decewicz Przemyslaw
Desnues Christelle
Dinsdale Elizabeth A
Doane Michael P
Dominy Nathaniel J
Dutilh Bas E
Dziewit Lukasz
Díaz Muñoz Samuel L
Edwards Robert A
Elwasila Bashir Mukhtar
Eren A Murat
Fineran Peter C
Franz Charles
Fu Jingyuan
Garcia-Aljaro Cristina
Ghedin Elodie
Gulino Kristen M
Haggerty John M
Head Steven R
Hendriksen Rene S
Hill Colin
Hyöty Heikki
Ilina Elena N
Irwin Mitchell T
Jeffries Thomas C
Jofre Juan
Junge Randall E
Kelley Scott T
Khan Mirzaei Mohammadali
Kowalewski Martin
Kumaresan Deepak
Kurilshikov Alexander
Lavigne Rob
Leigh Steven R
Levi Kyle
Lipson David
Lisitsyna Eugenia S
Llagostera Montserrat
Maritz Julia M
Marr Linsey C
Mazankova Karla
McCann Angela
McCarthy David T
McNair Katelyn
Molshanski-Mor Shahar
Monteiro Silvia
Moreira-Grez Benjamin
Morris Megan
Mugisha Lawrence
Muniesa Maite
Neve Horst
Nguyen Nam-Phuong
Nigro Olivia D
Nilsson Anders S
Nobrega Franklin L
Norman Holly M
O'Connell Taylor
Odeh Rasha
Ohaeri Maria
Oliver Andrew
Piuri Mariana
Prussin II Aaron J
Qimron Udi
Quan Zhe-Xue
Rainetova Petra
Ramírez-Rojas Adán
Raya Raul
Reasor Kim
Reyes Muñoz Alejandro
Rice Gillian A O
Rossi Alessandro
Santos Ricardo
Shimashita John
Stachler Elyse N
Stene Lars C
Strain Ronan
Stumpf Rebecca
Tapia German
Torres Pedro J
Trefault Nicole
Twaddle Alan
Tyakht Alexander V
Ugochi Ibekwe MaryAnn
Vega Alejandro A
Villagra Nicolás
Vinuesa Pablo
Wagemans Jeroen
Wandro Stephen
White Bryan
Whiteley Andy
Whiteson Katrine L
Wijmenga Cisca
Zambrano Maria M
Zhernakova Alexandra
Zschach Henrike
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019

Microbiomes are vast communities of microorganisms and viruses that populate all natural ecosystems. Viruses have been considered to be the most variable component of microbiomes, as supported by virome surveys and examples of high genomic mosaicism. However, recent evidence suggests that the human gut virome is remarkably stable compared with that of other environments. Here, we investigate the origin, evolution and epidemiology of crAssphage, a widespread human gut virus. Through a global collaboration, we obtained DNA sequences of crAssphage from more than one-third of the world’s countries and showed that the phylogeography of crAssphage is locally clustered within countries, cities and individuals. We also found fully colinear crAssphage-like genomes in both Old-World and New-World primates, suggesting that the association of crAssphage with primates may be millions of years old. Finally, by exploiting a large cohort of more than 1,000 individuals, we tested whether crAssphage is associated with bacterial taxonomic groups of the gut microbiome, diverse human health parameters and a wide range of dietary factors. We identified strong correlations with different clades of bacteria that are related to Bacteroidetes and weak associations with several diet categories, but no significant association with health or disease. We conclude that crAssphage is a benign cosmopolitan virus that may have coevolved with the human lineage and is an integral part of the normal human gut virome

Repositorio Institucional de la Universidad de Alicante

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

University of Groningen

HAL AMU

Online Research Database In Technology

Archivio istituzionale della ricerca - Università di Padova

Proceedings - University of Groningen

ARTS repository - University of Groningen

HAL-INSU

Copenhagen University Research Information System

Queensland University of Technology ePrints Archive

HAL-IRD

Utrecht University Repository

Monash University Research Portal

Dissertations of the University of Groningen

GenomePeek—an online tool for prokaryotic genome and metagenome analysis

Author: Katelyn McNair
Robert A. Edwards
Publication venue: 'PeerJ'
Publication date: 01/06/2015

As more and more prokaryotic sequencing takes place, a method to analyze these data quickly and accurately is needed. Previous tools are mainly designed for metagenomic analysis and have limitations, such as long runtimes and significant false-positive error rates. The online tool GenomePeek (edwards.sdsu.edu/GenomePeek) was developed to analyze both single-genome and metagenome sequencing files quickly and with low error rates. GenomePeek uses a sequence assembly approach in which reads matching a set of conserved genes are extracted, assembled and then aligned against a highly specific reference database. GenomePeek was found to be faster than traditional approaches while still keeping error rates low, and it offers unique data visualization options.

Directory of Open Access Journals

PubMed Central

PRFect: a tool to predict programmed ribosomal frameshifts in prokaryotic and viral genomes

Author: Anca M. Segall
Katelyn McNair
Peter Salamon
Robert A. Edwards
Publication venue: BMC
Publication date: 01/02/2024

Background: One of the stranger phenomena that can occur during gene translation is programmed ribosomal frameshifting: as a ribosome reads along the mRNA, various cellular and molecular properties contribute to stalling the ribosome on a slippery sequence and shifting it into one of the two alternate reading frames. The alternate frame has different codons, so different amino acids are added to the peptide chain. More importantly, the original stop codon is no longer in-frame, so the ribosome can bypass the stop codon and continue to translate the codons past it. This produces a longer version of the protein: a fusion of the original in-frame amino acids followed by all the alternate-frame amino acids. There is currently no automated software to predict the occurrence of these programmed ribosomal frameshifts (PRFs), and they are currently identified only by manual curation.
Results: Here we present PRFect, an innovative machine-learning method for the detection and prediction of PRFs in coding genes of various types. PRFect combines advanced machine-learning techniques with the integration of multiple complex cellular properties, such as secondary structure, codon usage, ribosomal binding site interference, direction, and slippery site motif. Calculating and incorporating these diverse properties posed significant challenges, but through extensive research and development, we have achieved a user-friendly approach. The code for PRFect is freely available, open-source, and can be easily installed via a single command in the terminal. Our comprehensive evaluations on diverse organisms, including bacteria, archaea, and phages, demonstrate PRFect's strong performance, achieving high sensitivity, specificity, and an accuracy exceeding 90%.
Conclusion: PRFect represents a significant advancement in the field of PRF detection and prediction, offering a powerful tool for researchers and scientists to unravel the intricacies of programmed ribosomal frameshifting in coding genes.
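
The frameshift mechanism described in the abstract can be illustrated with a short translation sketch. This is not PRFect's code and does no prediction; it only shows how a shift into another reading frame bypasses the original stop codon and yields a fusion protein. The function names, the toy sequence, and the shift position are all hypothetical.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq, start=0):
    """Translate seq from index `start` until a stop codon or the end."""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def translate_with_frameshift(seq, shift_at, shift):
    """Translate in frame 0 up to nucleotide index `shift_at` (assumed to be
    a multiple of 3 and upstream of any in-frame stop), then continue in the
    frame offset by `shift` (e.g. -1 or +1)."""
    upstream = translate(seq[:shift_at])
    downstream = translate(seq, start=shift_at + shift)
    return upstream + downstream


# Hypothetical toy gene: the in-frame stop (TAA) follows the third codon.
example_seq = "ATGAAATTTTAAGGCTGTAA"
in_frame = translate(example_seq)
fusion = translate_with_frameshift(example_seq, shift_at=9, shift=-1)
print(in_frame, fusion)
```

The fusion product starts with the same residues as the in-frame product and then continues with the alternate-frame residues until the next stop codon in that frame.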

Directory of Open Access Journals

Computational approaches to predict bacteriophage-host relationships

Author: Dutilh Bas E
Edwards Robert A
Faust Karoline
McNair Katelyn
Raes Jeroen
Publication date: 01/01/2016

Metagenomics has changed the face of virus discovery by enabling the accurate identification of viral genome sequences without requiring isolation of the viruses. As a result, metagenomic virus discovery leaves the first and most fundamental question about any novel virus unanswered: What host does the virus infect? The diversity of the global virosphere and the volumes of data obtained in metagenomic sequencing projects demand computational tools for virus-host prediction. We focus on bacteriophages (phages, viruses that infect bacteria), the most abundant and diverse group of viruses found in environmental metagenomes. By analyzing 820 phages with annotated hosts, we review and assess the predictive power of in silico phage-host signals. Sequence homology approaches are the most effective at identifying known phage-host pairs. Compositional and abundance-based methods contain significant signal for phage-host classification, providing opportunities for analyzing the unknowns in viral metagenomes. Together, these computational approaches further our knowledge of the interactions between phages and their hosts. Importantly, we find that all reviewed signals significantly link phages to their hosts, illustrating how current knowledge and insights about the interaction mechanisms and ecology of coevolving phages and bacteria can be exploited to predict phage-host relationships, with potential relevance for medical and industrial applications
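
As a rough illustration of what a compositional signal can look like, the sketch below compares k-mer frequency profiles of a phage and candidate hosts and picks the closest host. It is an assumption-laden toy, not the method evaluated in the review: the function names, the Manhattan distance, and the sequences are all hypothetical.

```python
from collections import Counter


def kmer_profile(seq, k=4):
    """Return the relative frequency of each k-mer in seq."""
    if len(seq) < k:
        raise ValueError("sequence is shorter than k")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return {kmer: n / total for kmer, n in counts.items()}


def profile_distance(p, q):
    """Manhattan distance between two k-mer profiles (an assumed metric)."""
    return sum(abs(p.get(x, 0.0) - q.get(x, 0.0)) for x in set(p) | set(q))


def closest_host(phage_seq, hosts, k=4):
    """Return the name of the host whose composition is nearest the phage."""
    phage = kmer_profile(phage_seq, k)
    return min(
        hosts,
        key=lambda name: profile_distance(phage, kmer_profile(hosts[name], k)),
    )


# Hypothetical toy sequences; real genomes are far longer.
toy_phage = "ATATATATAT"
toy_hosts = {"hostA": "ATATATTATA", "hostB": "GCGCGCGGCC"}
print(closest_host(toy_phage, toy_hosts, k=2))
```

On real data the profiles would be computed over whole genomes, and a nearest-profile match is only a candidate prediction rather than evidence of infection.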

Lirias

Radboud Repository

Utrecht University Repository

Utilizing Amino Acid Composition and Entropy of Potential Open Reading Frames to Identify Protein-Coding Genes

Author: Brian Souza
Carol L. Ecale Zhou
Katelyn McNair
Robert A. Edwards
Stephanie Malfatti
Publication venue: 'MDPI AG'
Publication date: 08/01/2021

One of the main steps in gene-finding in prokaryotes is determining which open reading frames encode a protein and which occur by chance alone. There are many different methods to differentiate the two; the most prevalent approach is using shared homology with a database of known genes. This method presents many pitfalls, most notably that it only finds genes that have been seen before. The four most popular prokaryotic gene-prediction programs (GeneMark, Glimmer, Prodigal, PHANOTATE) all use a protein-coding training model to predict protein-coding genes, with the latter three allowing the training model to be created ab initio from the input genome. Different methods are available for creating the training model, and to increase the accuracy of such tools, we present here GOODORFS, a method for identifying protein-coding genes within a set of all possible open reading frames (ORFs). Our workflow begins with taking the amino acid frequencies of each ORF, calculating an entropy density profile (EDP), using KMeans to cluster the EDPs, and then selecting the cluster with the lowest variation as the coding ORFs. To test the efficacy of our method, we ran GOODORFS on 14,179 annotated phage genomes, and compared our results to the initial training-set creation step of four other similar methods (Glimmer, MED2, PHANOTATE, Prodigal). We found that GOODORFS was the most accurate (0.94) and had the best F1-score (0.85), while Glimmer had the highest precision (0.92) and PHANOTATE had the highest recall (0.96).
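
The first and last steps of the workflow above can be sketched as follows. This is a minimal illustration, not the GOODORFS implementation. It assumes the common EDP definition, in which each amino acid's entropy term -p*log(p) is divided by the total Shannon entropy, and it assumes "variation" means the mean squared distance of a cluster's members from their centroid. The clustering step itself is not shown; the labels are assumed to come from any k-means implementation. All function names are hypothetical.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def entropy_density_profile(protein):
    """Return a 20-element EDP: each entropy term divided by total entropy."""
    counts = Counter(aa for aa in protein if aa in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no standard amino acids in sequence")
    freqs = [counts[aa] / total for aa in AMINO_ACIDS]
    terms = [-p * math.log(p) if p > 0 else 0.0 for p in freqs]
    entropy = sum(terms)
    if entropy == 0:  # a single repeated amino acid has zero entropy
        return [0.0] * len(AMINO_ACIDS)
    return [t / entropy for t in terms]


def lowest_variation_cluster(profiles, labels):
    """Return the label whose members vary least around their centroid."""
    variation = {}
    for label in set(labels):
        members = [p for p, l in zip(profiles, labels) if l == label]
        centroid = [sum(col) / len(members) for col in zip(*members)]
        variation[label] = sum(
            (x - c) ** 2 for p in members for x, c in zip(p, centroid)
        ) / len(members)
    return min(variation, key=variation.get)


# Hypothetical toy ORF translations and cluster labels.
toy_profiles = [entropy_density_profile(s)
                for s in ("ACDE", "ACDE", "AAAC", "WWYY")]
toy_labels = [0, 0, 1, 1]
print(lowest_variation_cluster(toy_profiles, toy_labels))
```

Because each EDP is normalized by the sequence's own entropy, profiles from ORFs of different lengths are directly comparable before clustering.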

Multidisciplinary Digital Publishing Institute