Search CORE

5 research outputs found

Predicting antimicrobial resistance using conserved genes.

Author: James J Davis
Marcus Nguyen
Margo VanOeffelen
Maulik Shukla
Robert Olson
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/10/2020
Field of study

A growing number of studies are using machine learning models to accurately predict antimicrobial resistance (AMR) phenotypes from bacterial sequence data. Although these studies are showing promise, the models are typically trained using features derived from comprehensive sets of AMR genes or whole genome sequences and may not be suitable for use when genomes are incomplete. In this study, we explore the possibility of predicting AMR phenotypes using incomplete genome sequence data. Models were built from small sets of randomly-selected core genes after removing the AMR genes. For Klebsiella pneumoniae, Mycobacterium tuberculosis, Salmonella enterica, and Staphylococcus aureus, we report that it is possible to classify susceptible and resistant phenotypes with average F1 scores ranging from 0.80-0.89 with as few as 100 conserved non-AMR genes, with very major error rates ranging from 0.11-0.23 and major error rates ranging from 0.10-0.20. Models built from core genes have predictive power in cases where the primary AMR mechanisms result from SNPs or horizontal gene transfer. By randomly sampling non-overlapping sets of core genes, we show that F1 scores and error rates are stable and have little variance between replicates. Although these small core gene models have lower accuracies and higher error rates than models built from the corresponding assembled genomes, the results suggest that sufficient variation exists in the core non-AMR genes of a species for predicting AMR phenotypes

Directory of Open Access Journals

Recommended from our members

Predicting antimicrobial resistance using conserved genes

Author: Davis James J.
Nguyen Marcus
Olson Robert
Shukla Maulik
VanOeffelen Margo
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 30/05/2023
Field of study

A growing number of studies are using machine learning models to accurately predict antimicrobial resistance (AMR) phenotypes from bacterial sequence data. Although these studies are showing promise, the models are typically trained using features derived from comprehensive sets of AMR genes or whole genome sequences and may not be suitable for use when genomes are incomplete. In this study, we explore the possibility of predicting AMR phenotypes using incomplete genome sequence data. Models were built from small sets of randomly-selected core genes after removing the AMR genes. For Klebsiella pneumoniae, Mycobacterium tuberculosis, Salmonella enterica, and Staphylococcus aureus, we report that it is possible to classify susceptible and resistant phenotypes with average F1 scores ranging from 0.80–0.89 with as few as 100 conserved non-AMR genes, with very major error rates ranging from 0.11–0.23 and major error rates ranging from 0.10–0.20. Models built from core genes have predictive power in cases where the primary AMR mechanisms result from SNPs or horizontal gene transfer. By randomly sampling non-overlapping sets of core genes, we show that F1 scores and error rates are stable and have little variance between replicates. Although these small core gene models have lower accuracies and higher error rates than models built from the corresponding assembled genomes, the results suggest that sufficient variation exists in the core non-AMR genes of a species for predicting AMR phenotypes.</p

Knowledge UChicago

A genomic data resource for predicting antimicrobial resistance from laboratory-derived antimicrobial susceptibility phenotypes

Author: Aytan-Aktug Derya
Brettin Thomas
Davis James J.
Dietrich Emily M
Kenyon Ronald W.
Machi Dustin
Mao Chunhong
Nguyen Marcus
Olson Robert
Pusch Gordon D
Shukla Maulik
Stevens Rick
VanOeffelen Margo
Vonstein Veronika
Warren Andrew S.
Wattam Alice R.
Yoo Hyunseung
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2021
Field of study

Antimicrobial resistance (AMR) is a major global health threat that affects millions of people each year. Funding agencies worldwide and the global research community have expended considerable capital and effort tracking the evolution and spread of AMR by isolating and sequencing bacterial strains and performing antimicrobial susceptibility testing (AST). For the last several years, we have been capturing these efforts by curating data from the literature and data resources and building a set of assembled bacterial genome sequences that are paired with laboratory-derived AST data. This collection currently contains AST data for over 67 000 genomes encompassing approximately 40 genera and over 100 species. In this paper, we describe the characteristics of this collection, highlighting areas where sampling is comparatively deep or shallow, and showing areas where attention is needed from the research community to improve sampling and tracking efforts. In addition to using the data to track the evolution and spread of AMR, it also serves as a useful starting point for building machine learning models for predicting AMR phenotypes. We demonstrate this by describing two machine learning models that are built from the entire dataset to show where the predictive power is comparatively high or low. This AMR metadata collection is freely available and maintained on the Bacterial and Viral Bioinformatics Center (BV-BRC) FTP site ftp://ftp.bvbrc.org/RELEASE_NOTES/PATRIC_genomes_AMR.txt

PubMed Central

Online Research Database In Technology

Recommended from our members

Introducing the Bacterial and Viral Bioinformatics Resource Center (BV-BRC): a resource combining PATRIC, IRD and ViPR.

Author: Assaf Rida
Brettin Thomas
Conrad Neal
Cucinell Clark
Davis James J
Dempsey Donald M
Dickerman Allan
Dietrich Emily M
Kenyon Ronald W
Kuscuoglu Mehmet
Lefkowitz Elliot J
Lu Jian
Machi Dustin
Macken Catherine
Mao Chunhong
Nguyen Marcus
Niewiadomska Anna
Olsen Gary J
Olson Robert D
Overbeek Jamie C
Parrello Bruce
Parrello Victoria
Porter Jacob S
Pusch Gordon D
Scheuermann Richard H
Shukla Maulik
Singh Indresh
Stevens Rick L
Stewart Lucy
Tan Gene
Thomas Chris
VanOeffelen Margo
Vonstein Veronika
Wallace Zachary S
Warren Andrew S
Wattam Alice R
Xia Fangfang
Yoo Hyunseung
Zhang Yun
Zmasek Christian M
Publication venue: eScholarship, University of California
Publication date: 06/01/2023
Field of study

The National Institute of Allergy and Infectious Diseases (NIAID) established the Bioinformatics Resource Center (BRC) program to assist researchers with analyzing the growing body of genome sequence and other omics-related data. In this report, we describe the merger of the PAThosystems Resource Integration Center (PATRIC), the Influenza Research Database (IRD) and the Virus Pathogen Database and Analysis Resource (ViPR) BRCs to form the Bacterial and Viral Bioinformatics Resource Center (BV-BRC) https://www.bv-brc.org/. The combined BV-BRC leverages the functionality of the bacterial and viral resources to provide a unified data model, enhanced web-based visualization and analysis tools, bioinformatics services, and a powerful suite of command line tools that benefit the bacterial and viral research communities

eScholarship - University of California

The PATRIC Bioinformatics Resource Center: expanding data and analysis capabilities

Author: Alice R Wattam
Allan Dickerman
Andrew Guard
Andrew S Warren
Antonopoulos
Aziz
Bankevich
Boratyn
Brettin
Bruce Parrello
Chris Thomas
Chunhong Mao
Clausen
Cock
Dan Murphy-Olson
Darling
Darling
Davis
Dawen Xie
Dustin Machi
Eddy
Edgar
Emily M Dietrich
Enright
Eric K Nordberg
Fangfang Xia
Federhen
Feldgarden
Felsenstein
Gary J Olsen
Gordon D Pusch
Haft
Hyunseung Yoo
James J Davis
Jamie C Overbeek
Jia
Joseph L Gabbard
Kim
Koren
Krueger
Langmead
Langmead
Lassmann
Li
Li
Liu
Marcus Nguyen
Margo VanOeffelen
Martin
Maulik Shukla
McArthur
McNair
McNeil
Neal Conrad
Nguyen
Nguyen
Ondov
Ondov
Overbeek
Overbeek
Overbeek
Parks
Parrello
Philippe Chlenski
Price
Ralph Butler
Ramy K Aziz
Rick Stevens
Robert D Olson
Ronald W Kenyon
Rory M Butler
Ross Overbeek
Snyder
Stamatakis
Stamatakis
Svetlana Gerdes
Talavera
Thomas Brettin
Vaser
Veronika Vonstein
Walker
Wattam
Wattam
Wick
Wick
Wood
Yoo
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref