4 research outputs found

Prediction of Susceptibility to First-Line Tuberculosis Drugs by DNA Sequencing

Author: 100000 Genomes Project
Allix-Beguec C
Arandjelovic I
Beckert P
Bi L
Bonnet M
Bradley P
Cabibbe AM
Cancino-Munoz I
Caulfield MJ
Chaiprasert A
Cirillo DM
Clifton D
Comas I
Crook DW
CRyPTIC Consortium
De Filippo MR
de Neeling H
Diel R
Drobniewski FA
Faksri K
Farhat MR
Fleming J
Fowler P
Fowler TA
Gao Q
Gardy J
Gascoyne-Binzi D
Gibertoni-Cruz A-L
Gil-Brusola A
Golubchik T
Gonzalo X
Grandjean L
Guthrie JL
He G
Hoosdally S
Hunt M
Iqbal Z
Ismail N
Johnston J
Khanzada FM
Khor CC
Kohl TA
Kong C
Lipworth S
Liu Q
Maphalala G
Martinez E
Mathys V
Merker M
Miotto P
Mistry N
Moore DAJ
Murray M
Niemann S
Omar SV
Ong RT-H
Padilla ES
Peto TEA
Posey JE
Prammananan T
Pym A
Rodrigues C
Rodrigues M
Rodwell T
Rossolini GM
Schito M
Shen X
Shendure J
Sintchenko V
Sloutsky A
Smith EG
Snyder M
Soetaert K
Starks AM
Supply P
Suriyapol P
Tahseen S
Tang P
Teo Y-Y
Thuong TNT
Thwaites G
Tortoli E
van Soolingen D
Walker AS
Walker TM
Wilcox M
Wilson DJ
Wyllie D
Yang Y
Zhang H
Zhao Y
Zhu B
Publication venue: 'Massachusetts Medical Society'
Publication date: 01/01/2018

Background: The World Health Organization recommends drug-susceptibility testing of Mycobacterium tuberculosis complex for all patients with tuberculosis to guide treatment decisions and improve outcomes. Whether DNA sequencing can be used to accurately predict profiles of susceptibility to first-line antituberculosis drugs has not been clear. Methods: We obtained whole-genome sequences and associated phenotypes of resistance or susceptibility to the first-line antituberculosis drugs isoniazid, rifampin, ethambutol, and pyrazinamide for isolates from 16 countries across six continents. For each isolate, mutations associated with drug resistance and drug susceptibility were identified across nine genes, and individual phenotypes were predicted unless mutations of unknown association were also present. To identify how whole-genome sequencing might direct first-line drug therapy, complete susceptibility profiles were predicted. These profiles were predicted to be susceptible to all four drugs (i.e., pansusceptible) if they were predicted to be susceptible to isoniazid and to the other drugs or if they contained mutations of unknown association in genes that affect susceptibility to the other drugs. We simulated the way in which the negative predictive value changed with the prevalence of drug resistance. Results: A total of 10,209 isolates were analyzed. The largest proportion of phenotypes was predicted for rifampin (9660 [95.4%] of 10,130) and the smallest was predicted for ethambutol (8794 [89.8%] of 9794). Resistance to isoniazid, rifampin, ethambutol, and pyrazinamide was correctly predicted with 97.1%, 97.5%, 94.6%, and 91.3% sensitivity, respectively, and susceptibility to these drugs was correctly predicted with 99.0%, 98.8%, 93.6%, and 96.8% specificity. Of the 7516 isolates with complete phenotypic drug-susceptibility profiles, 5865 (78.0%) had complete genotypic predictions, among which 5250 profiles (89.5%) were correctly predicted. Among the 4037 phenotypic profiles that were predicted to be pansusceptible, 3952 (97.9%) were correctly predicted. Conclusions: Genotypic predictions of the susceptibility of M. tuberculosis to first-line drugs were found to be correlated with phenotypic susceptibility to these drugs. (Funded by the Bill and Melinda Gates Foundation and others.)
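
The abstract says the negative predictive value (NPV) was simulated across different prevalences of drug resistance, but it does not describe how. As a minimal sketch only, the standard Bayes relationship between sensitivity, specificity, prevalence, and NPV can be written as below. It assumes resistance is the "positive" class and plugs in the reported isoniazid sensitivity (97.1%) and specificity (99.0%). The function name and the prevalence grid are illustrative assumptions, not taken from the paper.

```python
def negative_predictive_value(sensitivity, specificity, prevalence):
    """Probability that an isolate predicted susceptible is truly susceptible.

    Standard Bayes formula; 'positive' means drug resistant (an assumption
    made for this sketch, not a detail taken from the paper).
    """
    true_negatives = specificity * (1.0 - prevalence)
    false_negatives = (1.0 - sensitivity) * prevalence
    return true_negatives / (true_negatives + false_negatives)


if __name__ == "__main__":
    # Reported isoniazid figures from the abstract; the prevalence grid is hypothetical.
    for prevalence in (0.01, 0.05, 0.10, 0.20, 0.50):
        npv = negative_predictive_value(0.971, 0.990, prevalence)
        print(f"prevalence {prevalence:.0%}: NPV {npv:.4f}")
```

With sensitivity and specificity held fixed, the NPV falls as resistance becomes more common, because false negatives make up a growing share of the "predicted susceptible" group.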

LSHTM Research Online

Sciensano Publications Repository

eScholarship - University of California

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Digital.CSIC

White Rose Research Online

Application of machine learning techniques to tuberculosis drug resistance analysis

Author: Arandjelovic I
Battaglia S
Borroni E
Cabibbe A
Carter J
Cirillo DM
Claxton P
Clifton DA
Comas I
Coronel J
Crook DW
De Oliveira RS
Drobniewski F
Ferrazoli L
Fowler PW
Gao GF
Gao Q
Gardy J
Ghodousi A
Gibertoni Cruz AL
Grazian C
He G
Hoffmann H
Hoosdally SJ
Hunt M
Iqbal Z
Ismail N
Jarrett L
Joseph L
Jou R
Kambli P
Khot R
Kohl T
Kouchaki S
Lalvani A
Laurenson I
Lin WH
Liu C
Ma A
Marubini E
Matias D
Merker M
Molodtsov N
Moore D
Ngoc NH
Niemann S
Nilgiriwala K
Omar SV
Paton N
Peto TEA
Plesnik S
Posey J
Rathod P
Rodrigues C
Shah S
Sintchenko V
Smith EG
Solano W
Spitaleri A
Srinivasan V
Supply P
Surve U
Tahseen S
Thuong TNT
Thwaites G
Van Soolingen D
Walker AS
Walker TM
Werngren J
Wilkinson RJ
Wilson DJ
Wu MH
Yang YY
Zhao Y
Zhu B
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/07/2019

Motivation: Timely identification of Mycobacterium tuberculosis (MTB) resistance to existing drugs is vital to decrease mortality and prevent the amplification of existing antibiotic resistance. Machine learning methods have been widely applied to predict MTB resistance to a specific drug in a timely manner and to identify resistance markers. However, they have not been validated on a large cohort of MTB samples from multiple centers across the world in terms of resistance prediction and resistance marker identification. Several machine learning classifiers and linear dimension reduction techniques were developed and compared for a cohort of 13 402 isolates collected from 16 countries across 6 continents and tested against 11 drugs. Results: Compared with a conventional molecular diagnostic test, the area under the curve of the best machine learning classifier increased for all drugs, most notably by 23.11%, 15.22% and 10.14% for pyrazinamide, ciprofloxacin and ofloxacin, respectively (P < 0.01). Logistic regression and gradient tree boosting were found to perform better than the other techniques. Moreover, adding a sparse principal component analysis/non-negative matrix factorization step to logistic regression/gradient tree boosting improved the best F1-score over the classifier alone by 12.54%, 4.61%, 7.45% and 9.58% for amikacin, moxifloxacin, ofloxacin and capreomycin, respectively, and also increased the area under the curve for amikacin and capreomycin. The results provide a comprehensive comparison of various techniques and confirm that machine learning can improve prediction on large, diverse tuberculosis data. Furthermore, mutation ranking showed the possibility of finding new resistance/susceptibility markers.
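
The abstract compares classifiers by area under the curve (AUC) and F1-score. For readers unfamiliar with these metrics, the sketch below computes both in plain Python for a binary resistant (1) versus susceptible (0) labelling. It is an illustration of the metric definitions only, not the authors' evaluation code. The function names and the toy labels and scores are hypothetical.

```python
def f1_score(y_true, y_pred):
    """Harmonic mean of precision and recall for the positive (resistant) class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are both zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def auc(y_true, scores):
    """Area under the ROC curve via pairwise ranking.

    Equals the fraction of (resistant, susceptible) pairs in which the
    resistant isolate gets the higher score; ties count as half.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one isolate of each class")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


if __name__ == "__main__":
    labels = [1, 1, 0, 0]          # hypothetical phenotypes
    scores = [0.9, 0.4, 0.6, 0.1]  # hypothetical classifier scores
    calls = [1 if s >= 0.5 else 0 for s in scores]
    print("AUC:", auc(labels, scores))
    print("F1:", f1_score(labels, calls))
```

AUC depends only on how the scores rank isolates, while F1 depends on the threshold used to turn scores into calls, which is one reason the two metrics can improve by different amounts for the same drug.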

UNSWorks

Multi-Label Random Forest Model for Tuberculosis Drug Resistance Classification and Mutation Ranking

Author: Arandjelovic I
Battaglia S
Borroni E
Cabibbe A
Carter J
Chaiprasert A
Cirillo DM
Claxton P
Clifton D
Comas I
Coronel J
Crook D
Drobniewski FA
Earle SG
Farhat MR
Ferrazoli L
Fowler P
Gao GF
Gao Q
Gardy J
Ghodousi A
Gibertoni Cruz AL
Grazian C
He G
Hee ROT
Hoffmann H
Hoosdally S
Hunt M
Iqbal Z
Ismail N
Jarrett L
Joseph L
Jou R
Kambli P
Khot R
Knaggs J
Koch A
Kouchaki S
Lachapelle A
Lalvani A
Laurenson I
Lin WH
Liu C
Ma A
Matias D
Merker M
Moore D
Ngoc NH
Niemann S
Nilgiriwala K
Omar SV
Paton N
Peto TEA
Plesnik S
Posey J
Rathod P
Rodrigues C
Roig CR
Shah S
Sintchenko V
Siqueira de Oliveira R
Smith GE
Solano W
Spitaleri A
Srinivasan V
Supply P
Surve U
Tahseen S
Thuong TNT
Thwaites G
Todt K
van Soolingen D
Walker SA
Walker T
Werngren J
Wilkinson R
Wu MH
Yang Y
Zhao Y
Zhu B
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2020

UNSWorks

GenomegaMap: within-species genome-wide dN/dS estimation from over 10,000 genomes

Author: Arandjelovic I
Battaglia S
Borroni E
Cabibbe A
Carter J
Chaiprasert A
Cirillo DM
Claxton P
Clifton DA
Comas I
Coronel J
Crook DW
De Oliveira RS
Drobniewski FA
Earle SG
Farhat MR
Ferrazoli L
Fowler PW
Gao GF
Gao Q
Gardy J
Ghodousi A
Gibertoni Cruz AL
Grazian C
He G
Hee ROT
Hoffmann H
Hoosdally SJ
Hunt M
Iqbal Z
Ismail N
Jarrett L
Joseph L
Jou R
Kambli P
Khot R
Knaggs J
Koch A
Kohl TA
Kouchaki S
Lachapelle A
Lalvani A
Laurenson I
Lin W-H
Liu C
Ma A
Matias D
Merker M
Moore D
Ngoc NH
Niemann S
Nilgiriwala K
Omar SV
Paton N
Peto TEA
Plesnik S
Posey J
Rathod P
Rodrigues C
Roig CJ
Shah S
Sintchenko V
Smith EG
Solano W
Spitaleri A
Srinivasan V
Supply P
Surve U
Tahseen S
Thuong TNT
Thwaites G
Todt K
Van Soolingen D
Walker AS
Walker TM
Werngren J
Wilkinson R
Wilson DJ
Wu M-H
Yang Y
Zhao Y
Zhu B
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2020

The dN/dS ratio provides evidence of adaptation or functional constraint in protein-coding genes by quantifying the relative excess or deficit of amino acid-replacing versus silent nucleotide variation. Inexpensive sequencing promises a better understanding of parameters, such as dN/dS, but analyzing very large data sets poses a major statistical challenge. Here, I introduce genomegaMap for estimating within-species genome-wide variation in dN/dS, and I apply it to 3,979 genes across 10,209 tuberculosis genomes to characterize the selection pressures shaping this global pathogen. GenomegaMap is a phylogeny-free method that addresses two major problems with existing approaches: (1) it is fast no matter how large the sample size and (2) it is robust to recombination, which causes phylogenetic methods to report artefactual signals of adaptation. GenomegaMap uses population genetics theory to approximate the distribution of allele frequencies under general, parent-dependent mutation models. Coalescent simulations show that substitution parameters are well estimated even when genomegaMap's simplifying assumption of independence among sites is violated. I demonstrate the ability of genomegaMap to detect genuine signatures of selection at antimicrobial resistance-conferring substitutions in Mycobacterium tuberculosis and describe a novel signature of selection in the cold-shock DEAD-box protein A gene deaD/csdA. The genomegaMap approach helps accelerate the exploitation of big data for gaining new insights into evolution within species.
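
To make the quantity itself concrete, the sketch below computes a crude counting-based dN/dS ratio from pairs of reference and observed codons. This is explicitly not genomegaMap, which the abstract describes as a phylogeny-free method built on population genetics theory. It is a much simpler per-site counting illustration, and it makes three simplifying assumptions: the standard codon table, codon pairs that differ at no more than one position, and changes to stop codons counted as nonsynonymous. All function names and the example codons are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order (the amino acid assignments are shared
# by the bacterial translation table).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def synonymous_sites(codon):
    """Expected number of synonymous sites (0 to 3) in one codon.

    Each position contributes the fraction of its three possible single-base
    changes that leave the encoded amino acid unchanged.
    """
    total = 0.0
    for pos in range(3):
        for base in BASES:
            if base == codon[pos]:
                continue
            mutant = codon[:pos] + base + codon[pos + 1:]
            if CODON_TABLE[mutant] == CODON_TABLE[codon]:
                total += 1.0 / 3.0
    return total


def crude_dn_ds(pairs):
    """Crude dN/dS from (reference_codon, observed_codon) pairs.

    Assumes each pair differs at no more than one nucleotide position.
    """
    syn_sites = sum(synonymous_sites(ref) for ref, _ in pairs)
    non_sites = 3 * len(pairs) - syn_sites
    syn_changes = sum(1 for r, o in pairs if r != o and CODON_TABLE[r] == CODON_TABLE[o])
    non_changes = sum(1 for r, o in pairs if r != o and CODON_TABLE[r] != CODON_TABLE[o])
    if syn_changes == 0 or syn_sites == 0 or non_sites == 0:
        raise ValueError("dN/dS is undefined without synonymous changes and both site classes")
    d_n = non_changes / non_sites
    d_s = syn_changes / syn_sites
    return d_n / d_s


if __name__ == "__main__":
    # Hypothetical example: one silent change (Leu to Leu), one replacement
    # (Ser to Leu), and one unchanged codon.
    example = [("CTG", "CTA"), ("TCG", "TTG"), ("ATG", "ATG")]
    print("crude dN/dS:", crude_dn_ds(example))
```

A ratio above 1 suggests an excess of amino acid-replacing changes and a ratio below 1 suggests constraint, which is the interpretation the abstract gives. With so few codons the number is not meaningful in practice.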

UNSWorks

Spiral - Imperial College Digital Repository

ScholarBank@NUS