Search CORE

60 research outputs found

A Multinational Analysis of Mutations and Heterogeneity in PZase, RpsA, and PanD Associated with Pyrazinamide Resistance in M/XDR Mycobacterium tuberculosis.

Author: Catanzaro A
Catanzaro D
Fink L
Jackson RL
Pettigrove M
Ramirez-Busby SM
Rodwell TC
Valafar F
Publication venue: eScholarship, University of California
Publication date: 01/01/2017
Field of study

Pyrazinamide (PZA) is an important first-line drug in all existing and new tuberculosis (TB) treatment regimens. PZA-resistance in M. tuberculosis is increasing, especially among M/XDR cases. Noted issues with PZA Drug Susceptibility Testing (DST) have driven the search for alternative tests. This study provides a comprehensive assessment of PZA molecular diagnostics in M/XDR TB cases. A set of 296, mostly XDR, clinical M. tuberculosis isolates from four countries were subjected to DST for eight drugs, confirmatory Wayne's assay, and whole-genome sequencing. Three genes implicated in PZA resistance, pncA, rpsA, and panD were investigated. Assuming all non-synonymous mutations cause resistance, we report 90% sensitivity and 65% specificity for a pncA-based molecular test. The addition of rpsA and panD potentially provides 2% increase in sensitivity. Molecular heterogeneity in pncA was associated with resistance and should be evaluated as a diagnostic tool. Mutations near the N-terminus and C-terminus of PZase were associated with East-Asian and Euro-American lineages, respectively. Finally, Euro-American isolates are most likely to have a wild-type PZase and escape molecular detection. Overall, the 8-10% resistance without markers may point to alternative mechanisms of resistance. Confirmatory mutagenesis may improve the disconcertingly low specificity but reduce sensitivity since not all mutations may cause resistance

Crossref

eScholarship - University of California

Examining the Classification Accuracy of TSVMs with Feature Selection in Comparison with the GLAD Algorithm

Author: A Gommerman
A S M Yong
A Zien
C Harris
F Valafar
Hala Helmi
I Guyon
J Han
Jonathan M. Garibaldi
K Bennett
M A Shipp
M P Brown
R Collobert
R Zhang
S Abney
T Jaakkola
T Joachims
T Joachims
T R Golub
Uwe Aickelin
X Zhu
Publication venue: 'Elsevier BV'
Publication date: 01/01/2011
Field of study

Gene expression data sets are used to classify and predict patient diagnostic categories. As we know, it is extremely difficult and expensive to obtain gene expression labelled examples. Moreover, conventional supervised approaches cannot function properly when labelled data (training examples) are insufficient using Support Vector Machines (SVM) algorithms. Therefore, in this paper, we suggest Transductive Support Vector Machines (TSVMs) as semi-supervised learning algorithms, learning with both labelled samples data and unlabelled samples to perform the classification of microarray data. To prune the superfluous genes and samples we used a feature selection method called Recursive Feature Elimination (RFE), which is supposed to enhance the output of classification and avoid the local optimization problem. We examined the classification prediction accuracy of the TSVM-RFE algorithm in comparison with the Genetic Learning Across Datasets (GLAD) algorithm, as both are semi-supervised learning methods. Comparing these two methods, we found that the TSVM-RFE surpassed both a SVM using RFE and GLAD

Nottingham eTheses

Crossref

University of Melbourne Institutional Repository

The Global Consortium for Drug-resistant Tuberculosis Diagnostics (GCDD): design of a multi-site, head-to-head study of three rapid tests to detect extensively drug-resistant tuberculosis

Author: Andre Trollip
Antonino Catanzaro
AP Trollip
AR Escombe
AS Dharmadhikari
AS Fauci
C Dye
C Rodrigues
Camilla Rodrigues
D Almeida
DAJ Moore
Daniel Park
Donald Catanzaro
Erik J Groessl
F Brossier
Faramarz Valafar
HE Jenkins
Kathleen Eisenach
LTC Bravo
Lynn Jackson
M Barnard
MA Diggle
Naomi Hillery
Richard S Garfein
S-Y Grace Lin
SYG Lin
SYG Lin
TC Rodwell
Theodore G Ganiats
Thomas C Victor
Timothy C Rodwell
Valeriu Crudu
VS Kiet
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

The structure of the KlcA and ArdB proteins reveals a novel fold and antirestriction activity against Type I DNA restriction systems in vivo but not in vitro

Author: Adamczyk
Alexandrov
Altschul
Andrew P. Herbert
Atanasiu
Bairoch
Bandyopadhyay
Belogurov
Belogurov
Belogurov
Bennett-Lovsey
Berman
Blakely
Bond
Brunger
Burlage
Cazalet
Combet
Cornilescu
David T. F. Dryden
Dimitra Serfiotis-Mitsa
Dinesh C. Soares
Dosset
Dryden
Dunn
Dušan Uhrín
Fraczkiewicz
Frishman
Gareth A. Roberts
Garry W. Blakely
Gill
Glaser
Grzesiek
Guntert
Harada
Heiden
Herrmann
Holm
Hutchinson
Hutchinson
John H. White
Kamachi
Kay
Kennaway
Koradi
Krissinel
Kumar
Larsen
Laskowski
Loenen
Makovets
Makovets
McMahon
Muhandiram
Murray
Nekrasov
Nicholls
Ottiger
Pei
Rasko
Riley
Roberts
Saltman
Serfiotis-Mitsa
Sklenar
Stephanou
Stephanou
Studier
Thomas
Thomas
Tjandra
Tock
Uhrinova
Ulrich
Valafar
Vedler
Vranken
Vriend
Walkinshaw
Webb
Welch
Wilkins
Yamazaki
Yip
Zavilgelsky
Zavilgelsky
Publication venue: Oxford University Press
Publication date: 01/03/2010
Field of study

Plasmids, conjugative transposons and phage frequently encode anti-restriction proteins to enhance their chances of entering a new bacterial host that is highly likely to contain a Type I DNA restriction and modification (RM) system. The RM system usually destroys the invading DNA. Some of the anti-restriction proteins are DNA mimics and bind to the RM enzyme to prevent it binding to DNA. In this article, we characterize ArdB anti-restriction proteins and their close homologues, the KlcA proteins from a range of mobile genetic elements; including an ArdB encoded on a pathogenicity island from uropathogenic Escherichia coli and a KlcA from an IncP-1b plasmid, pBP136 isolated from Bordetella pertussis. We show that all the ArdB and KlcA act as anti-restriction proteins and inhibit the four main families of Type I RM systems in vivo, but fail to block the restriction endonuclease activity of the archetypal Type I RM enzyme, EcoKI, in vitro indicating that the action of ArdB is indirect and very different from that of the DNA mimics. We also present the structure determined by NMR spectroscopy of the pBP136 KlcA protein. The structure shows a novel protein fold and it is clearly not a DNA structural mimic

Crossref

PubMed Central

Edinburgh Research Explorer

Empirical comparison of cross-platform normalization methods for gene expression data

Author: A Platts
A Ramasamy
A Shabalin
Applied Biosystems
B Bolstad
C Metz
C Yauk
D Hekstra
D Petersen
D Rhodes
E Glaab
F Hong
Faramarz Valafar
G Elvidge
G Hardiman
G Held
G Smyth
H Jiang
H Parkinson
H Yasrebi
I Borozan
I Wick
J Larkin
J Storey
J Storey
J Tuszynski
Jason Rudy
JM Chambers
K Kugler
K Noguchi
L Gautier
L Shi
L Shi
LIK Lin
M Barnes
M Benito
M Mulligan
M Schena
P Tan
P Warnat
P Wirapati
R Development Core Team
R Gentleman
R Grützmann
R Kothapalli
R Lacson
R Martinez
S Assou
S Carter
S Davis
S Rogic
T Barrett
VE Velculescu
W Kuo
W Kuo
W Walker
X Chi
XQ Xia
Y Woo
Z Hu
Z Wang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Simultaneous measurement of gene expression on a genomic scale can be accomplished using microarray technology or by sequencing based methods. Researchers who perform high throughput gene expression assays often deposit their data in public databases, but heterogeneity of measurement platforms leads to challenges for the combination and comparison of data sets. Researchers wishing to perform cross platform normalization face two major obstacles. First, a choice must be made about which method or methods to employ. Nine are currently available, and no rigorous comparison exists. Second, software for the selected method must be obtained and incorporated into a data analysis workflow. Results Using two publicly available cross-platform testing data sets, cross-platform normalization methods are compared based on inter-platform concordance and on the consistency of gene lists obtained with transformed data. Scatter and ROC-like plots are produced and new statistics based on those plots are introduced to measure the effectiveness of each method. Bootstrapping is employed to obtain distributions for those statistics. The consistency of platform effects across studies is explored theoretically and with respect to the testing data sets. Conclusions Our comparisons indicate that four methods, DWD, EB, GQ, and XPN, are generally effective, while the remaining methods do not adequately correct for platform effects. Of the four successful methods, XPN generally shows the highest inter-platform concordance when treatment groups are equally sized, while DWD is most robust to differently sized treatment groups and consistently shows the smallest loss in gene detection. We provide an R package, CONOR, capable of performing the nine cross-platform normalization methods considered. The package can be downloaded at <url>http://alborz.sdsu.edu/conor</url> and is available from CRAN.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Post hoc pattern matching: assigning significance to statistically defined expression patterns in single channel microarray data

Abstract Background Researchers using RNA expression microarrays in experimental designs with more than two treatment groups often identify statistically significant genes with ANOVA approaches. However, the ANOVA test does not discriminate which of the multiple treatment groups differ from one another. Thus, <it>post hoc </it>tests, such as linear contrasts, template correlations, and pairwise comparisons are used. Linear contrasts and template correlations work extremely well, especially when the researcher has <it>a priori </it>information pointing to a particular pattern/template among the different treatment groups. Further, all pairwise comparisons can be used to identify particular, treatment group-dependent patterns of gene expression. However, these approaches are biased by the researcher's assumptions, and some treatment-based patterns may fail to be detected using these approaches. Finally, different patterns may have different probabilities of occurring by chance, importantly influencing researchers' conclusions about a pattern and its constituent genes. Results We developed a four step, <it>post hoc </it>pattern matching (PPM) algorithm to automate single channel gene expression pattern identification/significance. First, 1-Way Analysis of Variance (ANOVA), coupled with <it>post hoc </it>'all pairwise' comparisons are calculated for all genes. Second, for each ANOVA-significant gene, all pairwise contrast results are encoded to create unique pattern ID numbers. The # genes found in each pattern in the data is identified as that pattern's 'actual' frequency. Third, using Monte Carlo simulations, those patterns' frequencies are estimated in random data ('random' gene pattern frequency). Fourth, a Z-score for overrepresentation of the pattern is calculated ('actual' against 'random' gene pattern frequencies). We wrote a Visual Basic program (StatiGen) that automates PPM procedure, constructs an Excel workbook with standardized graphs of overrepresented patterns, and lists of the genes comprising each pattern. The visual basic code, installation files for StatiGen, and sample data are available as supplementary material. Conclusion The PPM procedure is designed to augment current microarray analysis procedures by allowing researchers to incorporate all of the information from post hoc tests to establish unique, overarching gene expression patterns in which there is no overlap in gene membership. In our hands, PPM works well for studies using from three to six treatment groups in which the researcher is interested in treatment-related patterns of gene expression. Hardware/software limitations and extreme number of theoretical expression patterns limit utility for larger numbers of treatment groups. Applied to a published microarray experiment, the StatiGen program successfully flagged patterns that had been manually assigned in prior work, and further identified other gene expression patterns that may be of interest. Thus, over a moderate range of treatment groups, PPM appears to work well. It allows researchers to assign statistical probabilities to patterns of gene expression that fit <it>a priori </it>expectations/hypotheses, it preserves the data's ability to show the researcher interesting, yet unanticipated gene expression patterns, and assigns the majority of ANOVA-significant genes to non-overlapping patterns.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

University of Kentucky

Towards human-computer synergetic analysis of large-scale biological data

Author: A Andreeva
A Butte
A Perer
A Prlić
A Sturn
A Tagmount
AI Saeed
B Dalziel
B Shneiderman
Ben Dalziel
C Orengo
Daniel Asarnow
David Foote
DB Allison
DH Jeong
EH Baehrecke
F Valafar
H Hochheiser
Hui Yang
I Vessey
J Ernst
J Hou
J Hou
J Seo
JC Pinzon
Jonathan Stillman
K Aas
KS Teranishi
L Holm
M Kanehisa
M Osadchy
MA Hibbs
Matthew Gormley
MB Eisen
P Craig
P Malik
R Jain
R Singh
R Singh
R Singh
R Singh
R Tibshirani
Rahul Singh
RC Gentleman
SG Hart
Susan Fisher
T Hastie
TPS Chan
VD Winn
VG Tusher
W Tong
William Murad
X Wang
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Genetic code as a Gray code revisited

Author: Bosnacki D.
Eikelder ten, H.M.M.
Hilbers P.A.J.
Valafar F.
Valafar H.
Publication venue: CSREA Press
Publication date: 01/01/2003
Field of study

Predicting the Effectiveness of Hydroxyurea in Individual Sickle Cell Anemia Patients

Author: Albersheim Peter
Darville Alan
Hardin John
Kutlar Abdullah
Valafar Faramarz
Valafar Homayoun
Woods Kristy F.
Publication venue: Scholar Commons
Publication date: 01/02/2000
Field of study

The study described in this paper was undertaken to develop the ability to predict the response of sickle-cell patients to hydroxyurea (HU) therapy. We analyzed the effect of HU on the values of 23 parameters of 83 patients. A Student’s t-test was used to confirm (Rodgers GP, Dover GJ, Noguchi CT, Schechter AN, Nienhuis AW. Hematologic responses of patients with sickle cell disease to treatment with hydroxyurea, N Engl J Med 1990;322;1037–44) at the 0.001 level that treatment with HU increases the proportion of fetal hemoglobin (HbF), and the average corpuscular volume (MCV) of the red blood cells. Correlation analysis failed to establish a statistically significant relationship between any of the 23 parameters and the HbF response. Linear regression analysis also failed to predict a patient’s response to HU. On the other hand, artificial neural network (ANN) pattern-recognition analysis of the 23 parameters predicts, with 86.6% accuracy, those patients that respond positively to HU and those that do not. Furthermore, we have found that the values of only 10 of the 23 parameters (listed in the body of this paper) are sufficient to train ANNs to predict which patients will respond to HU

Scholar Commons - Institutional Repository of the University of South Carolina