19 research outputs found
A novel application of pattern recognition for accurate SNP and indel discovery from high-throughput data: Targeted resequencing of the glucocorticoid receptor co-chaperone FKBP5 in a Caucasian population
The detection of single nucleotide polymorphisms (SNPs) and insertion/deletions (indels) with precision from high-throughput data remains a significant bioinformatics challenge. Accurate detection is necessary before next-generation sequencing can routinely be used in the clinic. In research, scientific advances are inhibited by gaps in data, exemplified by the underrepresented discovery of rare variants, variants in non-coding regions and indels. The continued presence of false positives and false negatives prevents full automation and requires additional manual verification steps. Our methodology presents applications of both pattern recognition and sensitivity analysis to eliminate false positives and aid in the detection of SNP/indel loci and genotypes from high-throughput data. We chose FK506-binding protein 51(FKBP5) (6p21.31) for our clinical target because of its role in modulating pharmacological responses to physiological and synthetic glucocorticoids and because of the complexity of the genomic region. We detected genetic variation across a160 kb region encompassing FKBP5. 613 SNPs and 57 indels, including a 3.3 kb deletion were discovered. We validated our method using three independent data sets and, with Sanger sequencing and Affymetrix and Illumina microarrays, achieved 99% concordance. Furthermore we were able to detect 267 novel rare variants and assess linkage disequilibrium. Our results showed both a sensitivity and specificity of 98%, indicating near perfect classification between true and false variants. The process is scalable and amenable to automation, with the downstream filters taking only 1.5 hours to analyze 96 individuals simultaneously. We provide examples of how our level of precision uncovered the interactions of multiple loci, their predicted influences on mRNA stability, perturbations of the hsp90 binding site, and individual variation in FKBP5 expression. Finally we show how our discovery of rare variants may change current conceptions of evolution at this locus
Functional Genetic Polymorphisms in the Aromatase Gene CYP19 Vary the Response of Breast Cancer Patients to Neoadjuvant Therapy with Aromatase Inhibitors
Aromatase (CYP19) is a critical enzyme for estrogen biosynthesis, and aromatase inhibitors (AIs) are established endocrine therapy for post-menopausal women with breast cancer. DNA samples were obtained from 52 women pre- and post-AI treatment in the neoadjuvant setting. 82 breast cancer and 19 normal breast samples were resequenced to test the hypothesis that single nucleotide polymorphisms (SNPs) in the CYP19 gene might contribute to response to neoadjuvant AI therapy. There were no differences in CYP19 sequence between tumor and germline DNA in the same patient. Forty-eight CYP19 SNPs were identified, with four being novel when compared with previous resequencing data. Genotype-phenotype association studies performed with levels of aromatase activity, estrone, estradiol and tumor size pre- and post-AI treatment indicated that two tightly linked SNPs, rs6493497 and rs7176005 in the 5’-flanking region of CYP19 exon 1.1, were significantly associated with a greater change in aromatase activity after AI treatment. A follow-up study in 200 women with early breast cancer treated with adjuvant anastrozole showed that these same two SNPs were also associated with higher plasma estradiol levels pre- and post-AI treatment. Electrophoretic mobility shift and reporter gene assays confirmed the potential functional effects of these two SNPs on transcription regulation. These studies provide insight into the role of common genetic polymorphisms in CYP19 in variation in response to AIs by breast cancer patients
Glutathione Stransferase omega 1 and omega 2 pharmacogenomics. Drug Metab Dispos
ABSTRACT Glutathione S-transferase omega 1 and omega 2 (GSTO1 and GSTO2) catalyze monomethyl arsenate reduction, the rate-limiting reaction in arsenic biotransformation. As a step toward pharmacogenomic studies of these phase II enzymes, we resequenced human GSTO1 and GSTO2 using DNA samples from four ethnic groups. We identified 31 and 66 polymorphisms in GSTO1 and GSTO2, respectively, with 4 nonsynonymous coding single nucleotide polymorphisms (cSNPs) in each gene. There were striking variations among ethnic groups in polymorphism frequencies and types. Expression constructs were created for all eight nonsynonymous cSNPs as well as a deletion of codon 155 in GSTO1, and those constructs were used to transfect COS-1 cells. Quantitative Western blot analysis, after correction for transfection efficiency, showed a reduction in protein level of greater than 50% for the GSTO1 Tyr32 variant allozyme when compared with wild type (WT), while levels for the Asp140, Lys208, Val236 and codon 155 deletion variant constructs were similar to that of the WT. For GSTO2, the Tyr130 and Ile158 variant allozymes showed 50% and 84% reductions in levels of expression, respectively, when compared to WT, while the Ile41 and Asp142 allozymes displayed levels similar to that of WT GSTO2. Rabbit reticulocyte lysate (RRL) degradation studies showed that the GSTO1 Tyr32 and the GSTO2 Tyr130, Ile158 and Asp142/Ile158 variant allozymes were degraded more rapidly than were their respective WT allozymes. These observations raise the possibility of functionally significant pharmacogenomic variation in the expression and function of GSTO1 and GSTO2
Cytosolic 5 '-nucleotidase III (NT5C3): gene sequence variation and functional genomics
Background 5'-Nucleotidases play a critical role in nucleotide pool balance and in the metabolism of nucleoside analogs such as gemcitabine and cytosine arabinoside (AraC). We previously performed an expression array association study with gemcitabine and AraC cytotoxicity using 197 human lymphoblastoid cell lines. One gene that was significantly associated with gemcitabine cytotoxicity was a nucleotidase family member, NT5C3. Very little is known with regard to the pharmacogenomics of this family of enzymes
Acetaminophen-NAPQI Hepatotoxicity: A Cell Line Model System Genome-Wide Association Study
Acetaminophen is the leading cause of acute hepatic failure in many developed nations. Acetaminophen hepatotoxicity is mediated by the reactive metabolite N-acetyl-p-benzoquinonimine (NAPQI). We performed a “discovery” genome-wide association study using a cell line–based model system to study the possible contribution of genomics to NAPQI-induced cytotoxicity. A total of 176 lymphoblastoid cell lines from healthy subjects were treated with increasing concentrations of NAPQI. Inhibiting concentration 50 values were determined and were associated with “glutathione pathway” gene single nucleotide polymorphisms (SNPs) and genome-wide basal messenger RNA expression, as well as with 1.3 million genome-wide SNPs. A group of SNPs in linkage disequilibrium on chromosome 3 was highly associated with NAPQI toxicity. The p value for rs2880961, the SNP with the lowest p value, was 1.88 × 10−7. This group of SNPs mapped to a “gene desert,” but chromatin immunoprecipitation assays demonstrated binding of several transcription factor proteins including heat shock factor 1 (HSF1) and HSF2, at or near rs2880961. These chromosome 3 SNPs were not significantly associated with variation in basal expression for any of the genome-wide genes represented on the Affymetrix U133 Plus 2.0 GeneChip. We have used a cell line–based model system to identify a SNP signal associated with NAPQI cytotoxicity. If these observations are validated in future clinical studies, this SNP signal might represent a potential biomarker for risk of acetaminophen hepatotoxicity. The mechanisms responsible for this association remain unclear
Gemcitabine pharmacogenomics: Deoxycytidine kinase and cytidylate kinase gene resequencing and functional genomics
Gemcitabine and other cytidine antimetabolites require metabolic activation by phosphorylation. Deoxycytidine kinase (DCK) and cytidine monophosphate kinase (CMPK) catalyze these reactions. We have applied a genotype-to-phenotype strategy to study DCK and CMPK pharmacogenomics. Specifically, we resequenced DCK and CMPK using 240 DNA samples, 60 each from African-American, Caucasian-American, Han Chinese-American, and Mexican-American subjects. We observed 28 DCK polymorphisms and 28 polymorphisms in CMPK, 33 of which were novel. Expression in COS-1 cells showed that variant allozyme enzyme activities ranged from 32 to 105% of the wild type (WT) for DCK and from 78 to 112% of WT for CMPK-with no significant differences in apparent Km values for either enzyme except for a DCK Val24/Ser122 double variant allozyme. Relative levels of DCK and CMPK immunoreactive protein in the COS-1 cells paralleled relative levels of enzyme activity and were significantly correlated for DCK (R(p) = 0.89, P = 0.0004) but not for CMPK (R(p) = 0.82, P = 0.095). The results of an analysis of DCK and CMPK structural models were compatible with the observed functional consequences of sequence alterations in variant allozymes. We also confirmed that the CMPK protein expressed in COS-1 cells and in a rabbit reticulocyte lysate was 196 rather than 228 amino acids in length. In summary, we determined common sequence variations in DCK and CMPK and systematically evaluated their functional implications. These gene sequence differences may contribute to variations in the metabolic activation of gemcitabine and other cytidine antimetabolites
Gemcitabine pharmacogenomics: Deoxycytidine kinase and cytidylate kinase gene resequencing and functional genomics
Gemcitabine and other cytidine antimetabolites require metabolic activation by phosphorylation. Deoxycytidine kinase (DCK) and cytidine monophosphate kinase (CMPK) catalyze these reactions. We have applied a genotype-to-phenotype strategy to study DCK and CMPK pharmacogenomics. Specifically, we resequenced DCK and CMPK using 240 DNA samples, 60 each from African-American, Caucasian-American, Han Chinese-American, and Mexican-American subjects. We observed 28 DCK polymorphisms and 28 polymorphisms in CMPK, 33 of which were novel. Expression in COS-1 cells showed that variant allozyme enzyme activities ranged from 32 to 105% of the wild type (WT) for DCK and from 78 to 112% of WT for CMPK-with no significant differences in apparent Km values for either enzyme except for a DCK Val24/Ser122 double variant allozyme. Relative levels of DCK and CMPK immunoreactive protein in the COS-1 cells paralleled relative levels of enzyme activity and were significantly correlated for DCK (R(p) = 0.89, P = 0.0004) but not for CMPK (R(p) = 0.82, P = 0.095). The results of an analysis of DCK and CMPK structural models were compatible with the observed functional consequences of sequence alterations in variant allozymes. We also confirmed that the CMPK protein expressed in COS-1 cells and in a rabbit reticulocyte lysate was 196 rather than 228 amino acids in length. In summary, we determined common sequence variations in DCK and CMPK and systematically evaluated their functional implications. These gene sequence differences may contribute to variations in the metabolic activation of gemcitabine and other cytidine antimetabolites