57 research outputs found
Consistent Differential Expression Pattern (CDEP) on microarray to identify genes related to metastatic behavior
<p>Abstract</p> <p>Background</p> <p>To utilize the large volume of gene expression information generated from different microarray experiments, several meta-analysis techniques have been developed. Despite these efforts, there remain significant challenges to effectively increasing the statistical power and decreasing the Type I error rate while pooling the heterogeneous datasets from public resources. The objective of this study is to develop a novel meta-analysis approach, Consistent Differential Expression Pattern (CDEP), to identify genes with common differential expression patterns across different datasets.</p> <p>Results</p> <p>We combined False Discovery Rate (FDR) estimation and the non-parametric RankProd approach to estimate the Type I error rate in each microarray dataset of the meta-analysis. These Type I error rates from all datasets were then used to identify genes with common differential expression patterns. Our simulation study showed that CDEP achieved higher statistical power and maintained low Type I error rate when compared with two recently proposed meta-analysis approaches. We applied CDEP to analyze microarray data from different laboratories that compared transcription profiles between metastatic and primary cancer of different types. Many genes identified as differentially expressed consistently across different cancer types are in pathways related to metastatic behavior, such as ECM-receptor interaction, focal adhesion, and blood vessel development. We also identified novel genes such as <it>AMIGO2</it>, <it>Gem</it>, and <it>CXCL11 </it>that have not been shown to associate with, but may play roles in, metastasis.</p> <p>Conclusions</p> <p>CDEP is a flexible approach that borrows information from each dataset in a meta-analysis in order to identify genes being differentially expressed consistently. We have shown that CDEP can gain higher statistical power than other existing approaches under a variety of settings considered in the simulation study, suggesting its robustness and insensitivity to data variation commonly associated with microarray experiments.</p> <p><b>Availability</b>: CDEP is implemented in R and freely available at: <url>http://genomebioinfo.musc.edu/CDEP/</url></p> <p><b>Contact</b>: [email protected]</p
Differential expression of mitogen activating protein kinases in periodontitis
Aim Following tollâlike receptor ( TLR ) engagement, lipopolysaccharide ( LPS ) can stimulate the expression of proâinflammatory cytokines thus activating the innate immune response. The production of inflammatory cytokines results, in part, from the activation of kinaseâinduced signalling cascades and transcriptional factors. Of the four distinct classes of mitogenâactivated protein kinases ( MAPK ) described in mammals, p38, câJun Nâterminal activated kinases ( JNK 1â3) and extracellular activated kinases ( ERK 1,2) are the best studied. Previous data have established that p38 MAPK signalling is required for inflammation and bone loss in periodontal disease preâclinical animal models. Materials & Methods In this study, we obtained healthy and diseased periodontal tissues along with clinical parameters and microbiological parameters. Excised fixed tissues were immunostained with total and phosphoâspecific antibodies against p38, JNK and ERK kinases. Results Intensity scoring from immunostained tissues was correlated with clinical periodontal parameters. Rank correlations with clinical indices were statistically significantly positive ( p âvalue < 0.05) for total p38 (correlations ranging 0.49â0.68), phosphoâp38 (range 0.44â0.56), and total ERK (range 0.52â0.59) levels, and correlations with JNK levels also supported association (range 0.42â0.59). Phosphoâ JNK and phosphoâ ERK showed no significant positive correlation with clinical parameters of disease. Conclusion These data strongly implicate p38 MAPK as a major MAPK involved in human periodontal inflammation and severity.Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/98997/1/jcpe12123.pd
Deep Historical Borrowing Framework to Prospectively and Simultaneously Synthesize Control Information in Confirmatory Clinical Trials with Multiple Endpoints
In current clinical trial development, historical information is receiving
more attention as providing value beyond sample size calculation.
Meta-analytic-predictive (MAP) priors and robust MAP priors have been proposed
for prospectively borrowing historical data on a single endpoint. To
simultaneously synthesize control information from multiple endpoints in
confirmatory clinical trials, we propose to approximate posterior probabilities
from a Bayesian hierarchical model and estimate critical values by deep
learning to construct pre-specified decision functions before the trial
conduct. Simulation studies and a case study demonstrate that our method
additionally preserves power, and has a satisfactory performance under
prior-data conflict
A multivariate prediction model for microarray cross-hybridization
BACKGROUND: Expression microarray analysis is one of the most popular molecular diagnostic techniques in the post-genomic era. However, this technique faces the fundamental problem of potential cross-hybridization. This is a pervasive problem for both oligonucleotide and cDNA microarrays; it is considered particularly problematic for the latter. No comprehensive multivariate predictive modeling has been performed to understand how multiple variables contribute to (cross-) hybridization. RESULTS: We propose a systematic search strategy using multiple multivariate models [multiple linear regressions, regression trees, and artificial neural network analyses (ANNs)] to select an effective set of predictors for hybridization. We validate this approach on a set of DNA microarrays with cytochrome p450 family genes. The performance of our multiple multivariate models is compared with that of a recently proposed third-order polynomial regression method that uses percent identity as the sole predictor. All multivariate models agree that the 'most contiguous base pairs between probe and target sequences,' rather than percent identity, is the best univariate predictor. The predictive power is improved by inclusion of additional nonlinear effects, in particular target GC content, when regression trees or ANNs are used. CONCLUSION: A systematic multivariate approach is provided to assess the importance of multiple sequence features for hybridization and of relationships among these features. This approach can easily be applied to larger datasets. This will allow future developments of generalized hybridization models that will be able to correct for false-positive cross-hybridization signals in expression experiments
A multivariate prediction model for microarray cross-hybridization
© 2006 Chen et al; licensee BioMed Central Ltd.The electronic version of this article is the complete one and can be found online at: http://www.biomedcentral.com/1471-2105/7/101doi:10.1186/1471-2105-7-101Background: Expression microarray analysis is one of the most popular molecular diagnostic
techniques in the post-genomic era. However, this technique faces the fundamental problem of
potential cross-hybridization. This is a pervasive problem for both oligonucleotide and cDNA
microarrays; it is considered particularly problematic for the latter. No comprehensive multivariate
predictive modeling has been performed to understand how multiple variables contribute to
(cross-) hybridization.
Results: We propose a systematic search strategy using multiple multivariate models [multiple
linear regressions, regression trees, and artificial neural network analyses (ANNs)] to select an
effective set of predictors for hybridization. We validate this approach on a set of DNA
microarrays with cytochrome p450 family genes. The performance of our multiple multivariate
models is compared with that of a recently proposed third-order polynomial regression method
that uses percent identity as the sole predictor. All multivariate models agree that the 'most
contiguous base pairs between probe and target sequences,' rather than percent identity, is the
best univariate predictor. The predictive power is improved by inclusion of additional nonlinear
effects, in particular target GC content, when regression trees or ANNs are used.
Conclusion: A systematic multivariate approach is provided to assess the importance of multiple
sequence features for hybridization and of relationships among these features. This approach can
easily be applied to larger datasets. This will allow future developments of generalized hybridization
models that will be able to correct for false-positive cross-hybridization signals in expression
experiments
Shared genetics underlying epidemiological association between endometriosis and ovarian cancer
Epidemiological studies have demonstrated associations between endometriosis and certain histotypes of ovarian cancer, including clear cell, low-grade serous and endometrioid carcinomas. We aimed to determine whether the observed associations might be due to shared genetic aetiology. To address this, we used two endometriosis datasets genotyped on common arrays with full-genome coverage (3194 cases and 7060 controls) and a large ovarian cancer dataset genotyped on the customized Illumina Infinium iSelect (iCOGS) arrays (10 065 cases and 21 663 controls). Previous work has suggested that a large number of genetic variants contribute to endometriosis and ovarian cancer (all histotypes combined) susceptibility. Here, using the iCOGS data, we confirmed polygenic architecture for most histotypes of ovarian cancer. This led us to evaluate if the polygenic effects are shared across diseases. We found evidence for shared genetic risks between endometriosis and all histotypes of ovarian cancer, except for the intestinal mucinous type. Clear cell carcinoma showed the strongest genetic correlation with endometriosis (0.51, 95% CI = 0.18-0.84). Endometrioid and low-grade serous carcinomas had similar correlation coefficients (0.48, 95% CI = 0.07-0.89 and 0.40, 95% CI = 0.05-0.75, respectively). High-grade serous carcinoma, which often arises from the fallopian tubes, showed a weaker genetic correlation with endometriosis (0.25, 95% CI = 0.11-0.39), despite the absence of a known epidemiological association. These results suggest that the epidemiological association between endometriosis and ovarian adenocarcinoma may be attributable to shared genetic susceptibility loci.Other Research Uni
Common variants at theCHEK2gene locus and risk of epithelial ovarian cancer
Genome-wide association studies have identified 20 genomic regions associated with risk of epithelial ovarian cancer (EOC), but many additional risk variants may exist. Here, we evaluated associations between common genetic variants [single nucleotide polymorphisms (SNPs) and indels] in DNA repair genes and EOC risk. We genotyped 2896 common variants at 143 gene loci in DNA samples from 15 397 patients with invasive EOC and controls. We found evidence of associations with EOC risk for variants at FANCA, EXO1, E2F4, E2F2, CREB5 and CHEK2 genes (P †0.001). The strongest risk association was for CHEK2 SNP rs17507066 with serous EOC (P = 4.74 x 10(-7)). Additional genotyping and imputation of genotypes from the 1000 genomes project identified a slightly more significant association for CHEK2 SNP rs6005807 (r (2) with rs17507066 = 0.84, odds ratio (OR) 1.17, 95% CI 1.11-1.24, P = 1.1Ă10(-7)). We identified 293 variants in the region with likelihood ratios of less than 1:100 for representing the causal variant. Functional annotation identified 25 candidate SNPs that alter transcription factor binding sites within regulatory elements active in EOC precursor tissues. In The Cancer Genome Atlas dataset, CHEK2 gene expression was significantly higher in primary EOCs compared to normal fallopian tube tissues (P = 3.72Ă10(-8)). We also identified an association between genotypes of the candidate causal SNP rs12166475 (r (2) = 0.99 with rs6005807) and CHEK2 expression (P = 2.70Ă10(-8)). These data suggest that common variants at 22q12.1 are associated with risk of serous EOC and CHEK2 as a plausible target susceptibility gene.Other Research Uni
Recommended from our members
Consortium analysis of 7 candidate SNPs for ovarian cancer
The Ovarian Cancer Association Consortium selected 7 candidate single nucleotide polymorphisms (SNPs), for which there is evidence from previous studies of an association with variation in ovarian cancer or breast cancer risks. The SNPs selected for analysis were F31I (rs2273535) in AURKA, N372H (rs144848) in BRCA2, rs2854344 in intron 17 of RB1, rs2811712 5âČ flanking CDKN2A, rs523349 in the 3âČ UTR of SRD5A2, D302H (rs1045485) in CASP8 and L10P (rs1982073) in TGFB1. Fourteen studies genotyped 4,624 invasive epithelial ovarian cancer cases and 8,113 controls of white non-Hispanic origin. A marginally significant association was found for RB1 when all studies were included [ordinal odds ratio (OR) 0.88 (95% confidence interval (CI) 0.79-1.00) p = 0.041 and dominant OR 0.87 (95% CI 0.76-0.98) p = 0.025]; when the studies that originally suggested an association were excluded, the result was suggestive although no longer statistically significant (ordinal OR 0.92, 95% CI 0.79-1.06). This SNP has also been shown to have an association with decreased risk in breast cancer. There was a suggestion of an association for AURKA, when one study that caused significant study heterogeneity was excluded [ordinal OR 1.10 (95% CI 1.01-1.20) p = 0.027; dominant OR 1.12 (95% CI 1.01-1.24) p = 0.03]. The other 5 SNPs in BRCA2, CDKN2A, SRD5A2, CASP8 and TGFB1 showed no association with ovarian cancer risk; given the large sample size, these results can also be considered to be informative. These null results for SNPs identified from relatively large initial studies shows the importance of replicating associations by a consortium approach
Recommended from our members
Risk of Ovarian Cancer and the NF-Â B Pathway: Genetic Association with IL1A and TNFSF10
A missense single-nucleotide polymorphism (SNP) in the immune modulatory gene IL1A has been associated with ovarian cancer risk (rs17561). Although the exact mechanism through which this SNP alters risk of ovarian cancer is not clearly understood, rs17561 has also been associated with risk of endometriosis, an epidemiologic risk factor for ovarian cancer. Interleukin-1α (IL1A) is both regulated by and able to activate NF-ÎșB, a transcription factor family that induces transcription of many proinflammatory genes and may be an important mediator in carcinogenesis. We therefore tagged SNPs in more than 200 genes in the NF-ÎșB pathway for a total of 2,282 SNPs (including rs17561) for genotype analysis of 15,604 cases of ovarian cancer in patients of European descent, including 6,179 of high-grade serous (HGS), 2,100 endometrioid, 1,591 mucinous, 1,034 clear cell, and 1,016 low-grade serous, including 23,235 control cases spanning 40 studies in the Ovarian Cancer Association Consortium. In this large population, we confirmed the association between rs17561 and clear cell ovarian cancer [OR, 0.84; 95% confidence interval (CI), 0.76-0.93; P = 0.00075], which remained intact even after excluding participants in the prior study (OR, 0.85; 95% CI, 0.75-0.95; P = 0.006). Considering a multiple-testing-corrected significance threshold of P < 2.5 Ă 10(-5), only one other variant, the TNFSF10 SNP rs6785617, was associated significantly with a risk of ovarian cancer (low malignant potential tumors OR, 0.85; 95% CI, 0.79-0.91; P = 0.00002). Our results extend the evidence that borderline tumors may have a distinct genetic etiology. Further investigation of how these SNPs might modify ovarian cancer associations with other inflammation-related risk factors is warranted.Other Research Uni
Recommended from our members
Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk
BACKGROUND: Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by coexpression may also be enriched for additional EOC risk associations.
METHODS: We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls).
RESULTS: Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network.
CONCLUSION: We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development.
IMPACT: Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization
- âŠ