Tissue microarrays: one size does not fit all
BACKGROUND: Although tissue microarrays (TMAs) are commonly employed in clinical and basic-science research, there are no guidelines for evaluating the appropriateness of a TMA for a given biomarker and tumor type. Furthermore, TMA performance across multiple biomarkers has not been systematically explored. METHODS: A simulated TMA with between 1 and 10 cores was designed to study tumor expression of 6 biomarkers with varied expression patterns (B7-H1, B7-H3, survivin, Ki-67, CAIX, and IMP3) using 100 patients with clear cell renal cell carcinoma (RCC). We evaluated agreement between whole tissue section and TMA immunohistochemical biomarker quantification to assess how many TMA cores are necessary to adequately represent RCC whole tissue section expression. Additionally, we evaluated associations of whole tissue section and TMA expression with RCC-specific death. RESULTS: The number of simulated TMA cores necessary to adequately represent whole tissue section quantification is biomarker specific. Although 2-3 cores appeared adequate for B7-H3, Ki-67, CAIX, and IMP3, even as many as 10 cores resulted in poor agreement for B7-H1 and survivin compared to RCC whole tissue sections. While whole tissue section B7-H1 was significantly associated with RCC-specific death, no significant associations were detected using as many as 10 TMA cores, suggesting that TMAs can result in false-negative findings if the TMA is not optimally designed. CONCLUSIONS: Prior to TMA analysis, the number of TMA cores necessary to accurately represent biomarker expression on whole tissue sections should be established as there is not a one-size-fits-all TMA. We illustrate the use of a simulated TMA as a cost-effective tool for this purpose.
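The simulated-TMA idea described in this abstract can be illustrated with a minimal sketch. It assumes, hypothetically, that each patient's whole-section expression is available as a list of per-region values and that a core is a random draw of one region. The function names, the synthetic data, and the use of Pearson correlation as the agreement measure are illustrative assumptions, not the authors' procedure.

```python
import random
from statistics import mean


def simulate_tma(region_values, n_cores, rng):
    """Mean of n_cores regions drawn without replacement (hypothetical core model)."""
    return mean(rng.sample(region_values, n_cores))


def pearson(x, y):
    """Pearson correlation, used here only as a stand-in agreement measure."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def agreement_by_core_count(patients, max_cores, seed=0):
    """Agreement between whole-section means and simulated TMA means for 1..max_cores."""
    rng = random.Random(seed)
    whole = [mean(p) for p in patients]
    result = {}
    for k in range(1, max_cores + 1):
        tma = [simulate_tma(p, k, rng) for p in patients]
        result[k] = pearson(whole, tma)
    return result


def make_patients(n_patients, n_regions, seed=1):
    """Synthetic, purely illustrative expression values in the range 0-100."""
    rng = random.Random(seed)
    patients = []
    for _ in range(n_patients):
        base = rng.uniform(0, 100)
        patients.append(
            [min(100.0, max(0.0, rng.gauss(base, 25))) for _ in range(n_regions)]
        )
    return patients


if __name__ == "__main__":
    agreement = agreement_by_core_count(make_patients(100, 10), max_cores=10)
    for k, r in agreement.items():
        print(f"{k} cores: r = {r:.3f}")
```

Repeating the run for each biomarker would show how quickly agreement rises with the number of cores. A heterogeneously expressed marker would be expected to need more cores than a uniformly expressed one.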
Quality assessment metrics for whole genome gene expression profiling of paraffin embedded samples
BACKGROUND: Formalin-fixed, paraffin-embedded tissues are most commonly used for routine pathology analysis and for long-term tissue preservation in the clinical setting. Many institutions have large archives of formalin-fixed, paraffin-embedded tissues that provide a unique opportunity for understanding genomic signatures of disease. However, genome-wide expression profiling of formalin-fixed, paraffin-embedded samples has been challenging due to RNA degradation. Because of the significant heterogeneity in tissue quality, normalization and analysis of these data present particular challenges. The distribution of intensity values from archival tissues is inherently noisy and skewed due to differential sample degradation, raising two primary concerns: whether a highly skewed array will unduly influence initial normalization of the data and whether outlier arrays can be reliably identified. FINDINGS: Two simple extensions of common regression diagnostic measures are introduced that measure the stress an array undergoes during normalization and how much a given array deviates from the remaining arrays post-normalization. These metrics are applied to a study involving 1618 formalin-fixed, paraffin-embedded HER2-positive breast cancer samples from the N9831 adjuvant trial processed with Illumina’s cDNA-mediated Annealing, Selection, extension and Ligation assay. CONCLUSION: Proper assessment of array quality within a research study is crucial for controlling unwanted variability in the data. The metrics proposed in this paper have direct biological interpretations and can be used to identify arrays that should either be removed from analysis altogether or down-weighted to reduce their influence in downstream analyses
Software comparison for evaluating genomic copy number variation for Affymetrix 6.0 SNP array platform
BACKGROUND: Copy number data are routinely being extracted from genome-wide association study chips using a variety of software. We empirically evaluated and compared four freely available software packages designed for Affymetrix SNP chips to estimate copy number: Affymetrix Power Tools (APT), Aroma.Affymetrix, PennCNV and CRLMM. Our evaluation used 1,418 GENOA samples that were genotyped on the Affymetrix Genome-Wide Human SNP Array 6.0. We compared bias and variance in the locus-level copy number data, the concordance amongst regions of copy number gains/deletions and the false-positive rate amongst deleted segments. RESULTS: APT had median locus-level copy numbers closest to a value of two, whereas PennCNV and Aroma.Affymetrix had the smallest variability associated with the median copy number. Of those evaluated, only PennCNV provides copy number specific quality-control metrics and identified 136 poor CNV samples. Regions of copy number variation (CNV) were detected using the hidden Markov models provided within PennCNV and CRLMM/VanillaIce. PennCNV detected more CNVs than CRLMM/VanillaIce; the median number of CNVs detected per sample was 39 and 30, respectively. PennCNV detected most of the regions that CRLMM/VanillaIce did as well as additional CNV regions. The median concordance between PennCNV and CRLMM/VanillaIce was 47.9% for duplications and 51.5% for deletions. The estimated false-positive rate associated with deletions was similar for PennCNV and CRLMM/VanillaIce. CONCLUSIONS: If the objective is to perform statistical tests on the locus-level copy number data, our empirical results suggest that PennCNV or Aroma.Affymetrix is optimal. If the objective is to perform statistical tests on the summarized segmented data then PennCNV would be preferred over CRLMM/VanillaIce. Specifically, PennCNV allows the analyst to estimate locus-level copy number, perform segmentation and evaluate CNV-specific quality-control metrics within a single software package. PennCNV has relatively small bias, small variability and detects more regions while maintaining a similar estimated false-positive rate as CRLMM/VanillaIce. More generally, we advocate that software developers need to provide guidance with respect to evaluating and choosing optimal settings in order to obtain optimal results for an individual dataset. Until such guidance exists, we recommend trying multiple algorithms, evaluating concordance/discordance and subsequently considering the union of regions for downstream association tests.
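The closing recommendation (compare callers, then take the union of regions) can be sketched as follows. This is a minimal illustration that assumes each call is a hypothetical `(chromosome, start, end)` tuple with half-open coordinates. The abstract does not define its concordance measure, so the overlap-based definition here is an assumption, and neither PennCNV nor CRLMM/VanillaIce output formats are modeled.

```python
def overlaps(a, b):
    """True if two (chrom, start, end) half-open intervals share at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]


def concordance(calls_a, calls_b):
    """Fraction of caller A's regions overlapped by any region from caller B.

    This overlap-based definition is an assumption made for illustration.
    """
    if not calls_a:
        return 0.0
    hits = sum(1 for a in calls_a if any(overlaps(a, b) for b in calls_b))
    return hits / len(calls_a)


def union_regions(calls_a, calls_b):
    """Merge both call sets into non-overlapping regions (touching regions are joined)."""
    merged = []
    for chrom, start, end in sorted(calls_a + calls_b):
        if merged and merged[-1][0] == chrom and start <= merged[-1][2]:
            prev = merged[-1]
            merged[-1] = (chrom, prev[1], max(prev[2], end))
        else:
            merged.append((chrom, start, end))
    return merged


if __name__ == "__main__":
    caller_a = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 50, 80)]
    caller_b = [("chr1", 150, 250), ("chr2", 300, 400)]
    print(concordance(caller_a, caller_b))
    print(union_regions(caller_a, caller_b))
```

Concordance defined this way is asymmetric, so it is worth reporting in both directions. In practice, duplications and deletions would be handled as separate call sets.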
Expression profiling of formalin-fixed paraffin-embedded primary breast tumors using cancer-specific and whole genome gene panels on the DASL® platform
BACKGROUND: The cDNA-mediated Annealing, extension, Selection and Ligation (DASL) assay has become a suitable gene expression profiling system for degraded RNA from paraffin-embedded tissue. We examined assay characteristics and the performance of the DASL 502-gene Cancer Panel v1 (1.5K) and 24,526-gene panel (24K) platforms at differentiating nine human epidermal growth factor receptor 2-positive (HER2+) and 11 HER2-negative (HER2-) paraffin-embedded breast tumors. METHODS: Bland-Altman plots and Spearman correlations evaluated intra/inter-panel agreement of normalized expression values. Unequal-variance t-statistics tested for differences in expression levels between HER2+ and HER2- tumors. Regulatory network analysis was performed using Metacore (GeneGo Inc., St. Joseph, MI). RESULTS: Technical replicate correlations ranged between 0.815-0.956 and 0.986-0.997 for the 1.5K and 24K panels, respectively. Inter-panel correlations of expression values for the common 498 genes across the two panels ranged between 0.485-0.573. Inter-panel correlations of expression values of 17 probes with base-pair sequence matches between the 1.5K and 24K panels ranged between 0.652-0.899. In both panels, erythroblastic leukemia viral oncogene homolog 2 (ERBB2) was the most differentially expressed gene between the HER2+ and HER2- tumors, and seven additional genes had p-values < 0.05 and absolute log2-fold changes > 0.5 in expression between HER2+ and HER2- tumors: topoisomerase II alpha (TOP2A), cyclin A2 (CCNA2), v-fos FBJ murine osteosarcoma viral oncogene homolog (FOS), wingless-type MMTV integration site family, member 5A (WNT5A), growth factor receptor-bound protein 7 (GRB7), cell division cycle 2 (CDC2), and baculoviral IAP repeat-containing protein 5 (BIRC5). The top 52 discriminating probes from the 24K panel are enriched with genes belonging to the regulatory networks centered around v-myc avian myelocytomatosis viral oncogene homolog (MYC), tumor protein p53 (TP53), and estrogen receptor α (ESR1). Network analysis with a two-step extension also showed that the eight discriminating genes common to the 1.5K and 24K panels are functionally linked together through MYC, TP53, and ESR1. CONCLUSIONS: The relative RNA abundances obtained from two gene panels of highly differing density are correlated, with eight common genes differentiating HER2+ and HER2- breast tumors. Network analyses demonstrated biological consistency between the 1.5K and 24K gene panels.
Integrated Analysis of Gene Expression, CpG Island Methylation, and Gene Copy Number in Breast Cancer Cells by Deep Sequencing
We used deep sequencing technology to profile the transcriptome, gene copy number, and CpG island methylation status simultaneously in eight commonly used breast cell lines to develop a model for how these genomic features are integrated in estrogen receptor positive (ER+) and negative (ER−) breast cancer. Total mRNA sequencing, gene copy number analysis, and genomic CpG island methylation profiling were carried out using the Illumina Genome Analyzer. Sequences were mapped to the human genome to obtain digitized gene expression data, DNA copy number in reference to the non-tumor cell line (MCF10A), and methylation status of 21,570 CpG islands to identify differentially expressed genes that were correlated with methylation or copy number changes. These were evaluated in a dataset from 129 primary breast tumors. Gene expression in cell lines was dominated by ER-associated genes. ER+ and ER− cell lines formed two distinct, stable clusters, and 1,873 genes were differentially expressed in the two groups. Part of chromosome 8 was deleted in all ER− cells and part of chromosome 17 amplified in all ER+ cells. These loci encoded 30 genes that were overexpressed in ER+ cells; 9 of these genes were overexpressed in ER+ tumors. We identified 149 differentially expressed genes that exhibited differential methylation of one or more CpG islands within 5 kb of the 5′ end of the gene and for which mRNA abundance was inversely correlated with CpG island methylation status. In primary tumors we identified 84 genes that appear to be robust components of the methylation signature that we identified in ER+ cell lines. Our analyses reveal a global pattern of differential CpG island methylation that contributes to the transcriptome landscape of ER+ and ER− breast cancer cells and tumors. The role of gene amplification/deletion appears to be more modest, although several potentially significant genes appear to be regulated by copy number aberrations
Influence of obesity-related risk factors in the aetiology of glioma
BACKGROUND: Obesity and related factors have been implicated as possible aetiological factors for the development of glioma in epidemiological observation studies. We used genetic markers in a Mendelian randomisation framework to examine whether obesity-related traits influence glioma risk. This methodology reduces bias from confounding and is not affected by reverse causation. METHODS: Genetic instruments were identified for 10 key obesity-related risk factors, and their association with glioma risk was evaluated using data from a genome-wide association study of 12,488 glioma patients and 18,169 controls. The estimated odds ratio of glioma associated with each of the genetically defined obesity-related traits was used to infer evidence for a causal relationship. RESULTS: No convincing association with glioma risk was seen for genetic instruments for body mass index, waist-to-hip ratio, lipids, type-2 diabetes, hyperglycaemia or insulin resistance. Similarly, we found no evidence to support a relationship between obesity-related traits and either subtype of glioma: glioblastoma (GBM) or non-GBM tumours. CONCLUSIONS: This study provides no evidence to implicate obesity-related factors as causes of glioma
Genome-wide association study identifies 25 known breast cancer susceptibility loci as risk factors for triple-negative breast cancer
Triple-negative (TN) breast cancer is an aggressive subtype of breast cancer associated with a unique set of epidemiologic and genetic risk factors. We conducted a two-stage genome-wide association study of TN breast cancer (stage 1: 1529 TN cases, 3399 controls; stage 2: 2148 cases, 1309 controls) to identify loci that influence TN breast cancer risk. Variants in the 19p13.1 and PTHLH loci showed genome-wide significant associations (P < 5 × 10⁻⁸) in stage 1 and 2 combined. Results also suggested a substantial enrichment of significantly associated variants among the single nucleotide polymorphisms (SNPs) analyzed in stage 2. Variants from 25 of 74 known breast cancer susceptibility loci were also associated with risk of TN breast cancer (P < 0.05). Associations with TN breast cancer were confirmed for 10 loci (LGR6, MDM4, CASP8, 2q35, 2p24.1, TERT-rs10069690, ESR1, TOX3, 19p13.1, RALY), and we identified associations with TN breast cancer for 15 additional breast cancer loci (P < 0.05: PEX14, 2q24.1, 2q31.1, ADAM29, EBF1, TCF7L2, 11q13.1, 11q24.3, 12p13.1, PTHLH, NTN4, 12q24, BRCA2, RAD51L1-rs2588809, MKL1). Further, two SNPs independent of previously reported signals in ESR1 [rs12525163 odds ratio (OR) = 1.15, P = 4.9 × 10⁻⁴] and 19p13.1 (rs1864112 OR = 0.84, P = 1.8 × 10⁻⁹) were associated with TN breast cancer. A polygenic risk score (PRS) for TN breast cancer based on known breast cancer risk variants showed a 4-fold difference in risk between the highest and lowest PRS quintiles (OR = 4.03, 95% confidence interval 3.46–4.70, P = 4.8 × 10⁻⁶⁹). This translates to an absolute risk for TN breast cancer ranging from 0.8% to 3.4%, suggesting that genetic variation may be used for TN breast cancer risk prediction
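A polygenic risk score of the kind mentioned in this abstract is commonly computed as a weighted sum of risk-allele counts, with per-allele log odds ratios as weights. The sketch below assumes that convention. The SNP identifiers and odds ratios are hypothetical placeholders, and the abstract does not specify how the authors built or binned their score.

```python
import math


def polygenic_risk_score(dosages, odds_ratios):
    """Sum over SNPs of log(per-allele OR) times allele count (0, 1 or 2).

    SNPs missing from `dosages` contribute zero, a simplification for illustration.
    """
    return sum(math.log(odds_ratios[snp]) * dosages.get(snp, 0) for snp in odds_ratios)


def quintiles(scores):
    """Assign each score a quintile from 1 (lowest) to 5 (highest) by rank."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    labels = [0] * len(scores)
    for rank, i in enumerate(order):
        labels[i] = rank * 5 // len(scores) + 1
    return labels


if __name__ == "__main__":
    # Hypothetical per-allele odds ratios; not taken from any real variant list.
    weights = {"snp_a": 1.15, "snp_b": 0.84}
    people = [
        {"snp_a": 2, "snp_b": 0},
        {"snp_a": 0, "snp_b": 2},
        {"snp_a": 1, "snp_b": 1},
    ]
    scores = [polygenic_risk_score(p, weights) for p in people]
    print(scores)
```

Comparing case and control frequencies between the top and bottom quintiles of such a score is what yields an odds ratio like the one reported above.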
The influence of obesity-related factors in the etiology of renal cell carcinoma: A Mendelian randomization study
BACKGROUND: Several obesity-related factors have been associated with renal cell carcinoma (RCC), but it is unclear which individual factors directly influence risk. We addressed this question using genetic markers as proxies for putative risk factors and evaluated their relation to RCC risk in a Mendelian randomization (MR) framework. This methodology limits bias due to confounding and is not affected by reverse causation. METHODS AND FINDINGS: Genetic markers associated with obesity measures, blood pressure, lipids, type 2 diabetes, insulin, and glucose were initially identified as instrumental variables, and their association with RCC risk was subsequently evaluated in a genome-wide association study (GWAS) of 10,784 RCC patients and 20,406 control participants in a 2-sample MR framework. The effect on RCC risk was estimated by calculating odds ratios (OR_SD) for a standard deviation (SD) increment in each risk factor. The MR analysis indicated that higher body mass index increases the risk of RCC (OR_SD: 1.56, 95% confidence interval [CI] 1.44-1.70), with comparable results for waist-to-hip ratio (OR_SD: 1.63, 95% CI 1.40-1.90) and body fat percentage (OR_SD: 1.66, 95% CI 1.44-1.90). This analysis further indicated that higher fasting insulin (OR_SD: 1.82, 95% CI 1.30-2.55) and diastolic blood pressure (DBP; OR_SD: 1.28, 95% CI 1.11-1.47), but not systolic blood pressure (OR_SD: 0.98, 95% CI 0.84-1.14), increase the risk for RCC. No association with RCC risk was seen for lipids, overall type 2 diabetes, or fasting glucose. CONCLUSIONS: This study provides novel evidence for an etiological role of insulin in RCC, as well as confirmatory evidence that obesity and DBP influence RCC risk
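The 2-sample MR estimate of an odds ratio per SD increment can be illustrated with the inverse-variance weighted (IVW) estimator, one common choice. The abstract does not state which estimator the authors used, so IVW is an assumption here, and the instrument values below are made up for illustration.

```python
import math


def ivw_estimate(instruments):
    """Inverse-variance weighted causal estimate from summary statistics.

    instruments: list of (beta_exposure, beta_outcome, se_outcome), where
    beta_exposure is in SD units of the risk factor and beta_outcome is a
    log odds ratio for the disease.
    """
    num = sum(bx * by / se ** 2 for bx, by, se in instruments)
    den = sum(bx ** 2 / se ** 2 for bx, by, se in instruments)
    return num / den, math.sqrt(1.0 / den)


def odds_ratio_with_ci(beta, se, z=1.96):
    """Convert a log odds ratio and its standard error to an OR with a 95% CI."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


if __name__ == "__main__":
    # Hypothetical instruments: (SNP effect on exposure, SNP effect on outcome, SE).
    snps = [(0.10, 0.05, 0.01), (0.20, 0.10, 0.02), (0.15, 0.06, 0.015)]
    beta, se = ivw_estimate(snps)
    or_sd, lower, upper = odds_ratio_with_ci(beta, se)
    print(f"OR per SD: {or_sd:.2f} (95% CI {lower:.2f}-{upper:.2f})")
```

IVW assumes that every instrument is valid and affects the outcome only through the exposure. Sensitivity analyses that relax this assumption are usually reported alongside it.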