76 research outputs found
Optimization of the BLASTN substitution matrix for prediction of non-specific DNA microarray hybridization
DNA microarray measurements are susceptible to error caused by non-specific hybridization between a probe and a target (cross-hybridization), or between two targets (bulk-hybridization). Search algorithms such as BLASTN can quickly identify potentially hybridizing sequences. We set out to improve BLASTN accuracy by modifying the substitution matrix and gap penalties. We generated gene expression microarray data for samples in which 1 or 10% of the target mass was an exogenous spike of known sequence. We found that the 10% spike induced 2-fold intensity changes in 3% of the probes, two-third of which were decreases in intensity likely caused by bulk-hybridization. These changes were correlated with similarity between the spike and probe sequences. Interestingly, even very weak similarities tended to induce a change in probe intensity with the 10% spike. Using this data, we optimized the BLASTN substitution matrix to more accurately identify probes susceptible to non-specific hybridization with the spike. Relative to the default substitution matrix, the optimized matrix features a decreased score for AâT base pairs relative to GâC base pairs, resulting in a 5â15% increase in area under the ROC curve for identifying affected probes. This optimized matrix may be useful in the design of microarray probes, and in other BLASTN-based searches for hybridization partners
Jetset: selecting the optimal microarray probe set to represent a gene
<p>Abstract</p> <p>Background</p> <p>Interpretation of gene expression microarrays requires a mapping from probe set to gene. On many Affymetrix gene expression microarrays, a given gene may be detected by multiple probe sets, which may deliver inconsistent or even contradictory measurements. Therefore, obtaining an unambiguous expression estimate of a pre-specified gene can be a nontrivial but essential task.</p> <p>Results</p> <p>We developed scoring methods to assess each probe set for specificity, splice isoform coverage, and robustness against transcript degradation. We used these scores to select a single representative probe set for each gene, thus creating a simple one-to-one mapping between gene and probe set. To test this method, we evaluated concordance between protein measurements and gene expression values, and between sets of genes whose expression is known to be correlated. For both test cases, we identified genes that were nominally detected by multiple probe sets, and we found that the probe set chosen by our method showed stronger concordance.</p> <p>Conclusions</p> <p>This method provides a simple, unambiguous mapping to allow assessment of the expression levels of specific genes of interest.</p
Evaluation of Microarray Preprocessing Algorithms Based on Concordance with RT-PCR in Clinical Samples
BACKGROUND
Several preprocessing algorithms for Affymetrix gene expression microarrays have been developed, and their performance on spike-in data sets has been evaluated previously. However, a comprehensive comparison of preprocessing algorithms on samples taken under research conditions has not been performed.
METHODOLOGY/PRINCIPAL FINDINGS
We used TaqMan RT-PCR arrays as a reference to evaluate the accuracy of expression values from Affymetrix microarrays in two experimental data sets: one comprising 84 genes in 36 colon biopsies, and the other comprising 75 genes in 29 cancer cell lines. We evaluated consistency using the Pearson correlation between measurements obtained on the two platforms. Also, we introduce the log-ratio discrepancy as a more relevant measure of discordance between gene expression platforms. Of nine preprocessing algorithms tested, PLIER+16 produced expression values that were most consistent with RT-PCR measurements, although the difference in performance between most of the algorithms was not statistically significant.
CONCLUSIONS/SIGNIFICANCE
Our results support the choice of PLIER+16 for the preprocessing of clinical Affymetrix microarray data. However, other algorithms performed similarly and are probably also good choices
Recommended from our members
Redefinition of Affymetrix probe sets by sequence overlap with cDNA microarray probes reduces cross-platform inconsistencies in cancer-associated gene expression measurements
BACKGROUND: Comparison of data produced on different microarray platforms often shows surprising discordance. It is not clear whether this discrepancy is caused by noisy data or by improper probe matching between platforms. We investigated whether the significant level of inconsistency between results produced by alternative gene expression microarray platforms could be reduced by stringent sequence matching of microarray probes. We mapped the short oligo probes of the Affymetrix platform onto cDNA clones of the Stanford microarray platform. Affymetrix probes were reassigned to redefined probe sets if they mapped to the same cDNA clone sequence, regardless of the original manufacturer-defined grouping. The NCI-60 gene expression profiles produced by Affymetrix HuFL platform were recalculated using these redefined probe sets and compared to previously published cDNA measurements of the same panel of RNA samples. RESULTS: The redefined probe sets displayed a substantially higher level of cross-platform consistency at the level of gene correlation, cell line correlation and unsupervised hierarchical clustering. The same strategy allowed an almost complete correspondence of breast cancer subtype classification between Affymetrix gene chip and cDNA microarray derived gene expression data, and gave an increased level of similarity between normal lung derived gene expression profiles using the two technologies. In total, two Affymetrix gene-chip platforms were remapped to three cDNA platforms in the various cross-platform analyses, resulting in improved concordance in each case. CONCLUSION: We have shown that probes which target overlapping transcript sequence regions on cDNA microarrays and Affymetrix gene-chips exhibit a greater level of concordance than the corresponding Unigene or sequence matched features. This method will be useful for the integrated analysis of gene expression data generated by multiple disparate measurement platforms
Biasogram: visualization of confounding technical bias in gene expression data.
Gene expression profiles of clinical cohorts can be used to identify genes that are correlated with a clinical variable of interest such as patient outcome or response to a particular drug. However, expression measurements are susceptible to technical bias caused by variation in extraneous factors such as RNA quality and array hybridization conditions. If such technical bias is correlated with the clinical variable of interest, the likelihood of identifying false positive genes is increased. Here we describe a method to visualize an expression matrix as a projection of all genes onto a plane defined by a clinical variable and a technical nuisance variable. The resulting plot indicates the extent to which each gene is correlated with the clinical variable or the technical variable. We demonstrate this method by applying it to three clinical trial microarray data sets, one of which identified genes that may have been driven by a confounding technical variable. This approach can be used as a quality control step to identify data sets that are likely to yield false positive results
Tumor Mutation Burden Forecasts Outcome in Ovarian Cancer with BRCA1 or BRCA2 Mutations
Background: Increased number of single nucleotide substitutions is seen in breast and ovarian cancer genomes carrying disease-associated mutations in BRCA1 or BRCA2. The significance of these genome-wide mutations is unknown. We hypothesize genome-wide mutation burden mirrors deficiencies in DNA repair and is associated with treatment outcome in ovarian cancer. Methods and Results: The total number of synonymous and non-synonymous exome mutations (Nmut), and the presence of germline or somatic mutation in BRCA1 or BRCA2 (mBRCA) were extracted from whole-exome sequences of high-grade serous ovarian cancers from The Cancer Genome Atlas (TCGA). Cox regression and Kaplan-Meier methods were used to correlate Nmut with chemotherapy response and outcome. Higher Nmut correlated with a better response to chemotherapy after surgery. In patients with mBRCA-associated cancer, low Nmut was associated with shorter progression-free survival (PFS) and overall survival (OS), independent of other prognostic factors in multivariate analysis. Patients with mBRCA-associated cancers and a high Nmut had remarkably favorable PFS and OS. The association with survival was similar in cancers with either BRCA1 or BRCA2 mutations. In cancers with wild-type BRCA, tumor Nmut was associated with treatment response in patients with no residual disease after surgery. Conclusions: Tumor Nmut was associated with treatment response and with both PFS and OS in patients with high-grade serous ovarian cancer carrying BRCA1 or BRCA2 mutations. In the TCGA cohort, low Nmut predicted resistance to chemotherapy, and for shorter PFS and OS, while high Nmut forecasts a remarkably favorable outcome in mBRCA-associated ovarian cancer. Our observations suggest that the total mutation burden coupled with BRCA1 or BRCA2 mutations in ovarian cancer is a genomic marker of prognosis and predictor of treatment response. This marker may reflect the degree of deficiency in BRCA-mediated pathways, or the extent of compensation for the deficiency by alternative mechanisms
Consistent metagenes from cancer expression profiles yield agent specific predictors of chemotherapy response
Genome scale expression profiling of human tumor samples is likely to yield improved cancer treatment decisions. However, identification of clinically predictive or prognostic classifiers can be challenging when a large number of genes are measured in a small number of tumors.Journal ArticleResearch Support, N.I.H. ExtramuralResearch Support, Non-U.S. Gov'tSCOPUS: ar.jinfo:eu-repo/semantics/publishe
Pan-cancer analysis of genomic scar signatures associated with homologous recombination deficiency suggests novel indications for existing cancer drugs
Background: Ovarian and triple-negative breast cancers with BRCA1 or BRCA2 loss are highly sensitive to treatment with PARP inhibitors and platinum-based cytotoxic agents and show an accumulation of genomic scars in the form of gross DNA copy number aberrations. Cancers without BRCA1 or BRCA2 loss but with accumulation of similar genomic scars also show increased sensitivity to platinum-based chemotherapy. Therefore, reliable biomarkers to identify DNA repair-deficient cancers prior to treatment may be useful for directing patients to platinum chemotherapy and possibly PARP inhibitors. Recently, three SNP array-based signatures of chromosomal instability were published that each quantitate a distinct type of genomic scar considered likely to be caused by improper DNA repair. They measure telomeric allelic imbalance (named NtAI), large scale transition (named LST), and loss of heterozygosity (named HRD-LOH), and it is suggested that these signatures may act as biomarkers for the state of DNA repair deficiency in a given cancer. Results: We explored the pan-cancer distribution of scores of the three signatures utilizing a panel of 5371 tumors representing 15 cancer types from The Cancer Genome Atlas, and found a good correlation between scores of the three signatures (Spearmanâs Ï 0.73â0.87). In addition we found that cancer types ordinarily receiving platinum as standard of care have higher median scores of all three signatures. Interestingly, we also found that smaller subpopulations of high-scoring tumors exist in most cancer types, including those for which platinum chemotherapy is not standard therapy. Conclusions: Within several cancer types that are not ordinarily treated with platinum chemotherapy, we identified tumors with high levels of the three genomic biomarkers. These tumors represent identifiable subtypes of patients which may be strong candidates for clinical trials with PARP inhibitors or platinum-based chemotherapeutic regimens. Electronic supplementary material The online version of this article (doi:10.1186/s40364-015-0033-4) contains supplementary material, which is available to authorized users
Weighted composition operators on Korenblum type spaces of analytic functions
[EN] We investigate the continuity, compactness and invertibility of weighted composition operators W-psi,W-phi: f -> psi(f circle phi) when they act on the classical Korenblum space A(-infinity) and other related Frechet or (LB)-spaces of analytic functions on the open unit disc which are defined as intersections or unions of weighted Banach spaces with sup-norms. Some results about the spectrum of these operators are presented in case the self-map phi has a fixed point in the unit disc. A precise description of the spectrum is obtained in this case when the operator acts on the Korenblum space.This research was partially supported by the research project MTM2016-76647-P and the grant BES-2017-081200.Gomez-Orts, E. (2020). Weighted composition operators on Korenblum type spaces of analytic functions. Revista de la Real Academia de Ciencias Exactas FĂsicas y Naturales Serie A MatemĂĄticas. 114(4):1-15. https://doi.org/10.1007/s13398-020-00924-1S1151144Abramovich, Y.A., Aliprantis, C.D.: An invitation to operator theory. Graduate Studies in Mathematics. Amer. Math. Soc., 50 (2002)Albanese, A.A., Bonet, J., Ricker, W.J.: The CesĂ ro operator in the FrĂ©chet spaces and . Glasgow Math. J. 59, 273â287 (2017)Albanese, A.A., Bonet, J., Ricker, W.J.: The CesĂ ro operator on Korenblum type spaces of analytic functions. Collect. Math. 69(2), 263â281 (2018)Albanese, A.A., Bonet, J., Ricker, W.J.: Operators on the FrĂ©chet sequence spaces . Rev. R. Acad. Cienc. Exactas FĂs. Nat. Ser. A Mat. RACSAM 113(2), 1533â1556 (2019)Albanese, A.A., Bonet, J., Ricker, W.J.: Linear operators on the (LB)-sequence spaces . Descriptive topology and functional analysis. II, 43â67, Springer Proc. Math. Stat., 286, Springer, Cham (2019)Arendt, W., Chalendar, I., Kumar, M., Srivastava, S.: Powers of composition operators: asymptotic behaviour on Bergman, Dirichlet and Bloch spaces. J. Austral. Math. Soc. 1â32. https://doi.org/10.1017/S1446788719000235Aron, R., Lindström, M.: Spectra of weighted composition operators on weighted Banach spaces of analytic funcions. Israel J. Math. 141, 263â276 (2004)Bierstedt, K.D., Summers, W.H.: Biduals of weighted Banach spaces of analytic functions. J. Austral. Math. Soc., Ser. A, 54(1), 70â79 (1993)Bonet, J.: A note about the spectrum of composition operators induced by a rotation. RACSAM 114, 63 (2020). https://doi.org/10.1007/s13398-020-00788-5Bonet, J., DomaĆski, P., Lindström, M., Taskinen, J.: Composition operators between weighted Banach spaces of analytic functions. J. Austral. Math. Soc., Ser. A, 64(1), 101â118 (1998)Bourdon, P.S.: Essential angular derivatives and maximum growth of Königs eigenfunctions. J. Func. Anal. 160, 561â580 (1998)Bourdon, P.S.: Invertible weighted composition operators. Proc. Am. Math. Soc. 142(1), 289â299 (2014)Carleson, L., Gamelin, T.: Complex Dynamics. Springer, Berlin (1991)Cowen, C., MacCluer, B.: Composition Operators on Spaces of Analytic Functions. CRC Press, Boca Raton, FL (1995)Contreras, M., HernĂĄndez-DĂaz, A.G.: Weighted composition operators in weighted Banach spacs of analytic functions. J. Austral. Math. Soc., Ser. A 69, 41â60 (2000)Eklund, T., Galindo, P., Lindström, M.: Königs eigenfunction for composition operators on Bloch and spaces. J. Math. Anal. Appl. 445, 1300â1309 (2017)Hedenmalm, H., Korenblum, B., Zhu, K.: Theory of Bergman Spaces. Grad. Texts in Math. 199. Springer, New York (2000)Jarchow, H.: Locally Convex Spaces. Teubner, Stuttgart (1981)Kamowitz, H.: Compact operators of the form . Pac. J. Math. 80(1) (1979)Korenblum, B.: An extension of the Nevanlinna theory. Acta Math. 135, 187â219 (1975)Köthe, G.: Topological Vector Spaces II. Springer, New York Inc (1979)Lusky, W.: On the isomorphism classes of weighted spaces of harmonic and holomophic functions. Stud. Math. 75, 19â45 (2006)Meise, R., Vogt, D.: Introduction to functional analysis. Oxford Grad. Texts in Math. 2, New York, (1997)Montes-RodrĂguez, A.: Weighted composition operators on weighted Banach spaces of analytic functions. J. Lond. Math. Soc. 61(3), 872â884 (2000)QueffĂ©lec, H., QueffĂ©lec, M.: Diophantine Approximation and Dirichlet series. Hindustain Book Agency, New Delhi (2013)Shapiro, J.H.: Composition Operators and Classical Function Theory. Springer, New York (1993)Shields, A.L., Williams, D.L.: Bounded projections, duality and multipliers in spaces of analytic functions. Trans. Amer. Math. Soc. 162, 287â302 (1971)Zhu, K.: Operator Theory on Function Spaces, Math. Surveys and Monographs, Amer. Math. Soc. 138 (2007
Predictive biomarker discovery through the parallel integration of clinical trial and functional genomics datasets
The European Union multi-disciplinary Personalised RNA interference to Enhance the Delivery of Individualised Cytotoxic and Targeted therapeutics (PREDICT) consortium has recently initiated a framework to accelerate the development of predictive biomarkers of individual patient response to anti-cancer agents. The consortium focuses on the identification of reliable predictive biomarkers to approved agents with anti-angiogenic activity for which no reliable predictive biomarkers exist: sunitinib, a multi-targeted tyrosine kinase inhibitor and everolimus, a mammalian target of rapamycin (mTOR) pathway inhibitor. Through the analysis of tumor tissue derived from pre-operative renal cell carcinoma (RCC) clinical trials, the PREDICT consortium will use established and novel methods to integrate comprehensive tumor-derived genomic data with personalized tumor-derived small hairpin RNA and high-throughput small interfering RNA screens to identify and validate functionally important genomic or transcriptomic predictive biomarkers of individual drug response in patients. PREDICT's approach to predictive biomarker discovery differs from conventional associative learning approaches, which can be susceptible to the detection of chance associations that lead to overestimation of true clinical accuracy. These methods will identify molecular pathways important for survival and growth of RCC cells and particular targets suitable for therapeutic development. Importantly, our results may enable individualized treatment of RCC, reducing ineffective therapy in drug-resistant disease, leading to improved quality of life and higher cost efficiency, which in turn should broaden patient access to beneficial therapeutics, thereby enhancing clinical outcome and cancer survival. The consortium will also establish and consolidate a European network providing the technological and clinical platform for large-scale functional genomic biomarker discovery. Here we review our current understanding of molecular mechanisms driving resistance to anti-angiogenesis agents, the current limitations of laboratory and clinical trial strategies and how the PREDICT consortium will endeavor to identify a new generation of predictive biomarkers
- âŠ