36 research outputs found
Subtype prediction in pediatric acute myeloid leukemia: Classification using differential network rank conservation revisited
Background: One of the most important application spectrums of transcriptomic data is cancer phenotype classification. Many characteristics of transcriptomic data, such as redundant features and technical artifacts, make over-fitting commonplace. Promising classification results often fail to generalize across datasets with different sources, platforms, or preprocessing. Recently a novel differential network rank conservation (DIRAC) algorithm to characterize cancer phenotypes using transcriptomic data. DIRAC is a member of a family of algorithms that have shown useful for disease classification based on the relative expression of genes. Combining the robustness of this family's simple decision rules with known biological relationships, this systems approach identifies interpretable, yet highly discriminate networks. While DIRAC has been briefly employed for several classification problems in the original paper, the potentials of DIRAC in cancer phenotype classification, and especially robustness against artifacts in transcriptomic data have not been fully characterized yet. Results: In this study we thoroughly investigate the potentials of DIRAC by applying it to multiple datasets, and examine the variations in classification performances when datasets are (i) treated and untreated for batch effect; (ii) preprocessed with different techniques. We also propose the first DIRAC-based classifier to integrate multiple networks. We show that the DIRAC-based classifier is very robust in the examined scenarios. To our surprise, the trained DIRAC-based classifier even translated well to a dataset with different biological characteristics in the presence of substantial batch effects that, as shown here, plagued the standard expression value based classifier. In addition, the DIRAC-based classifier, because of the integrated biological information, also suggests pathways to target in specific subtypes, which may enhance the establishment of personalized therapy in diseases such as pediatric AML. In order to better comprehend the prediction power of the DIRAC-based classifier in general, we also performed classifications using publicly available datasets from breast and lung cancer. Furthermore, multiple well-known classification algorithms were utilized to create an ideal test bed for comparing the DIRAC-based classifier with the standard gene expression value based classifier. We observed that the DIRAC-based classifier greatly outperforms its rival. Conclusions: Based on our experiments with multiple datasets, we propose that DIRAC is a promising solution to the lack of generalizability in classification efforts that uses transcriptomic data. We believe that superior performances presented in this study may motivate other to initiate a new aline of research to explore the untapped power of DIRAC in a broad range of cancer types
Classification of pediatric acute myeloid leukemia based on miRNA expression profiles
Pediatric acute myeloid leukemia (AML) is a heterogeneous disease with respect to biology as well as outcome. In this study, we investigated whether known biological subgroups of pediatric AML are reflected by a common microRNA (miRNA) expression pattern. We assayed 665 miRNAs on 165 pediatric AML samples. First, unsupervised clustering was performed to identify patient clusters with common miRNA expression profiles. Our analysis unraveled 14 clusters, seven of which had a known (cyto-)genetic denominator. Finally, a robust classifier was constructed to discriminate six molecular aberration groups: 11q23-rearrangements, t(8;21)(q22;q22), inv(16)(p13q22), t(15;17) (q21;q22), NPM1 and CEBPA mutations. The classifier achieved accuracies of 89%, 95%, 95%, 98%, 91% and 96%, respectively. Although lower sensitivities were obtained for the NPM1 and CEBPA (32% and 66%), relatively high sensitivities (84%-94%) were attained for the rest. Specificity was high in all groups (87%-100%). Due to a robust double-loop cross validation procedure employed, the classifier only employed 47 miRNAs to achieve the aforementioned accuracies. To validate the 47 miRNA signatures, we applied them to a publicly available adult AML dataset. Albeit partial overlap of the array platforms and molecular differences between pediatric and adult AML, the signatures performed reasonably well. This corroborates our claim that the identified miRNA signatures are not dominated by sample size bias in the pediatric AML dataset. In conclusion, cytogenetic subtypes of pediatric AML have distinct miRNA expression patterns. Reproducibility of the miRNA signatures in adult dataset suggests that the respective aberrations have a similar biology both in pediatric and adult AML
MicroRNA-106b~25 cluster is upregulated in relapsed MLL-rearranged pediatric acute myeloid leukemia
The most important reason for therapy failure in pediatric acute myeloid leukemia (AML) is relapse. In order to identify miRNAs that contribute to the clonal evolution towards relapse in pediatric AML, miRNA expression profiling of 127 de novo pediatric AML cases were used. In the diagnostic phase, no miRNA signatures could be identified that were predictive for relapse occurrence, in a large pediatric cohort, nor in a nested mixed lineage leukemia (MLL)-rearranged pediatric cohort. AML with MLL- rearrangements are found in 15-20% of all pediatric AML samples, and reveal a relapse rate up to 50% for certain translocation partner subgroups. Therefore, microRNA expression profiling of six paired initial diagnosis-relapse MLL-rearranged pediatric AML samples (test cohort) and additional eight paired initial diagnosisrelapse samples with MLL-rearrangements (validation cohort) was performed. A list of 53 differentially expressed miRNAs was identified of which the miR-106b~25 cluster, located in intron 13 of MCM7, was the most prominent. These differentially expressed miRNAs however could not predict a relapse in de novo AML samples with MLLrearrangements at diagnosis. Furthermore, higher mRNA expression of both MCM7 and its upstream regulator E2F1 was found in relapse samples with MLL-rearrangements. In conclusion, we identified the miR-106b~25 cluster to be upregulated in relapse pediatric AML with MLL-rearrangements
Stepwise classification of cancer samples using clinical and molecular data
<p>Abstract</p> <p>Background</p> <p>Combining clinical and molecular data types may potentially improve prediction accuracy of a classifier. However, currently there is a shortage of effective and efficient statistical and bioinformatic tools for true integrative data analysis. Existing integrative classifiers have two main disadvantages: First, coarse combination may lead to subtle contributions of one data type to be overshadowed by more obvious contributions of the other. Second, the need to measure both data types for all patients may be both unpractical and (cost) inefficient.</p> <p>Results</p> <p>We introduce a novel classification method, a stepwise classifier, which takes advantage of the distinct classification power of clinical data and high-dimensional molecular data. We apply classification algorithms to two data types independently, starting with the traditional clinical risk factors. We only turn to relatively expensive molecular data when the uncertainty of prediction result from clinical data exceeds a predefined limit. Experimental results show that our approach is adaptive: the proportion of samples that needs to be re-classified using molecular data depends on how much we expect the predictive accuracy to increase when re-classifying those samples.</p> <p>Conclusions</p> <p>Our method renders a more cost-efficient classifier that is at least as good, and sometimes better, than one based on clinical or molecular data alone. Hence our approach is not just a classifier that minimizes a particular loss function. Instead, it aims to be cost-efficient by avoiding molecular tests for a potentially large subgroup of individuals; moreover, for these individuals a test result would be quickly available, which may lead to reduced waiting times (for diagnosis) and hence lower the patients distress. Stepwise classification is implemented in R-package <it>stepwiseCM </it>and available at the Bioconductor website.</p
Endogenous tumor suppressor microRNA-193b: Therapeutic and prognostic value in acute myeloid leukemia
Purpose Dysregulated microRNAs are implicated in the pathogenesis and aggressiveness of acute myeloid leukemia (AML). We describe the effect of the hematopoietic stem-cell self-renewal regulating miR-193b on progression and prognosis of AML. Methods We profiled miR-193b-5p/3p expression in cytogenetically and clinically characterized de novo pediatric AML (n = 161) via quantitative real-time polymerase chain reaction and validated our findings in an independent cohort of 187 adult patients. We investigated the tumor suppressive function of miR-193b in human AML blasts, patient-derived xenografts, and miR-193b knockout mice in vitro and in vivo. Results miR-193b exerted important, endogenous, tumor-suppressive functions on the hematopoietic system. miR-193b-3p was downregulated in several cytogenetically defined subgroups of pediatric and adult AML, and low expression served as an independent indicator for poor prognosis in pediatric AML (risk ratio 6 standard error, 20.56 6 0.23; P = .016). miR-193b-3p expression improved the prognostic value of the European LeukemiaNet risk-group stratification or a 17-gene leukemic stemness score. In knockout mice, loss of miR-193b cooperated with Hoxa9/Meis1 during leukemogenesis, whereas restoring miR-193b expression impaired leukemic engraftment. Similarly, expression of miR-193b in AML blasts from patients diminished leukemic growth in vitro and in mouse xenografts. Mechanistically, miR-193b induced apoptosis and a G1/S-phase block in various human AML subgroups by targeting multiple factors of the KIT-RAS-RAF-MEK-ERK (MAPK) signaling cascade and the downstream cell cycle regulator CCND1. Conclusion The tumor-suppressive function is independent of patient age or genetics; therefore, restoring miR-193b would assure high antileukemic efficacy by blocking the entire MAPK signaling cascade while preventing the emergence of resistance mechanisms