Skip to main content
Article thumbnail
Location of Repository

New ensemble machine learning method for classification and prediction on gene expression data

By Ching-Wei Wang


–A reliable and precise classification of tumours is\ud essential for successful treatment of cancer. Recent researches have confirmed the utility of ensemble machine learning algorithms for gene expression data analysis. In this paper, a new ensemble machine learning algorithm is proposed for classification and prediction on gene expression data. The algorithm is tested and compared with three popular adopted ensembles, i.e. bagging, boosting and arcing. The results show that the proposed algorithm greatly outperforms existing methods, achieving high accuracy over 12 gene expression datasets

Topics: G400 Computer Science
Publisher: Institute of Electrical and Electronics Engineers, Inc
Year: 2006
OAI identifier:

Suggested articles


  1. [1996b] Bagging predictors,
  2. (1998). Arcing Classifiers,
  3. (2004). BagBoosting for tumor classification with gene expression data,
  4. (1996). Bagging, boosting, and C4.5.
  5. (1999). Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays.
  6. (2002). Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling.
  7. (2005). Data Mining: Practical machine learning tools and techniques, 2nd Edition,
  8. (2000). Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling.
  9. (2003). Ensemble Machine Learning on Gene Expression Data for Cancer Classification, Applied BioInformatics,
  10. et al [2002] Prediction of central nervous system embryonal tumour outcome based on gene expression.
  11. et al [2003] Gene expression-based classification of malignant gliomas correlates better with survival than histological classification.
  12. (1996). Experiments with a new boosting algorithm,
  13. Gene expression correlates of clinical prostate cancer behavior.
  14. (2002). Gene expression profiling predicts clinical outcome of breast cancer.
  15. (2002). Genome-wide cDNA microarray screening to correlate gene expression profiles with sensitivity of 85 human cancer xenografts to anticancer drugs.
  16. (2002). MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia.
  17. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.
  18. (1997). Note on free lunches and cross-validation,
  19. (1990). The strength of weak learnability,
  20. (2002). Translation of Microarray Data into Clinically Relevant Cancer Diagnostic Tests Using Gene Expression Ratios in Lung Cancer and Mesothelioma.

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.