Search CORE

28 research outputs found

Evaluating Microarray-based Classifiers: An Overview

Author: Augustin T.
Boulesteix A.-L.
Daumer M.
Strobl C.
Publication venue: Libertas Academica
Publication date: 01/01/2007
Field of study

For the last eight years, microarray-based class prediction has been the subject of numerous publications in medicine, bioinformatics and statistics journals. However, in many articles, the assessment of classification accuracy is carried out using suboptimal procedures and is not paid much attention. In this paper, we carefully review various statistical aspects of classifier evaluation and validation from a practical point of view. The main topics addressed are accuracy measures, error rate estimation procedures, variable selection, choice of classifiers and validation strategy

Crossref

Directory of Open Access Journals

PubMed Central

Open Access LMU

Text Categorization and Machine Learning Methods: Current State Of The Art

Author: Dr. Venu Gopala Rao. K
Durga Bhavani Dasari
Publication venue: Global Journals Inc. (US)
Publication date: 15/01/2012
Field of study

In this informative age, we find many documents are available in digital forms which need classification of the text. For solving this major problem present researchers focused on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of pre classified documents, the characteristics of the categories. The main benefit of the present approach is consisting in the manual definition of a classifier by domain experts where effectiveness, less use of expert work and straightforward portability to different domains are possible. The paper examines the main approaches to text categorization comparing the machine learning paradigm and present state of the art. Various issues pertaining to three different text similarity problems, namely, semantic, conceptual and contextual are also discussed

Global Journal of Computer Science and Technology (GJCST)

Designing multiple classifier combinations a survey

Author: Husin Abdullah
Ku-Mahamud Ku Ruhana
Publication venue: Little Lion Scientific
Publication date: 01/01/2019
Field of study

Classification accuracy can be improved through multiple classifier approach. It has been proven that multiple classifier combinations can successfully obtain better classification accuracy than using a single classifier. There are two main problems in designing a multiple classifier combination which are determining the classifier ensemble and combiner construction. This paper reviews approaches in constructing the classifier ensemble and combiner. For each approach, methods have been reviewed and their advantages and disadvantages have been highlighted. A random strategy and majority voting are the most commonly used to construct the ensemble and combiner, respectively. The results presented in this review are expected to be a road map in designing multiple classifier combinations

UUM Repository

A novel dynamic rough subspace based selective ensemble

Author: Guo Yuwei
Jiao Licheng
Liu Fang
Rong Kaixuan
Wang Shuang
Wang Shuo
Xiong Tao
Publication venue: 'Elsevier BV'
Publication date: 01/05/2015
Field of study

Crossref

University of Birmingham Research Portal

Advances in Data Mining Knowledge Discovery and Applications

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Advances in Data Mining Knowledge Discovery and Applications aims to help data miners, researchers, scholars, and PhD students who wish to apply data mining techniques. The primary contribution of this book is highlighting frontier fields and implementations of the knowledge discovery and data mining. It seems to be same things are repeated again. But in general, same approach and techniques may help us in different fields and expertise areas. This book presents knowledge discovery and data mining applications in two different sections. As known that, data mining covers areas of statistics, machine learning, data management and databases, pattern recognition, artificial intelligence, and other areas. In this book, most of the areas are covered with different data mining applications. The eighteen chapters have been classified in two parts: Knowledge Discovery and Data Mining Applications

Directory of Open Access Books (DOAB)

Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography

Author: Abeel
Adem
Ahn
Ali
Altincay
Anand
Arbel
Archer
Averbuch
Banfield
Bao
Bartlett
Bauer
Bay
Bennett
Brazdil
Breiman
Breiman
Breiman
Brodley
Brown
Brown
Bruzzone
Bryll
Buntine
Buttrey
Chan
Chawla
Christensen
Christmann
Clark
Cohen
Croux
Cunningham
Dasarathy
Denison
Derbeko
Dietterich
Dietterich
Dietterich
Dimitrakakis
Domingos
Drucker
Džeroski
Elovici
Elovici
Frank
Friedman
Friedman
Friedman
Gama
Gams
Gey
Gunter
Hansen
Ho
Ho
Ho
Ho
Hothorn
Hu
Hu
Huang
Islam
Jacobs
Jordan
Kamel
Kang
Kim
Kolen
Krogh
Kuncheva
Kuncheva
Kuncheva
Kusiak
Lam
Langdon
Leigh
Li
Liao
Lin
Lin
Lior Rokach
Liu
Liu
Lu
Maimon
Maimon
Maimon
Mangiameli
Menahem
Merkwirth
Merler
Merz
Michalski
Mitchell
Moskovitch
Nowlan
Opitz
Opitz
Opitz
Parmanto
Partridge
Phama
Polikar
Ridgeway
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rokach
Rosen
Rudin
Schaffer
Schapire
Schclar
Seewald
Sexton
Sharkey
Sharkey
Sharkey
Sharkey
Shilen
Sivalingam
Skurichina
Sohna
Sun
Tan
Tao
Tao
Towell
Tsao
Tsymbal
Tsymbal
Tukey
Tumer
Tumer
Tumer
Valentini
Vilalta
Wanas
Wang
Webb
Webb
Windeatt
Wolpert
Woods
Wu
Xu
Yates
Zhang
Zhang
Zhou
Zhou
Zhou
Zhoua
Zupan
Publication venue: 'Elsevier BV'
Publication date
Field of study

Crossref

Recent Trends in Computational Intelligence

Author
Publication venue: 'IntechOpen'
Publication date: 20/04/2021
Field of study

Traditional models struggle to cope with complexity, noise, and the existence of a changing environment, while Computational Intelligence (CI) offers solutions to complicated problems as well as reverse problems. The main feature of CI is adaptability, spanning the fields of machine learning and computational neuroscience. CI also comprises biologically-inspired technologies such as the intellect of swarm as part of evolutionary computation and encompassing wider areas such as image processing, data collection, and natural language processing. This book aims to discuss the usage of CI for optimal solving of various applications proving its wide reach and relevance. Bounding of optimization methods and data mining strategies make a strong and reliable prediction tool for handling real-life applications

Directory of Open Access Books (DOAB)

Spare parts classification in industrial manufacturing using the dominance-based rough set approach

Author: Chakhar S
Hu Q
Labib A
Siraj S
Publication venue: 'Elsevier BV'
Publication date: 01/11/2017
Field of study

Classification is one of the critical issues in the operations management of spare parts. The issue of managing spare parts involves multiple criteria to be taken into consideration, and therefore, a number of approaches exists that consider criteria such as criticality, price, demand, lead time, and obsolescence, to name a few. In this paper, we first review proposals to deal with inventory control. We then propose a three-phase multicriteria classification framework for spare parts management using the dominance-based rough set approach (DRSA). In the first phase, a set of ‘if–then’ decision rules is generated from historical data using the DRSA. The generated rules are then validated in the second phase by using both the automated and manual approaches, including cross-validation and feedback assessments by the decision maker. The third and final phase is to classify an unseen set of spare parts in a real setting. The proposed approach has been successfully applied to data collected from a manufacturing company in China. The proposed framework was practically tested on different spare parts and, based on the feedback received from the industry experts, 96% of the spare parts were correctly classified. Furthermore, the cross-validation results show that the proposed approach significantly outperforms other well-known classification methods. The proposed approach has several important characteristics that distinguish it from existing ones: (i) it is a learning-set based analysis approach; (ii) it uses a powerful multicriteria classification method, namely the DRSA; (iii) it validates the generated decision rules with multiple strategies; and (iv) it actively involves the decision maker during all the steps of the decision making process

Crossref

Portsmouth University Research Portal (Pure)

White Rose Research Online