28 research outputs found

    Evaluating Microarray-based Classifiers: An Overview

    Get PDF
    For the last eight years, microarray-based class prediction has been the subject of numerous publications in medicine, bioinformatics and statistics journals. However, in many articles, the assessment of classification accuracy is carried out using suboptimal procedures and is not paid much attention. In this paper, we carefully review various statistical aspects of classifier evaluation and validation from a practical point of view. The main topics addressed are accuracy measures, error rate estimation procedures, variable selection, choice of classifiers and validation strategy

    Text Categorization and Machine Learning Methods: Current State Of The Art

    Get PDF
    In this informative age, we find many documents are available in digital forms which need classification of the text. For solving this major problem present researchers focused on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of pre classified documents, the characteristics of the categories. The main benefit of the present approach is consisting in the manual definition of a classifier by domain experts where effectiveness, less use of expert work and straightforward portability to different domains are possible. The paper examines the main approaches to text categorization comparing the machine learning paradigm and present state of the art. Various issues pertaining to three different text similarity problems, namely, semantic, conceptual and contextual are also discussed

    Designing multiple classifier combinations a survey

    Get PDF
    Classification accuracy can be improved through multiple classifier approach. It has been proven that multiple classifier combinations can successfully obtain better classification accuracy than using a single classifier. There are two main problems in designing a multiple classifier combination which are determining the classifier ensemble and combiner construction. This paper reviews approaches in constructing the classifier ensemble and combiner. For each approach, methods have been reviewed and their advantages and disadvantages have been highlighted. A random strategy and majority voting are the most commonly used to construct the ensemble and combiner, respectively. The results presented in this review are expected to be a road map in designing multiple classifier combinations

    Advances in Data Mining Knowledge Discovery and Applications

    Get PDF
    Advances in Data Mining Knowledge Discovery and Applications aims to help data miners, researchers, scholars, and PhD students who wish to apply data mining techniques. The primary contribution of this book is highlighting frontier fields and implementations of the knowledge discovery and data mining. It seems to be same things are repeated again. But in general, same approach and techniques may help us in different fields and expertise areas. This book presents knowledge discovery and data mining applications in two different sections. As known that, data mining covers areas of statistics, machine learning, data management and databases, pattern recognition, artificial intelligence, and other areas. In this book, most of the areas are covered with different data mining applications. The eighteen chapters have been classified in two parts: Knowledge Discovery and Data Mining Applications

    Recent Trends in Computational Intelligence

    Get PDF
    Traditional models struggle to cope with complexity, noise, and the existence of a changing environment, while Computational Intelligence (CI) offers solutions to complicated problems as well as reverse problems. The main feature of CI is adaptability, spanning the fields of machine learning and computational neuroscience. CI also comprises biologically-inspired technologies such as the intellect of swarm as part of evolutionary computation and encompassing wider areas such as image processing, data collection, and natural language processing. This book aims to discuss the usage of CI for optimal solving of various applications proving its wide reach and relevance. Bounding of optimization methods and data mining strategies make a strong and reliable prediction tool for handling real-life applications

    Spare parts classification in industrial manufacturing using the dominance-based rough set approach

    Get PDF
    Classification is one of the critical issues in the operations management of spare parts. The issue of managing spare parts involves multiple criteria to be taken into consideration, and therefore, a number of approaches exists that consider criteria such as criticality, price, demand, lead time, and obsolescence, to name a few. In this paper, we first review proposals to deal with inventory control. We then propose a three-phase multicriteria classification framework for spare parts management using the dominance-based rough set approach (DRSA). In the first phase, a set of ā€˜ifā€“thenā€™ decision rules is generated from historical data using the DRSA. The generated rules are then validated in the second phase by using both the automated and manual approaches, including cross-validation and feedback assessments by the decision maker. The third and final phase is to classify an unseen set of spare parts in a real setting. The proposed approach has been successfully applied to data collected from a manufacturing company in China. The proposed framework was practically tested on different spare parts and, based on the feedback received from the industry experts, 96% of the spare parts were correctly classified. Furthermore, the cross-validation results show that the proposed approach significantly outperforms other well-known classification methods. The proposed approach has several important characteristics that distinguish it from existing ones: (i) it is a learning-set based analysis approach; (ii) it uses a powerful multicriteria classification method, namely the DRSA; (iii) it validates the generated decision rules with multiple strategies; and (iv) it actively involves the decision maker during all the steps of the decision making process
    corecore