1,353 research outputs found

    Jumlah Transisi Pada Ciri Transisi Dalam Pengenalan Pola Tulisan Tangan Aksara Jawa Nglegeno Dengan Multiclass Support Vector Machines

    Get PDF
    Feature extraction is one of the most improtant step on characters recognition system. Transition features is one from many features used on characters recognition system. This paper report a research on handwritten basic Jawanesse characters recognition system to found the proper numbers of transitions used on transition features. To recognize the characters,the Multiclass Support Vector Machines were used. The Directed Acyclic Graph (DAG) SVM were used for multiclass classification strategy and to map each input vector to a higher dimention space, the Gaussian Radial Basis Function (RBF) kernel with parameter 1were used. It can be shown, for basicJawanesse characters recognition system, the optimal numbers of transitions used for transition features is 4 (a half of maximum numbers of transition on all patterns)

    Multiclass Cancer Classification by Using Fuzzy Support Vector Machine and Binary Decision Tree With Gene Selection

    Get PDF
    We investigate the problems of multiclass cancer classification with gene selection from gene expression data. Two different constructed multiclass classifiers with gene selection are proposed, which are fuzzy support vector machine (FSVM) with gene selection and binary classification tree based on SVM with gene selection. Using F test and recursive feature elimination based on SVM as gene selection methods, binary classification tree based on SVM with F test, binary classification tree based on SVM with recursive feature elimination based on SVM, and FSVM with recursive feature elimination based on SVM are tested in our experiments. To accelerate computation, preselecting the strongest genes is also used. The proposed techniques are applied to analyze breast cancer data, small round blue-cell tumors, and acute leukemia data. Compared to existing multiclass cancer classifiers and binary classification tree based on SVM with F test or binary classification tree based on SVM with recursive feature elimination based on SVM mentioned in this paper, FSVM based on recursive feature elimination based on SVM can find most important genes that affect certain types of cancer with high recognition accuracy

    Uncertainty Analysis for the Classification of Multispectral Satellite Images Using SVMs and SOMs

    Get PDF
    Abstract: Classification of multispectral remotely sensed data with textural features is investigated with a special focus on uncertainty analysis in the produced land-cover maps. Much effort has already been directed into the research of satisfactory accuracy-assessment techniques in image classification, but a common approach is not yet universally adopted. We look at the relationship between hard accuracy and the uncertainty on the produced answers, introducing two measures based on maximum probability and a quadratic entropy. Their impact differs depending on the type of classifier. In this paper, we deal with two different classification strategies, based on support vector machines (SVMs) and Kohonen's self-organizingmaps (SOMs), both suitably modified to give soft answers. Once the multiclass probability answer vector is available for each pixel in the image, we studied the behavior of the overall classification accuracy as a function of the uncertainty associated with each vector, given a hard-labeled test set. The experimental results show that the SVM with one-versus-one architecture and linear kernel clearly outperforms the other supervised approaches in terms of overall accuracy. On the other hand, our analysis reveals that the proposed SOM-based classifier, despite its unsupervised learning procedure, is able to provide soft answers which are the best candidates for a fusion with supervised results

    Network Intrusion Detection Using Multiclass Support Vector Machine

    Get PDF
    Intrusion detection is a topic of interest in current scenario. Statistical IDS overcomes many pitfalls present in signature based IDS. Statistical IDS uses models such as NB, C4.5 etc for classification to detect Intrusions. Multiclass Support Vector Machine is able to perform multiclass classification. This paper shows the performance of MSVM (1-versus-1, 1-versusmany and Error Correcting Output Coding (ECOC)) and it’s variants for statistical NBIDS. This paper explores the performance of MSVM for various categories of attack

    Instance-based concept learning from multiclass DNA microarray data

    Get PDF
    BACKGROUND: Various statistical and machine learning methods have been successfully applied to the classification of DNA microarray data. Simple instance-based classifiers such as nearest neighbor (NN) approaches perform remarkably well in comparison to more complex models, and are currently experiencing a renaissance in the analysis of data sets from biology and biotechnology. While binary classification of microarray data has been extensively investigated, studies involving multiclass data are rare. The question remains open whether there exists a significant difference in performance between NN approaches and more complex multiclass methods. Comparative studies in this field commonly assess different models based on their classification accuracy only; however, this approach lacks the rigor needed to draw reliable conclusions and is inadequate for testing the null hypothesis of equal performance. Comparing novel classification models to existing approaches requires focusing on the significance of differences in performance. RESULTS: We investigated the performance of instance-based classifiers, including a NN classifier able to assign a degree of class membership to each sample. This model alleviates a major problem of conventional instance-based learners, namely the lack of confidence values for predictions. The model translates the distances to the nearest neighbors into 'confidence scores'; the higher the confidence score, the closer is the considered instance to a pre-defined class. We applied the models to three real gene expression data sets and compared them with state-of-the-art methods for classifying microarray data of multiple classes, assessing performance using a statistical significance test that took into account the data resampling strategy. Simple NN classifiers performed as well as, or significantly better than, their more intricate competitors. CONCLUSION: Given its highly intuitive underlying principles – simplicity, ease-of-use, and robustness – the k-NN classifier complemented by a suitable distance-weighting regime constitutes an excellent alternative to more complex models for multiclass microarray data sets. Instance-based classifiers using weighted distances are not limited to microarray data sets, but are likely to perform competitively in classifications of high-dimensional biological data sets such as those generated by high-throughput mass spectrometry

    Image Classification in Remote Sensing

    Get PDF
    One of the most important functions of remote sensing data is the production of Land Use and Land Cover maps and thus can be managed through a process called image classification. This paper looks into the following components related to the image classification process and procedures and image classification techniques and explains two common techniques K-means Classifier and Support Vector Machine (SVM). Keywords: Remote Sensing, Image Classification, K-means Classifier, Support Vector Machin

    Fuzzy support vector machine for classification of time series data: A simulation study

    Get PDF
    Support vector machine (SVM) has become one of most developed methods for classification, focusing on cross-sectional analysis. However, classification of time series data is an important issue in statistics and data mining. Classification of time series data using SVMs that focus on cross-sectional data leads to improper classification, and hence, the SVM needs to be extended for handling time series dataset. As with cross-section data, the problem of imbalanced data is also common in time series data. Fuzzy method has been proven to be capable of overcoming the case of imbalanced data. In this paper, we developed a Fuzzy Support Vector Machine (FSVM) model to classify time series data with imbalanced class. The proposed method puts the fuzzy membership function on the constraint function. Through simulation studies, this research aims to assess the performance of the developed FSVM in classifying time series data. Based on the classification accuracy criteria, we prove that the proposed FSVM method outperforms the standard SVM method for the classification of multiclass time series data
    corecore