Search CORE

51 research outputs found

Feature weights of (a) and (b) attribute values with respect to indices and the distribution of attribute weights.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Feature weights of (a) and (b) attribute values with respect to indices and the distribution of attribute weights.</p

FigShare

A Consistency-Based Feature Selection Method Allied with Linear SVMs for HIV-1 Protease Cleavage Site Prediction

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date: 23/08/2013
Field of study

<div>BackgroundPredicting type-1 Human Immunodeficiency Virus (HIV-1) protease cleavage site in protein molecules and determining its specificity is an important task which has attracted considerable attention in the research community. Achievements in this area are expected to result in effective drug design (especially for HIV-1 protease inhibitors) against this life-threatening virus. However, some drawbacks (like the shortage of the available training data and the high dimensionality of the feature space) turn this task into a difficult classification problem. Thus, various machine learning techniques, and specifically several classification methods have been proposed in order to increase the accuracy of the classification model. In addition, for several classification problems, which are characterized by having few samples and many features, selecting the most relevant features is a major factor for increasing classification accuracy.ResultsWe propose for HIV-1 data a consistency-based feature selection approach in conjunction with recursive feature elimination of support vector machines (SVMs). We used various classifiers for evaluating the results obtained from the feature selection process. We further demonstrated the effectiveness of our proposed method by comparing it with a state-of-the-art feature selection method applied on HIV-1 data, and we evaluated the reported results based on attributes which have been selected from different combinations.ConclusionApplying feature selection on training data before realizing the classification task seems to be a reasonable data-mining process when working with types of data similar to HIV-1. On HIV-1 data, some feature selection or extraction operations in conjunction with different classifiers have been tested and noteworthy outcomes have been reported. These facts motivate for the work presented in this paper.Software availabilityThe software is available at <a href="http://ozyer.etu.edu.tr/c-fs-svm.rar" target="_blank">http://ozyer.etu.edu.tr/c-fs-svm.rar</a>.The software can be downloaded at <a href="http://esnag.etu.edu.tr/software/hiv_cleavage_site_prediction.rar" target="_blank">esnag.etu.edu.tr/software/hiv_cleavage_site_prediction.rar</a>; you will find a readme file which explains how to set the software in order to work.</div

Directory of Open Access Journals

PubMed Central

FigShare

Detailed System Overview.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Closer look at the various components of the proposed system architecture; orthonormal encoding is used to represent amino acids.</p

FigShare

Selected attributes according to the FS methods (- indicates no feature is specifically determined for the column).

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Values common to all methods have been italicized.</p

FigShare

Feature weights of (a) P3' and (b) and P4': attribute values with respect to indices and the distribution of attribute weights.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Feature weights of (a) P3' and (b) and P4': attribute values with respect to indices and the distribution of attribute weights.</p

FigShare

Standard Deviations of classification results for external cross validation with MLP and their average performance results for each metric.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Standard Deviations of classification results for external cross validation with MLP and their average performance results for each metric.</p

FigShare

Standard Deviations of classification results for external cross validation with SMO and their average performance results for each metric.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Standard Deviations of classification results for external cross validation with SMO and their average performance results for each metric.</p

FigShare

Feature weights of (a) P1' and (b) P2': attribute values with respect to indices and the distribution of attribute weights.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Feature weights of (a) P1' and (b) P2': attribute values with respect to indices and the distribution of attribute weights.</p

FigShare

Overall System Architecture.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

The input data is preprocessed then the preprocessed data may be directly classified or feature selection is applied to utilize in the classification only relevant features.</p

FigShare

Feature weights of (a) P4 and (b) and P3.

Author: Abdallah Elsheikh (449730)
Alper Aksaç (449729)
Orkun Öztürk (449728)
Reda Alhajj (449732)
Tansel Özyer (449731)
Publication venue
Publication date
Field of study

Feature weights of (a) P4 and (b) and P3.</p

FigShare

Feature weights of (a) and (b) attribute values with respect to indices and the distribution of attribute weights.

A Consistency-Based Feature Selection Method Allied with Linear SVMs for HIV-1 Protease Cleavage Site Prediction

Detailed System Overview.

Selected attributes according to the FS methods (- indicates no feature is specifically determined for the column).

Feature weights of (a) <i>P</i><sub>3</sub>' and (b) and <i>P</i><sub>4</sub>': attribute values with respect to indices and the distribution of attribute weights.

Standard Deviations of classification results for external cross validation with MLP and their average performance results for each metric.

Standard Deviations of classification results for external cross validation with SMO and their average performance results for each metric.

Feature weights of (a) <i>P</i><sub>1</sub>' and (b) <i>P</i><sub>2</sub>': attribute values with respect to indices and the distribution of attribute weights.

Overall System Architecture.

Feature weights of (a) <i>P</i><sub>4</sub> and (b) and <i>P</i><sub>3</sub>.