Search CORE

29 research outputs found

Peptide classification using optimal and information theoretic syntactic modeling

Author: Aygün Ezra
Cataltepe Z
Oommen B. John
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

We consider the problem of classifying peptides using the information residing in their syntactic representations. This problem, which has been studied for more than a decade, has typically been investigated using distance-based metrics that involve the edit operations required in the peptide comparisons. In this paper, we shall demonstrate that the Optimal and Information Theoretic (OIT) model of Oommen and Kashyap [22] applicable for syntactic pattern recognition can be used to tackle peptide classification problem. We advocate that one can model the differences between compared strings as a mutation model consisting of random substitutions, insertions and deletions obeying the OIT model. Thus, in this paper, we show that the probability measure obtained from the OIT model can be perceived as a sequence similarity metric, using which a support vector machine (SVM)-based peptide classifier can be devised. The classifier, which we have built has been tested for eight different substitution matrices and for two different data sets, namely, the HIV-1 Protease cleavage sites and the T-cell epitopes. The results show that the OIT model performs significantly better than the one which uses a Needleman-Wunsch sequence alignment score, it is less sensitive to the substitution matrix than the other methods compared, and that when combined with a SVM, is among the best peptide classification methods availabl

Crossref

NORA - Norwegian Open Research Archives

Agder University Research Archive

Fast insect damage detection in wheat kernels using transmittance images

Author: Cataltepe Z.
Cetin E.
Pearson T.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2004
Field of study

We used transmittance images and different learning algorithms to classify insect damaged and un-damaged wheat kernels. Using the histogram of the pixels of the wheat images as the feature, and the linear model as the learning algorithm, we achieved a False Positive Rate (1-specificity) of 0.12 at the True Positive Rate (sensitivity) of 0.8 and an Area Under the ROC Curve (AUC) of 0.90 ± 0.02. Combining the linear model and a Radial Basis Function Network in a committee resulted in a FP Rate of 0.09 at the TP Rate of 0.8 and an AUC of 0.93 ± 0.03

Bilkent University Institutional Repository

Preface

Author: Aksoy S.
Cataltepe Z.
Unay D.
Publication venue: Springer Verlag
Publication date: 01/01/2010
Field of study

[No abstract available

Bilkent University Institutional Repository

Identification of insect damaged wheat kernels using transmittance images

Author: Cataltepe Z.
Cetin E.
Pearson T.
Publication venue
Publication date: 01/01/2004
Field of study

We used transmittance images and different learning algorithms to classify insect damaged and un-damaged wheat kernels. Using the histogram of the pixels of the wheat images as the feature, and the linear model as the learning algorithm, we achieved a False Positive Rate (1-specificity) of 0.2 at the True Positive Rate (sensitivity) of 0.8 and an Area Under the ROC Curve (AUC) of 0.86. Combining the linear model and a Radial Basis Function Network in a committee resulted in a FP Rate of 0.1 at the TP Rate of 0.8 and an AUC of 0.92. © 2004 IEEE

Bilkent University Institutional Repository

Recommended from our members

Improving music genre classification using automatically induced harmony rules

Author: Amélie Anglade
Aucouturier J.-J.
Cataltepe Z.
Emmanouil Benetos
Fukunaga K.
Lawson C. L.
Matthias Mauch
Piston W.
Pérez-Sancho C.
Quinlan J. R.
Schölkopf B.
Simon Dixon
Tzanetakis G.
van der Hedjen F.
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2009
Field of study

We present a new genre classification framework using both low-level signal-based features and high-level harmony features. A state-of-the-art statistical genre classifier based on timbral features is extended using a first-order random forest containing for each genre rules derived from harmony or chord sequences. This random forest has been automatically induced, using the first-order logic induction algorithm TILDE, from a dataset, in which for each chord the degree and chord category are identified, and covering classical, jazz and pop genre classes. The audio descriptor-based genre classifier contains 206 features, covering spectral, temporal, energy, and pitch characteristics of the audio signal. The fusion of the harmony-based classifier with the extracted feature vectors is tested on three-genre subsets of the GTZAN and ISMIR04 datasets, which contain 300 and 448 recordings, respectively. Machine learning classifiers were tested using 5 × 5-fold cross-validation and feature selection. Results indicate that the proposed harmony-based rules combined with the timbral descriptor-based genre classification system lead to improved genre classification rates

City Research Online

Crossref

Ghent University Academic Bibliography

University of Miami: Scholarship Miami

The University of Manchester - Institutional Repository

Radboud Repository

Peptide classification using optimal and information theoretic syntactic modeling

Author: Aygün E. (E.)
Cataltepe Z. (Z.)
Oommen J. (B. John)
Publication venue: 'Elsevier BV'
Publication date: 01/11/2010
Field of study

Carleton University's Institutional Repository

On utilizing optimal and information theoretic syntactic modeling for peptide classification

Author: Aygün E. (Eser)
Cataltepe Z. (Zehra)
Oommen J. (B. John)
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 16/10/2009
Field of study

Syntactic methods in pattern recognition have been used extensively in bioinformatics, and in particular, in the analysis of gene and protein expressions, and in the recognition and classification of bio-sequences. These methods are almost universally distance-based. This paper concerns the use of an Optimal and Information Theoretic (OIT) probabilistic model [11] to achieve peptide classification using the information residing in their syntactic representations. The latter has traditionally been achieved using the edit distances required in the respective peptide comparisons. We advocate that one can model the differences between compared strings as a mutation model consisting of random Substitutions, Insertions and Deletions (SID) obeying the OIT model. Thus, in this paper, we show that the probability measure obtained from the OIT model can be perceived as a sequence similarity metric, using which a Support Vector Machine (SVM)-based peptide classifier, referred to as OIT-SVM, can be devised. The classifier, which we have built has been tested for eight different "substitution" matrices and for two different data sets, namely, the HIV-1 Protease Cleavage sites and the T-cell Epitopes. The results show that the OIT model performs significantly better than the one which uses a Needleman-Wunsch sequence alignment score, and the peptide classification methods that previously experimented with the same two datasets

Carleton University's Institutional Repository