Search CORE

47,665 research outputs found

Neural networks and support vector machines based bio-activity classification

Author: Salim Naomie
Zeb Shah Jehan
Publication venue
Publication date: 01/07/2006
Field of study

Classification of various compounds into their respective biological activity classes is important in drug discovery applications from an early phase virtual compound filtering and screening point of view. In this work two types of neural networks, multi layer perceptron (MLP) and radial basis functions (RBF), and support vector machines (SVM) were employed for the classification of three types of biologically active enzyme inhibitors. Both of the networks were trained with back propagation learning method with chemical compounds whose active inhibition properties were previously known. A group of topological indices, selected with the help of principle component analysis (PCA) were used as descriptors. The results of all the three classification methods show that the performance of both the neural networks is better than the SVM

Universiti Teknologi Malaysia Institutional Repository

Active Transfer Learning with Zero-Shot Priors: Reusing Past Datasets for Future Tasks

Author: Gavves Efstratios
Mensink Thomas
Snoek Cees G. M.
Tommasi Tatiana
Tuytelaars Tinne
Publication venue
Publication date: 01/01/2015
Field of study

How can we reuse existing knowledge, in the form of available datasets, when solving a new and apparently unrelated target task from a set of unlabeled data? In this work we make a first contribution to answer this question in the context of image classification. We frame this quest as an active learning problem and use zero-shot classifiers to guide the learning process by linking the new task to the existing classifiers. By revisiting the dual formulation of adaptive SVM, we reveal two basic conditions to choose greedily only the most relevant samples to be annotated. On this basis we propose an effective active learning algorithm which learns the best possible target classification model with minimum human labeling effort. Extensive experiments on two challenging datasets show the value of our approach compared to the state-of-the-art active learning methodologies, as well as its potential to reuse past datasets with minimal effort for future tasks

arXiv.org e-Print Archive

Crossref

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

Archivio della ricerca- Università di Roma La Sapienza

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Image Retrieval with Relevance Feedback using SVM Active Learning

Author: Ngo Giang Truong
Ngo Tao Quoc
Nguyen Dung Duc
Publication venue: 'Institute of Advanced Engineering and Science'
Publication date: 01/12/2016
Field of study

In content-based image retrieval, relevant feedback is studied extensively to narrow the gap between low-level image feature and high-level semantic concept. In general, relevance feedback aims to improve the retrieval performance by learning with user's judgements on the retrieval results. Despite widespread interest, but feedback related technologies are often faced with a few limitations. One of the most obvious limitations is often requiring the user to repeat a number of steps before obtaining the improved search results. This makes the process inefficient and tedious search for the online applications. In this paper, a effective feedback related scheme for content-based image retrieval is proposed. First, a decision boundary is learned via Support Vector Machine to filter the images in the database. Then, a ranking function for selecting the most informative samples will be calculated by defining a novel criterion that considers both the scores of Support Vector Machine function and similaritymetric between the "ideal query" and the images in the database. The experimental results on standard datasets have showed the effectiveness of the proposed method

IAES journal

Crossref

Institute of Advanced Engineering and Science

Learning from life-logging data by hybrid HMM: a case study on active states prediction

Author: Lambrou Tryphon
Ni Ji
Ye Xujiong
Publication venue: 'ACTA Press'
Publication date: 01/01/2016
Field of study

In this paper, we have proposed employing a hybrid classifier-hidden Markov model (HMM) as a supervised learning approach to recognize daily active states from sequential life-logging data collected from wearable sensors. We generate synthetic data from real dataset to cope with noise and incompleteness for training purpose and, in conjunction with HMM, propose using a multiobjective genetic programming (MOGP) classifier in comparison of the support vector machine (SVM) with variant kernels. We demonstrate that the system with either algorithm works effectively to recognize personal active states regarding medical reference. We also illustrate that MOGP yields generally better results than SVM without requiring an ad hoc kernel

University of Lincoln Institutional Repository

Automatic skin segmentation for gesture recognition combining region and support vector machine active learning

Author: Awad George M.
Han Junwei
Sutherland Alistair
Wu Hai
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2006
Field of study

Skin segmentation is the cornerstone of many applications such as gesture recognition, face detection, and objectionable image filtering. In this paper, we attempt to address the skin segmentation problem for gesture recognition. Initially, given a gesture video sequence, a generic skin model is applied to the first couple of frames to automatically collect the training data. Then, an SVM classifier based on active learning is used to identify the skin pixels. Finally, the results are improved by incorporating region segmentation. The proposed algorithm is fully automatic and adaptive to different signers. We have tested our approach on the ECHO database. Comparing with other existing algorithms, our method could achieve better performance

Crossref

Irish Universities

DCU Online Research Access Service

On Using Active Learning and Self-Training when Mining Performance Discussions on Stack Overflow

Author: Allamanis M.
Chowdhury S.
Cicchetti A.
Lin Y.
Pedregosa F.
Settles B.
Settles B.
Soliman M.
Ying A.
Publication venue
Publication date: 01/01/2017
Field of study

Abundant data is the key to successful machine learning. However, supervised learning requires annotated data that are often hard to obtain. In a classification task with limited resources, Active Learning (AL) promises to guide annotators to examples that bring the most value for a classifier. AL can be successfully combined with self-training, i.e., extending a training set with the unlabelled examples for which a classifier is the most certain. We report our experiences on using AL in a systematic manner to train an SVM classifier for Stack Overflow posts discussing performance of software components. We show that the training examples deemed as the most valuable to the classifier are also the most difficult for humans to annotate. Despite carefully evolved annotation criteria, we report low inter-rater agreement, but we also propose mitigation strategies. Finally, based on one annotator's work, we show that self-training can improve the classification accuracy. We conclude the paper by discussing implication for future text miners aspiring to use AL and self-training.Comment: Preprint of paper accepted for the Proc. of the 21st International Conference on Evaluation and Assessment in Software Engineering, 201

arXiv.org e-Print Archive

Lund University Publications

Crossref

Swedish Institute of Computer Science Publications Database

Active Sampling of Pairs and Points for Large-scale Linear Bipartite Ranking

Author: Lin Hsuan-Tien
Shen Wei-Yuan
Publication venue
Publication date: 24/08/2017
Field of study

Bipartite ranking is a fundamental ranking problem that learns to order relevant instances ahead of irrelevant ones. The pair-wise approach for bi-partite ranking construct a quadratic number of pairs to solve the problem, which is infeasible for large-scale data sets. The point-wise approach, albeit more efficient, often results in inferior performance. That is, it is difficult to conduct bipartite ranking accurately and efficiently at the same time. In this paper, we develop a novel active sampling scheme within the pair-wise approach to conduct bipartite ranking efficiently. The scheme is inspired from active learning and can reach a competitive ranking performance while focusing only on a small subset of the many pairs during training. Moreover, we propose a general Combined Ranking and Classification (CRC) framework to accurately conduct bipartite ranking. The framework unifies point-wise and pair-wise approaches and is simply based on the idea of treating each instance point as a pseudo-pair. Experiments on 14 real-word large-scale data sets demonstrate that the proposed algorithm of Active Sampling within CRC, when coupled with a linear Support Vector Machine, usually outperforms state-of-the-art point-wise and pair-wise ranking approaches in terms of both accuracy and efficiency.Comment: a shorter version was presented in ACML 201

arXiv.org e-Print Archive

CiteSeerX

Advances in Hyperspectral Image Classification: Earth monitoring with statistical learning methods

Author: Benediktsson Jón Atli
Bruzzone Lorenzo
Camps-Valls Gustavo
Tuia Devis
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 18/10/2013
Field of study

Hyperspectral images show similar statistical properties to natural grayscale or color photographic images. However, the classification of hyperspectral images is more challenging because of the very high dimensionality of the pixels and the small number of labeled examples typically available for learning. These peculiarities lead to particular signal processing problems, mainly characterized by indetermination and complex manifolds. The framework of statistical learning has gained popularity in the last decade. New methods have been presented to account for the spatial homogeneity of images, to include user's interaction via active learning, to take advantage of the manifold structure with semisupervised learning, to extract and encode invariances, or to adapt classifiers and image representations to unseen yet similar scenes. This tutuorial reviews the main advances for hyperspectral remote sensing image classification through illustrative examples.Comment: IEEE Signal Processing Magazine, 201

arXiv.org e-Print Archive

CiteSeerX

Wageningen University & Research Publications