Distribution-sensitive learning for imbalanced datasets

Davis, Randall; Morency, Louis-Philippe; Song, Yale

research

Distribution-sensitive learning for imbalanced datasets

Authors: Randall Davis
Louis-Philippe Morency
Yale Song
Publication date: 1 January 2013
Publisher: 'Institute of Electrical and Electronics Engineers (IEEE)'
Doi

Abstract

Many real-world face and gesture datasets are by nature imbalanced across classes. Conventional statistical learning models (e.g., SVM, HMM, CRY), however, are sensitive to imbalanced datasets. In this paper we show how an imbalanced dataset affects the performance of a standard learning algorithm, and propose a distribution-sensitive prior to deal with the imbalanced data problem. This prior analyzes the training dataset before learning a model, and puts more weight on the samples from underrepresented classes, allowing all samples in the dataset to have a balanced impact in the learning process. We report on two empirical studies regarding learning with imbalanced data, using two publicly available recent gesture datasets, the Microsoft Research Cambridge-12 (MSRC-12) and NATOPS aircraft handling signals datasets. Experimental results show that learning from balanced data is important, and that the distribution-sensitive prior improves performance with imbalanced datasets.United States. Office of Naval Research (Grant N000140910625)National Science Foundation (U.S.) (Grant IIS-1118018)National Science Foundation (U.S.) (Grant IIS-1018055)United States. Army Research, Development, and Engineering Comman

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

CiteSeerX

oai:CiteSeerX.psu:10.1.1.1041....

Last time updated on 07/12/2020

Crossref

info:doi/10.1109%2Ffg.2013.655...

Last time updated on 16/02/2019

DSpace@MIT

oai:dspace.mit.edu:1721.1/8610...

Last time updated on 25/04/2014