13,156 research outputs found

Reservoir of Diverse Adaptive Learners and Stacking Fast Hoeffding Drift Detection Methods for Evolving Data Streams

Author: Paquet Eric
Pesaranghader Ali
Viktor Herna
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 07/09/2017

The last decade has seen a surge of interest in adaptive learning algorithms for data stream classification, with applications ranging from predicting ozone level peaks and learning stock market indicators to detecting computer security violations. In addition, a number of methods have been developed to detect concept drifts in these streams. Consider a scenario where we have a number of classifiers with diverse learning styles and different drift detectors. Intuitively, the current 'best' (classifier, detector) pair is application dependent and may change as a result of the stream evolution. Our research builds on this observation. We introduce the Tornado framework that implements a reservoir of diverse classifiers, together with a variety of drift detection algorithms. In our framework, all (classifier, detector) pairs proceed, in parallel, to construct models against the evolving data streams. At any point in time, we select the pair which currently yields the best performance. We further incorporate two novel stacking-based drift detection methods, namely the FHDDMS and FHDDMS_add approaches. The experimental evaluation confirms not only that the current 'best' (classifier, detector) pair is heavily dependent on the characteristics of the stream, but also that this selection evolves as the stream flows. Further, our FHDDMS variants detect concept drifts accurately and in a timely fashion while outperforming the state-of-the-art.

Comment: 42 pages and 14 figures
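The reservoir idea in this abstract (every (classifier, detector) pair learns from the same stream, and the currently best pair is selected at each step) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the Tornado implementation: the two toy classifiers, the windowed error-rate detector, the `Pair` and `run_reservoir` names, and the use of cumulative accuracy as the only selection criterion are all hypothetical stand-ins, and the pairs are updated sequentially rather than in parallel.

```python
class MajorityClassifier:
    """Toy learner (assumption): predicts the most frequent label seen so far."""

    def __init__(self):
        self.counts = {}

    def predict(self, x):
        return max(self.counts, key=self.counts.get) if self.counts else 0

    def learn(self, x, y):
        self.counts[y] = self.counts.get(y, 0) + 1

    def reset(self):
        self.counts = {}


class LastLabelClassifier:
    """Toy learner (assumption): predicts the label of the previous instance."""

    def __init__(self):
        self.last = 0

    def predict(self, x):
        return self.last

    def learn(self, x, y):
        self.last = y

    def reset(self):
        self.last = 0


class WindowErrorDetector:
    """Toy detector (assumption): signals drift when the error rate over a
    full sliding window exceeds a fixed threshold. Not FHDDMS."""

    def __init__(self, window=20, threshold=0.6):
        self.window = window
        self.threshold = threshold
        self.errors = []

    def update(self, error):
        self.errors.append(error)
        if len(self.errors) > self.window:
            self.errors.pop(0)
        full = len(self.errors) == self.window
        return full and sum(self.errors) / self.window > self.threshold

    def reset(self):
        self.errors = []


class Pair:
    """One (classifier, detector) pair with its running accuracy."""

    def __init__(self, name, classifier, detector):
        self.name = name
        self.classifier = classifier
        self.detector = detector
        self.correct = 0
        self.seen = 0

    def accuracy(self):
        return self.correct / self.seen if self.seen else 0.0

    def step(self, x, y):
        # Test-then-train: predict first, then learn from the true label.
        error = int(self.classifier.predict(x) != y)
        self.seen += 1
        self.correct += 1 - error
        self.classifier.learn(x, y)
        # On a drift signal, discard the outdated model and detector state.
        if self.detector.update(error):
            self.classifier.reset()
            self.detector.reset()


def run_reservoir(stream, pairs):
    """Feed every instance to every pair; record the best pair per step."""
    best_per_step = []
    for x, y in stream:
        for pair in pairs:
            pair.step(x, y)
        best_per_step.append(max(pairs, key=Pair.accuracy).name)
    return best_per_step
```

Calling `run_reservoir` on a stream of `(features, label)` tuples returns the name of the leading pair after each instance, so a change in the returned names over time reflects the evolving selection described in the abstract.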

arXiv.org e-Print Archive

NRC Publications Archive

Request-and-Reverify: Hierarchical Hypothesis Testing for Concept Drift Detection with Expensive Labels

Author: Principe Jose C.
Wang Xiaoyang
Yu Shujian
Publication venue: 'International Joint Conferences on Artificial Intelligence'
Publication date: 28/06/2018

One important assumption underlying common classification models is the stationarity of the data. However, in real-world streaming applications, the data concept indicated by the joint distribution of feature and label is not stationary but drifting over time. Concept drift detection aims to detect such drifts and adapt the model so as to mitigate any deterioration in the model's predictive performance. Unfortunately, most existing concept drift detection methods rely on a strong and over-optimistic condition that the true labels are available immediately for all already classified instances. In this paper, a novel Hierarchical Hypothesis Testing framework with a Request-and-Reverify strategy is developed to detect concept drifts by requesting labels only when necessary. Two methods, namely Hierarchical Hypothesis Testing with Classification Uncertainty (HHT-CU) and Hierarchical Hypothesis Testing with Attribute-wise "Goodness-of-fit" (HHT-AG), are proposed under this framework. In experiments with benchmark datasets, our methods demonstrate overwhelming advantages over state-of-the-art unsupervised drift detectors. More importantly, our methods even outperform DDM (the widely used supervised drift detector) while using significantly fewer labels.

Comment: Published as a conference paper at IJCAI 2018
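The request-and-reverify strategy described above (an unsupervised first layer raises a suspicion, and labels are requested only then so that a second layer can confirm or reject the drift) can be sketched as follows. This is a hedged illustration, not the HHT-CU or HHT-AG procedure: the function names, the Hoeffding-style threshold in the first layer, the simple error-rate comparison in the second layer, and the `delta` and `min_increase` parameters are all assumptions chosen for brevity.

```python
import math


def mean(values):
    return sum(values) / len(values)


def hoeffding_bound(n, delta):
    """One-sided Hoeffding deviation bound for the mean of n values in [0, 1]."""
    return math.sqrt(math.log(1.0 / delta) / (2.0 * n))


def layer1_suspects_drift(ref_uncertainty, recent_uncertainty, delta=0.01):
    """Label-free check (assumption): compare mean classification
    uncertainty in a reference window and a recent window."""
    gap = abs(mean(recent_uncertainty) - mean(ref_uncertainty))
    eps = (hoeffding_bound(len(ref_uncertainty), delta)
           + hoeffding_bound(len(recent_uncertainty), delta))
    return gap > eps


def layer2_confirms_drift(ref_errors, recent_errors, min_increase=0.1):
    """Label-based check (assumption): confirm only if the error rate on
    the newly labelled window rose by more than min_increase."""
    return mean(recent_errors) - mean(ref_errors) > min_increase


def request_and_reverify(ref_uncertainty, recent_uncertainty,
                         ref_errors, request_labels):
    """Return (drift_confirmed, number_of_labels_requested).

    request_labels is a hypothetical callback that returns 0/1 error
    indicators for the recent window; it is called only on suspicion.
    """
    if not layer1_suspects_drift(ref_uncertainty, recent_uncertainty):
        return False, 0
    recent_errors = request_labels()
    return layer2_confirms_drift(ref_errors, recent_errors), len(recent_errors)
```

The point of the two-layer design is visible in the return value: when the first layer sees no change, no labels are requested at all, which is where the label savings claimed in the abstract would come from.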

arXiv.org e-Print Archive

Crossref

A Comparative Numerical Study on GEM, MHSP and MSGC

Author: A.V. Lyashenko
G. Croci
I. Smirnov
K. Nikolopoulos
N Majumdar
P Bhattacharya
P. Fonte
R. Veenhof
S Bhattacharya
S Mukhopadhyay
S. Biagi
Ö. Sahin
Publication venue: 'IOP Publishing'
Publication date: 27/04/2012

In this work, we have tried to develop a detailed understanding of the physical processes occurring in those variants of Micro Pattern Gas Detectors (MPGDs) that share micro hole and micro strip geometry, such as GEM, MHSP and MSGC. Some of the important and fundamental characteristics of these detectors, such as gain, transparency and efficiency, and their operational dependence on different device parameters have been estimated through detailed numerical simulation of the detector dynamics. We have used a relatively new simulation framework, developed especially for MPGDs, that combines packages such as GARFIELD, neBEM, MAGBOLTZ and HEED. The results compare closely with the available experimental data. This suggests that the framework can model the intricacies of these micro-structured detectors while also providing insight into their inherently complex dynamical processes.

arXiv.org e-Print Archive

Crossref

CERN Document Server