Search CORE

2,189 research outputs found

Online Clustering for Novelty Detection and Concept Drift in Data Streams

Author: A Amini
CC Aggarwal
EJ Spinosa
ER Faria
I Zliobaite
J Andrade Silva de
J Gama
J Gama
KD Garcia
M Markou
MM Masud
X Ding
ZS Abdallah
Publication venue: Springer
Publication date: 01/01/2019
Field of study

Crossref

University of Twente Research Information

Unsupervised learning approaches for non-stationary data streams

Author: Dearo Garcia Kemilly
Publication venue: University of Twente
Publication date: 16/04/2021
Field of study

University of Twente Research Information

Ensemble Clustering for Novelty Detection in Data Streams

Author: A Haque
EJ Spinosa
ER Faria
J Gama
KD Garcia
MM Masud
MM Masud
P Kranen
S Vega-Pons
Publication venue: Springer
Publication date: 01/01/2019
Field of study

Crossref

University of Twente Research Information

StreamAR: incremental and active learning with evolving sensory data for activity recognition

Author: Abdallah Z.
Gaber M.
Krishnaswamy S.
Srinivasan B.
Publication venue
Publication date: 01/11/2012
Field of study

Portsmouth University Research Portal (Pure)

Explore Bristol Research

Data stream mining: from theory to applications and from stationary to mobile

Author: Gaber M.
Gama J.
Krishnaswamy S.
Publication venue
Publication date: 22/03/2010
Field of study

Portsmouth University Research Portal (Pure)

Evolving Ensemble Fuzzy Classifier

Author: Lughofer Edwin
Pedrycz Witold
Pratama Mahardhika
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2018
Field of study

The concept of ensemble learning offers a promising avenue in learning from data streams under complex environments because it addresses the bias and variance dilemma better than its single model counterpart and features a reconfigurable structure, which is well suited to the given context. While various extensions of ensemble learning for mining non-stationary data streams can be found in the literature, most of them are crafted under a static base classifier and revisits preceding samples in the sliding window for a retraining step. This feature causes computationally prohibitive complexity and is not flexible enough to cope with rapidly changing environments. Their complexities are often demanding because it involves a large collection of offline classifiers due to the absence of structural complexities reduction mechanisms and lack of an online feature selection mechanism. A novel evolving ensemble classifier, namely Parsimonious Ensemble pENsemble, is proposed in this paper. pENsemble differs from existing architectures in the fact that it is built upon an evolving classifier from data streams, termed Parsimonious Classifier pClass. pENsemble is equipped by an ensemble pruning mechanism, which estimates a localized generalization error of a base classifier. A dynamic online feature selection scenario is integrated into the pENsemble. This method allows for dynamic selection and deselection of input features on the fly. pENsemble adopts a dynamic ensemble structure to output a final classification decision where it features a novel drift detection scenario to grow the ensemble structure. The efficacy of the pENsemble has been numerically demonstrated through rigorous numerical studies with dynamic and evolving data streams where it delivers the most encouraging performance in attaining a tradeoff between accuracy and complexity.Comment: this paper has been published by IEEE Transactions on Fuzzy System

arXiv.org e-Print Archive

DR-NTU (Digital Repository of NTU)

Concept Drift Detection in Data Stream Mining: The Review of Contemporary Literature

Author: B. Ramakrishna
Publication venue: Global Journals Inc. (US)
Publication date: 26/04/2017
Field of study

Mining process such as classification, clustering of progressive or dynamic data is a critical objective of the information retrieval and knowledge discovery; in particular, it is more sensitive in data stream mining models due to the possibility of significant change in the type and dimensionality of the data over a period. The influence of these changes over the mining process termed as concept drift. The concept drift that depict often in streaming data causes unbalanced performance of the mining models adapted. Hence, it is obvious to boost the mining models to predict and analyse the concept drift to achieve the performance at par best. The contemporary literature evinced significant contributions to handle the concept drift, which fall in to supervised, unsupervised learning, and statistical assessment approaches. This manuscript contributes the detailed review of the contemporary concept-drift detection models depicted in recent literature. The contribution of the manuscript includes the nomenclature of the concept drift models and their impact of imbalanced data tuples

Global Journal of Computer Science and Technology (GJCST)