Efficient multi-label classification for evolving data streams

Author: Bifet Albert
Holmes Geoffrey
Pfahringer Bernhard
Read Jesse
Publication venue: University of Waikato, Department of Computer Science
Publication date: 01/05/2010

Many real-world problems involve data that can be considered multi-label data streams. Efficient methods exist for multi-label classification in non-streaming scenarios. However, learning in evolving streaming scenarios is more challenging, as the learners must be able to adapt to change using limited time and memory. This paper proposes a new experimental framework for studying multi-label evolving stream classification, and new efficient methods that combine the best practices in streaming scenarios with the best practices in multi-label classification. We present a Multi-label Hoeffding Tree with multi-label classifiers at the leaves as a base classifier. We obtain fast and accurate methods that are well suited for this challenging multi-label classification streaming task. Using the new experimental framework, we test our methodology by performing an evaluation study on synthetic and real-world datasets. In comparison to well-known batch multi-label methods, we obtain encouraging results.
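The abstract above does not spell out the split rule, but Hoeffding trees in general decide when to split a leaf using the Hoeffding bound. The sketch below illustrates that standard test only; it is not the paper's multi-label variant, and the function names and example values are assumptions made for illustration.

```python
import math


def hoeffding_bound(value_range: float, delta: float, n: int) -> float:
    """Return epsilon such that, with probability 1 - delta, the true mean of a
    variable with the given range lies within epsilon of the mean observed
    over n independent samples."""
    return math.sqrt(value_range ** 2 * math.log(1.0 / delta) / (2.0 * n))


def should_split(best_gain: float, second_gain: float,
                 value_range: float, delta: float, n: int) -> bool:
    """Split when the best attribute beats the runner-up by more than epsilon."""
    return (best_gain - second_gain) > hoeffding_bound(value_range, delta, n)


if __name__ == "__main__":
    # Hypothetical gains observed at a leaf after 200 and then 1000 examples.
    for n in (200, 1000):
        print(n, should_split(0.30, 0.15, value_range=1.0, delta=1e-7, n=n))
```

With few examples the bound is wide and the leaf keeps collecting statistics; as more examples arrive the bound shrinks and the same gain difference becomes sufficient to split.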

Research Commons@Waikato

A Novel Progressive Multi-label Classifier for Class-incremental Data

Author: Dave Mihika
Er Meng Joo
Tapiawala Sahil
Venkatesan Rajasekar
Publication date: 22/09/2016

In this paper, a progressive learning algorithm for multi-label classification is designed that learns new labels while retaining the knowledge of previous labels. New output neurons corresponding to new labels are added, and the neural network connections and parameters are automatically restructured as if the label had been introduced from the beginning. This work is the first of its kind on multi-label classification for class-incremental learning. It is useful for real-world applications such as robotics, where streaming data are available and the number of labels is often unknown. Based on the Extreme Learning Machine framework, a novel universal classifier with plug-and-play capabilities for progressive multi-label classification is developed. Experimental results on various benchmark synthetic and real datasets validate the efficiency and effectiveness of our proposed algorithm. Comment: 5 pages, 3 figures, 4 tables

arXiv.org e-Print Archive

Scikit-Multiflow: A Multi-output Streaming Framework

Author: Abdessalem Talel
Bifet Albert
Montiel Jacob
Read Jesse
Publication venue: 'Test accounts'
Publication date: 12/07/2018

Scikit-multiflow is a multi-output/multi-label and stream data mining framework for the Python programming language. Conceived to serve as a platform to encourage democratization of stream learning research, it provides multiple state-of-the-art methods for stream learning, stream generators and evaluators. Scikit-multiflow builds upon popular open source frameworks including scikit-learn, MOA and MEKA. Development follows the FOSS principles and quality is enforced by complying with PEP8 guidelines and using continuous integration and automatic testing. The source code is publicly available at https://github.com/scikit-multiflow/scikit-multiflow. Comment: 5 pages, Open Source Software
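Stream evaluators of the kind mentioned above commonly follow a prequential (test-then-train) protocol: each example is first used to test the model and only then to update it. The sketch below shows that protocol in plain Python with a multi-label Hamming loss. It does not use the scikit-multiflow API; the class `MajorityLabelLearner` and the function `prequential_hamming_loss` are hypothetical names introduced only for this illustration.

```python
from typing import Iterable, List, Sequence, Tuple


class MajorityLabelLearner:
    """Hypothetical baseline: predicts each label as its majority value so far."""

    def __init__(self, n_labels: int) -> None:
        self.positives = [0] * n_labels  # count of 1s seen per label
        self.seen = 0                    # number of examples seen

    def predict(self, x: Sequence[float]) -> List[int]:
        # Ignores the features; predicts 1 only for labels set in a strict majority.
        return [1 if 2 * p > self.seen else 0 for p in self.positives]

    def partial_fit(self, x: Sequence[float], y: Sequence[int]) -> None:
        self.seen += 1
        for j, value in enumerate(y):
            self.positives[j] += value


def prequential_hamming_loss(
    stream: Iterable[Tuple[Sequence[float], Sequence[int]]],
    learner: MajorityLabelLearner,
    n_labels: int,
) -> float:
    """Test on each example first, then train on it; return the mean Hamming loss."""
    errors = 0
    count = 0
    for x, y in stream:
        y_hat = learner.predict(x)                       # test
        errors += sum(int(a != b) for a, b in zip(y_hat, y))
        count += 1
        learner.partial_fit(x, y)                        # then train
    return errors / (count * n_labels) if count else 0.0


if __name__ == "__main__":
    toy_stream = [((0.0,), [1, 0]), ((1.0,), [1, 0]),
                  ((2.0,), [1, 1]), ((3.0,), [1, 0])]
    print(prequential_hamming_loss(toy_stream, MajorityLabelLearner(2), 2))
```

Because every example is tested before it is learned, no separate hold-out set is needed, which suits unbounded streams.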

arXiv.org e-Print Archive

Dynamic Adaptation on Non-Stationary Visual Domains

Author: B Moiseev
B Sun
C Tan
LC Chen
M Ghifary
M Tennant
SR Richter
T Tommasi
Y Ganin
Y LeCun
Y Wang
Publication date: 02/08/2018

Domain adaptation aims to learn models on a supervised source domain that perform well on an unsupervised target domain. Prior work has examined domain adaptation in the context of stationary domain shifts, i.e. static data sets. However, with large-scale or dynamic data sources, data from a defined domain is not usually available all at once. For instance, in a streaming data scenario, dataset statistics effectively become a function of time. We introduce a framework for adaptation over non-stationary distribution shifts applicable to large-scale and streaming data scenarios. The model is adapted sequentially over incoming unsupervised streaming data batches. This enables improvements over several batches without the need for any additional annotated data. To demonstrate the effectiveness of our proposed framework, we modify associative domain adaptation to work well on source and target data batches with unequal class distributions. We apply our method to several adaptation benchmark datasets for classification and show improved classifier accuracy not only for the currently adapted batch, but also when applied to future stream batches. Furthermore, we show the applicability of our associative learning modifications to semantic segmentation, where we achieve competitive results.

arXiv.org e-Print Archive