Search CORE

29 research outputs found

Toward Interpretable Deep Reinforcement Learning with Linear Model U-Trees

Author: AK McCallum
D Dancey
E Ikonomovska
M Hall
M Riedmiller
N Landwehr
P Chaudhuri
RS Sutton
S Tong
V Mnih
WY Loh
Publication venue
Publication date: 16/07/2018
Field of study

Deep Reinforcement Learning (DRL) has achieved impressive success in many applications. A key component of many DRL models is a neural network representing a Q function, to estimate the expected cumulative reward following a state-action pair. The Q function neural network contains a lot of implicit knowledge about the RL problems, but often remains unexamined and uninterpreted. To our knowledge, this work develops the first mimic learning framework for Q functions in DRL. We introduce Linear Model U-trees (LMUTs) to approximate neural network predictions. An LMUT is learned using a novel on-line algorithm that is well-suited for an active play setting, where the mimic learner observes an ongoing interaction between the neural net and the environment. Empirical evaluation shows that an LMUT mimics a Q function substantially better than five baseline methods. The transparent tree structure of an LMUT facilitates understanding the network's learned knowledge by analyzing feature influence, extracting rules, and highlighting the super-pixels in image inputs.Comment: This paper is accepted by ECML-PKDD 201

arXiv.org e-Print Archive

Crossref

Multi-label classification via multi-target regression on data streams

Author: A Bifet
A Shaker
Aljaž Osojnik
C Largeron
C Vens
E Gibaja
E Ikonomovska
E Ikonomovska
E Ikonomovska
ES Xioufis
G Madjarov
G Tsoumakas
I Triguero
J Demšar
J Fürnkranz
J Gama
J Read
J Read
J Read
L Rutkowski
M Friedman
Panče Panov
Sašo Džeroski
W Cheng
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Interval forecasts based on regression trees for streaming data

Author: A Bifet
A Khosravi
B Krawczyk
Charles C. Taylor
DB Percival
DL Shrestha
E Ikonomovska
E Ikonomovska
GJ Ross
H Quan
J Duarte
P Sobhani
S-I Yoshida
Stuart Barber
T Hothorn
T Hothorn
TS Sethi
X Zhao
Xin Zhao
Z Milan
Zoka Milan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/03/2021
Field of study

In forecasting, we often require interval forecasts instead of just a specific point forecast. To track streaming data effectively, this interval forecast should reliably cover the observed data and yet be as narrow as possible. To achieve this, we propose two methods based on regression trees: one ensemble method and one method based on a single tree. For the ensemble method, we use weighted results from the most recent models, and for the single-tree method, we retain one model until it becomes necessary to train a new model. We propose a novel method to update the interval forecast adaptively using root mean square prediction errors calculated from the latest data batch. We use wavelet-transformed data to capture long time variable information and conditional inference trees for the underlying regression tree model. Results show that both methods perform well, having good coverage without the intervals being excessively wide. When the underlying data generation mechanism changes, their performance is initially affected but can recover relatively quickly as time proceeds. The method based on a single tree performs the best in computational (CPU) time compared to the ensemble method. When compared to ARIMA and GARCH modelling, our methods achieve better or similar coverage and width but require considerably less CPU time

Crossref

White Rose Research Online

Scalable and efficient multi-label classification for evolving data streams

Author: A. Appice
A. Bifet
A. Bifet
A. Bifet
A. Bifet
A. Clare
A. M. Ráez
Albert Bifet
Bernhard Pfahringer
E. Ikonomovska
E. Spyromitros-Xioufis
G. Tsoumakas
G. Tsoumakas
G. Widmer
Geoff Holmes
J. Demšar
J. Fürnkranz
J. Gama
J. Read
J. Read
Jesse Read
K. Crammer
K. Dembczyński
M. Hall
M. L. Zhang
N. C. Oza
N. Cesa-Bianchi
P. Domingos
R. E. Schapire
S. Godbole
W. Cheng
W. Cheng
W. Qu
X. Kong
Y. N. Law
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Revisiting the effect of history on learning performance: the problem of the demanding lord

Author: BI Crabtree
D Angluin
D Goldberg
E Ikonomovska
F Fdez-Riverola
F Massey Jr
G Widmer
George Giannakopoulos
GI Webb
H Harter
J Kiefer
J Ramon
J Schlimmer
K Fukunaga
K Fukunaga
M Hall
M Lazarescu
M Maloof
M Maloof
M Musavi
M Núñez
M Scholz
R Fan
R Fidalgo-Merino
R Kohavi
R Reinke
T Dietterich
T Mitchell
Themis Palpanas
WS Cleveland
Y Freund
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Automated Adaptation Strategies for Stream Learning

Author: A Bifet
A Bifet
A Cinar
ALD Rossi
AP Dawid
B Gabrys
B Gabrys
B Gabrys
C Lemke
D Ruta
E Anderson
E Ikonomovska
F Mohr
F Souza
F Wilcoxon
G Carpenter
G Widmer
I Zliobaite
J Demšar
J Montiel
JSR Jang
JZ Kolter
KO Stanley
L Fortuna
L Fortuna
L Kotthoff
L Minku
L Wasserman
LI Kuncheva
M Hall
M Herbster
M Martin Salvador
M Scholz
MT Vakil-Baghmisheh
N Littlestone
NC Oza
P Kadlec
P Kadlec
R Bakirov
R Bakirov
R Elwell
R Klinkenberg
RA Fisher
RS Olson
S Gomes Soares
S Joe Qin
TD Nguyen
Publication venue
Publication date: 30/04/2021
Field of study

Automation of machine learning model development is increasingly becoming an established research area. While automated model selection and automated data pre-processing have been studied in depth, there is, however, a gap concerning automated model adaptation strategies when multiple strategies are available. Manually developing an adaptation strategy can be time consuming and costly. In this paper we address this issue by proposing the use of flexible adaptive mechanism deployment for automated development of adaptation strategies. Experimental results after using the proposed strategies with five adaptive algorithms on 36 datasets confirm their viability. These strategies achieve better or comparable performance to the custom adaptation strategies and the repeated deployment of any single adaptive mechanism

arXiv.org e-Print Archive

Crossref

OPUS - University of Technology Sydney

Bournemouth University Research Online

Adaptive Windowing for Online Learning from Multiple Inter-related Data Streams

Author: Driessens K.
Dzeroski S.
Gama J.
Ikonomovska E.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2011
Field of study

Crossref

Adaptive Windowing for Online Learning from Multiple Inter-related Data Streams

Author: Driessens K.
Dzeroski S.
Gama J.
Ikonomovska E.
Publication venue
Publication date: 01/01/2011
Field of study

Maastricht University Research Portal

Crossref

Predicting unusual energy consumption events from smart home sensor network by data stream mining with misclassified recall

Author: A Pughat
AG Finogeev
D Egarter
E Ikonomovska
E Ikonomovska
Jiaxue Li
L Bottou
MV Shcherbakov
Nilanjan Dey
Raymond K. Wong
S Maity
Simon Fong
Wei Song
WM Kang
Yifei Tian
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Predictive regional trees to supplement geo-physical random fields

Author: A. Ciampi
D. Potts
E. Ikonomovska
G. Góra
G. Watson
J. Gama
S. Shekhar
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Nowadays ubiquitous sensor stations are deployed to measure geophysical fields for several ecological and environmental processes. Although these fields are measured at the specific location of stations, geo-statistical problems demand for inference processes to supplement, smooth and standardize recorded data. We study how predictive regional trees can supplement data sampled periodically in an ubiquitous sensing scenario. Data records that are similar one to each other are clustered according to a rectangular decomposition of the region of analysis; a predictive model is associated to the region covered by each cluster. The cluster model depicts the spatial variation of data over a map, the predictive model supplements any unknown record that is recognized belong to a cluster region. We illustrate an incremental algorithm to yield time-evolving predictive regional trees that account for the fact that the statistical properties of the recorded data may change over time. This algorithm is evaluated with spatio-temporal data collections.Nowadays ubiquitous sensor stations are deployed to measure geophysical fields for several ecological and environmental processes. Although these fields are measured at the specific location of stations, geo-statistical problems demand for inference processes to supplement, smooth and standardize recorded data. We study how predictive regional trees can supplement data sampled periodically in an ubiquitous sensing scenario. Data records that are similar one to each other are clustered according to a rectangular decomposition of the region of analysis; a predictive model is associated to the region covered by each cluster. The cluster model depicts the spatial variation of data over a map, the predictive model supplements any unknown record that is recognized belong to a cluster region. We illustrate an incremental algorithm to yield time-evolving predictive regional trees that account for the fact that the statistical properties of the recorded data may change over time. This algorithm is evaluated with spatio-temporal data collections. © Springer-Verlag Berlin Heidelberg 2013

Crossref

Archivio istituzionale della ricerca - Università di Bari