17 research outputs found

Mining cohesive patterns in sequences and extreme multi-label classification

Author: Feremans Len
Publication date: 01/01/2020

Institutional Repository Universiteit Antwerpen

Efficiently mining cohesion-based patterns and rules in event sequences

Author: Cule Boris
Feremans Len
Goethals Bart
Publication date: 01/01/2019

Institutional Repository Universiteit Antwerpen

Mining top-k quantile-based cohesive sequential patterns

Author: Cule Boris
Feremans Len
Goethals Bart
Publication date: 01/01/2018

Institutional Repository Universiteit Antwerpen

PETSC: pattern-based embedding for time series classification

Author: Feremans Len
Goethals Bart
Čule Boris
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/03/2022

Efficient and interpretable classification of time series is an essential data mining task with many real-world applications. Recently, several dictionary- and shapelet-based time series classification methods have been proposed that employ contiguous subsequences of fixed length. We extend pattern mining to efficiently enumerate long variable-length sequential patterns with gaps. Additionally, we discover patterns at multiple resolutions, thereby combining cohesive sequential patterns that vary in length, duration and resolution. For time series classification, we construct an embedding based on sequential pattern occurrences and learn a linear model. The discovered patterns form the basis for interpretable insight into each class of time series. The pattern-based embedding for time series classification (PETSC) supports both univariate and multivariate time series datasets of varying length subject to noise or missing data. We experimentally validate that MR-PETSC performs significantly better than baseline interpretable methods such as DTW, BOP and SAX-VSM on univariate and multivariate time series. On univariate time series, our method performs comparably to many recent methods, including BOSS, cBOSS, S-BOSS, ProximityForest and ResNET, and is only narrowly outperformed by state-of-the-art methods such as HIVE-COTE, ROCKET, TS-CHIEF and InceptionTime. Moreover, on multivariate datasets PETSC performs comparably to the current state-of-the-art such as HIVE-COTE, ROCKET, CIF and ResNET, none of which are interpretable. PETSC scales to large datasets, and the total time for training and making predictions on all 85 ‘bake off’ datasets in the UCR archive is under 3 h, making it one of the fastest methods available.
PETSC is particularly useful as it learns a linear model where each feature represents a sequential pattern in the time domain, which supports human oversight to ensure predictions are trustworthy and fair, which is essential in financial, medical or bioinformatics applications.
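
The embedding step described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the time series has already been discretised into symbols, that the patterns are given rather than mined, and that a feature is simply the number of sliding windows in which a pattern occurs with gaps allowed. The names `occurs_with_gaps` and `embed` are hypothetical.

```python
from typing import List, Sequence


def occurs_with_gaps(pattern: Sequence[str], window: Sequence[str]) -> bool:
    """Return True if `pattern` is a subsequence of `window` (gaps allowed)."""
    it = iter(window)
    # `symbol in it` advances the iterator, so the order of symbols is enforced.
    return all(symbol in it for symbol in pattern)


def embed(symbols: Sequence[str], patterns: List[Sequence[str]], window: int) -> List[int]:
    """Count, for each pattern, the sliding windows in which it occurs.

    Assumption: one feature per pattern, valued by its window count.
    A series shorter than `window` is treated as a single window.
    """
    counts = [0] * len(patterns)
    n_windows = max(len(symbols) - window + 1, 1)
    for start in range(n_windows):
        segment = symbols[start:start + window]
        for j, pattern in enumerate(patterns):
            if occurs_with_gaps(pattern, segment):
                counts[j] += 1
    return counts
```

Each resulting vector could then be passed to any linear classifier, so that every learned weight refers to one sequential pattern in the time domain.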

Efficient discovery of sets of co-occurring items in event sequences

Author: Cule Boris
Feremans Len
Goethals Bart
Publication date: 01/01/2016

Institutional Repository Universiteit Antwerpen

PETSC: pattern-based embedding for time series classification

Author: Feremans Len
Goethals Bart
Čule Boris
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/03/2022

Tilburg University Repository

PETSC: pattern-based embedding for time series classification

Author: Cule Boris
Feremans Len
Goethals Bart
Publication date: 01/01/2022

Institutional Repository Universiteit Antwerpen

Tilburg University Repository

Combining Instance and Feature neighbors for Efficient Multi-label Classification

Author: Cule Boris
Feremans Len
Goethals Bart
Vens Celine
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2017

© 2017 IEEE. Multi-label classification problems occur naturally in different domains. For example, within text categorization the goal is to predict a set of topics for a document, and within image scene classification the goal is to assign labels to different objects in an image. In this work we propose a combination of two variations of k nearest neighborhoods (kNN), where the first neighborhood is computed instance (or row) based and the second neighborhood is feature (or column) based. Instance-based kNN is inspired by user-based collaborative filtering, while feature-based kNN is inspired by item-based collaborative filtering. We then apply a linear combination of the instance and feature neighbor scores and a single threshold to predict the set of labels. Experiments on various multi-label datasets show that our algorithm outperforms other state-of-the-art methods, such as ML-kNN, IBLR and Binary Relevance with SVM, on different evaluation metrics. Finally, our algorithm uses an inverted index during neighborhood search and scales to extreme datasets that have millions of instances, features and labels.
status: published
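
The scoring scheme outlined in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the published algorithm: cosine similarity is assumed on both sides, the feature side averages feature-to-label similarities without truncating to a neighborhood, no inverted index is used, and the function names and the parameters `k`, `lam` and `threshold` are hypothetical.

```python
from math import sqrt
from typing import List, Tuple

Matrix = List[List[float]]
Vector = List[float]


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity, defined as 0 when either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = sqrt(sum(x * x for x in a)) * sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def column(M: Matrix, j: int) -> Vector:
    return [row[j] for row in M]


def instance_scores(X: Matrix, Y: Matrix, x: Vector, k: int) -> Vector:
    """Row-based side: similarity-weighted label vote of the k nearest rows."""
    nearest = sorted(((cosine(x, row), i) for i, row in enumerate(X)), reverse=True)[:k]
    total = sum(s for s, _ in nearest)
    n_labels = len(Y[0])
    if total == 0:
        return [0.0] * n_labels
    return [sum(s * Y[i][l] for s, i in nearest) / total for l in range(n_labels)]


def feature_scores(X: Matrix, Y: Matrix, x: Vector) -> Vector:
    """Column-based side: mean feature-to-label similarity over active features.

    Simplification (assumption): every active feature votes for every label.
    """
    active = [j for j, v in enumerate(x) if v]
    n_labels = len(Y[0])
    if not active:
        return [0.0] * n_labels
    return [
        sum(cosine(column(X, j), column(Y, l)) for j in active) / len(active)
        for l in range(n_labels)
    ]


def predict(X: Matrix, Y: Matrix, x: Vector, k: int = 2,
            lam: float = 0.5, threshold: float = 0.5) -> Tuple[List[int], Vector]:
    """Linear combination of both score vectors, cut by a single threshold."""
    inst = instance_scores(X, Y, x, k)
    feat = feature_scores(X, Y, x)
    scores = [lam * a + (1 - lam) * b for a, b in zip(inst, feat)]
    return [int(s >= threshold) for s in scores], scores
```

Here `X` is a training matrix of instances by features and `Y` the matching matrix of instances by labels; `lam` weights the instance side against the feature side.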

Lirias

Combining instance and feature neighbors for efficient multi-label classification

Author: Cule Boris
Feremans Len
Goethals Bart
Vens Celine
Publication date: 01/01/2017

Institutional Repository Universiteit Antwerpen

Combining instance and feature neighbours for extreme multi-label classification

Author: Cule Boris
Feremans Len
Goethals Bart
Vens Celine
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/09/2020

status: Published online

Lirias