Search CORE

335 research outputs found

Alternative Quality Measures for Time Series Shapelets

Author: C.A.R. Hoare
C.E. Shannon
E. Keogh
J. Demšar
J. Rodriguez
L. Ye
W.H. Kruskal
Y. Jeong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

Crossref

University of East Anglia digital repository

Classification of time series by shapelet transformation

Author: Anthony Bagnall
C Cortes
C Hoare
C Shannon
C Stransky
D Vries De
Edgaras Baranauskas
H Ding
J Demšar
J Lines
James Mapp
Jason Lines
JJ Rodriguez
Jon Hills
L Breiman
L Ye
M Bober
M Hall
N Friedman
P Duarte-Neto
S Campana
W Kruskal
Y Jeong
Z Xing
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2014
Field of study

Time-series classification (TSC) problems present a specific challenge for classification algorithms: how to measure similarity between series. A \emph{shapelet} is a time-series subsequence that allows for TSC based on local, phase-independent similarity in shape. Shapelet-based classification uses the similarity between a shapelet and a series as a discriminatory feature. One benefit of the shapelet approach is that shapelets are comprehensible, and can offer insight into the problem domain. The original shapelet-based classifier embeds the shapelet-discovery algorithm in a decision tree, and uses information gain to assess the quality of candidates, finding a new shapelet at each node of the tree through an enumerative search. Subsequent research has focused mainly on techniques to speed up the search. We examine how best to use the shapelet primitive to construct classifiers. We propose a single-scan shapelet algorithm that finds the best

k

shapelets, which are used to produce a transformed dataset, where each of the

k

features represent the distance between a time series and a shapelet. The primary advantages over the embedded approach are that the transformed data can be used in conjunction with any classifier, and that there is no recursive search for shapelets. We demonstrate that the transformed data, in conjunction with more complex classifiers, gives greater accuracy than the embedded shapelet tree. We also evaluate three similarity measures that produce equivalent results to information gain in less time. Finally, we show that by conducting post-transform clustering of shapelets, we can enhance the interpretability of the transformed data. We conduct our experiments on 29 datasets: 17 from the UCR repository, and 12 we provide ourselve

Crossref

University of East Anglia digital repository

Binary Shapelet Transform for Multiclass Time Series Classification

Author: Bagnall Anthony
Bostrom Aaron
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Shapelets have recently been proposed as a new primitive for time series classification. Shapelets are subseries of series that best split the data into its classes. In the original research, shapelets were found recursively within a decision tree through enumeration of the search space. Subsequent research indicated that using shapelets as the basis for transforming datasets leads to more accurate classifiers. Both these approaches evaluate how well a shapelet splits all the classes. However, often a shapelet is most useful in distinguishing between members of the class of the series it was drawn from against all others. To assess this conjecture, we evaluate a one vs all encoding scheme. This technique simplifies the quality assessment calculations, speeds up the execution through facilitating more frequent early abandon and increases accuracy for multi-class problems. We also propose an alternative shapelet evaluation scheme which we demonstrate significantly speeds up the full search

Crossref

University of East Anglia digital repository

A Shapelet Transform for Time Series Classification

Author: Bagnall A
Davis L
Hills J
Lines J
Publication venue
Publication date: 14/08/2012
Field of study

University of East Anglia digital repository

Time series classification with ensembles of elastic distance measures

Author: A Stefan
Anthony Bagnall
H Deng
J Demšar
J Lin
J Lin
J Rodriguez
J Tanner
Jason Lines
L Breiman
M Baydogan
M Hall
PF Marteau
T Górecki
Y Jeong
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/05/2015
Field of study

Several alternative distance measures for comparing time series have recently been proposed and evaluated on time series classification (TSC) problems. These include variants of dynamic time warping (DTW), such as weighted and derivative DTW, and edit distance-based measures, including longest common subsequence, edit distance with real penalty, time warp with edit, and move–split–merge. These measures have the common characteristic that they operate in the time domain and compensate for potential localised misalignment through some elastic adjustment. Our aim is to experimentally test two hypotheses related to these distance measures. Firstly, we test whether there is any significant difference in accuracy for TSC problems between nearest neighbour classifiers using these distance measures. Secondly, we test whether combining these elastic distance measures through simple ensemble schemes gives significantly better accuracy. We test these hypotheses by carrying out one of the largest experimental studies ever conducted into time series classification. Our first key finding is that there is no significant difference between the elastic distance measures in terms of classification accuracy on our data sets. Our second finding, and the major contribution of this work, is to define an ensemble classifier that significantly outperforms the individual classifiers. We also demonstrate that the ensemble is more accurate than approaches not based in the time domain. Nearly all TSC papers in the data mining literature cite DTW (with warping window set through cross validation) as the benchmark for comparison. We believe that our ensemble is the first ever classifier to significantly outperform DTW and as such raises the bar for future work in this area

Crossref

University of East Anglia digital repository

Feature-based time-series analysis

Author: Fulcher Ben D.
Publication venue
Publication date: 01/10/2017
Field of study

This work presents an introduction to feature-based time-series analysis. The time series as a data type is first described, along with an overview of the interdisciplinary time-series analysis literature. I then summarize the range of feature-based representations for time series that have been developed to aid interpretable insights into time-series structure. Particular emphasis is given to emerging research that facilitates wide comparison of feature-based representations that allow us to understand the properties of a time-series dataset that make it suited to a particular feature-based representation or analysis algorithm. The future of time-series analysis is likely to embrace approaches that exploit machine learning methods to partially automate human learning to aid understanding of the complex dynamical patterns in the time series we measure from the world.Comment: 28 pages, 9 figure

arXiv.org e-Print Archive

Crossref