8 research outputs found
A framework for automated anomaly detection in high frequency water-quality data from in situ sensors
River water-quality monitoring is increasingly conducted using automated in
situ sensors, enabling timelier identification of unexpected values. However,
anomalies caused by technical issues confound these data, while the volume and
velocity of data prevent manual detection. We present a framework for automated
anomaly detection in high-frequency water-quality data from in situ sensors,
using turbidity, conductivity and river level data. After identifying end-user
needs and defining anomalies, we ranked their importance and selected suitable
detection methods. High priority anomalies included sudden isolated spikes and
level shifts, most of which were classified correctly by regression-based
methods such as autoregressive integrated moving average models. However, using
other water-quality variables as covariates reduced performance due to complex
relationships among variables. Classification of drift and periods of
anomalously low or high variability improved when we replaced anomalous
measurements with forecasts, but this inflated false positive rates.
Feature-based methods also performed well on high priority anomalies, but were
less proficient at detecting lower priority anomalies, resulting in high
false negative rates. Unlike regression-based methods, all feature-based
methods produced low false positive rates and did not require training or
optimization. Rule-based methods successfully detected impossible values and
missing observations. Thus, we recommend using a combination of methods to
improve anomaly detection performance, whilst minimizing false detection rates.
Furthermore, our framework emphasizes the importance of communication between
end-users and analysts for optimal outcomes with respect to both detection
performance and end-user needs. Our framework is applicable to other types of
high frequency time-series data and anomaly detection applications.
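The regression-based detection of sudden isolated spikes described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: in place of a full ARIMA model it uses a naive one-step persistence forecast, flags a measurement when its residual deviates from recent residuals by more than `k` rolling standard deviations, and applies the mitigation step by replacing a flagged value with its forecast before forecasting onward. All names and thresholds are illustrative.

```python
# Sketch of regression-based spike detection with forecast mitigation.
# A real implementation would fit an ARIMA model; here a naive one-step
# persistence forecast stands in for illustration.
from statistics import mean, stdev

def detect_spikes(series, window=10, k=4.0):
    """Return indices of sudden isolated spikes in `series`."""
    clean = list(series)   # series with flagged values replaced by forecasts
    anomalies = []
    residuals = []
    for t in range(1, len(series)):
        forecast = clean[t - 1]            # naive one-step-ahead forecast
        resid = series[t] - forecast
        recent = residuals[-window:]
        if len(recent) >= 3:
            sigma = stdev(recent)
            if sigma > 0 and abs(resid - mean(recent)) > k * sigma:
                anomalies.append(t)
                clean[t] = forecast        # mitigation: replace the anomaly
                residuals.append(mean(recent))  # keep sigma uninflated
                continue
        residuals.append(resid)
    return anomalies

# A smooth turbidity-like series with one injected spike at index 7.
data = [10.0, 10.2, 10.1, 10.3, 10.2, 10.4, 10.3, 50.0, 10.2, 10.3, 10.1]
print(detect_spikes(data))  # → [7]
```

Because the flagged value is replaced by its forecast, the point immediately after the spike is compared against a sensible baseline rather than the spike itself, which is the mitigation behaviour the abstract describes.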
Forecasting: theory and practice
Forecasting has always been at the forefront of decision making and planning.
The uncertainty that surrounds the future is both exciting and challenging,
with individuals and organisations seeking to minimise risks and maximise
utilities. The lack of a free-lunch theorem implies the need for a diverse set
of forecasting methods to tackle an array of applications. This unique article
provides a non-systematic review of the theory and the practice of forecasting.
We offer a wide range of theoretical, state-of-the-art models, methods,
principles, and approaches to prepare, produce, organise, and evaluate
forecasts. We then demonstrate how such theoretical concepts are applied in a
variety of real-life contexts, including operations, economics, finance,
energy, environment, and social good. We do not claim that this review is an
exhaustive list of methods and applications. The list was compiled based on the
expertise and interests of the authors. However, we wish that our encyclopedic
presentation will offer a point of reference for the rich work that has been
undertaken over the last decades, with some key insights for the future of
forecasting theory and practice.
A Feature-Based Procedure for Detecting Technical Outliers in Water-Quality Data From In Situ Sensors
Outliers due to technical errors in water-quality data from in situ sensors can reduce data quality and have a direct impact on inference drawn from subsequent data analysis. However, outlier detection through manual monitoring is infeasible given the volume and velocity of data the sensors produce. Here we introduce an automated procedure, named oddwater, that provides early detection of outliers in water-quality data from in situ sensors caused by technical issues. The oddwater procedure first identifies the data features that differentiate outlying instances from typical behaviors. Statistical transformations are then applied to make the outlying instances stand out in a transformed data space. Unsupervised outlier scoring techniques are applied to the transformed data space, and an approach based on extreme value theory is used to calculate a threshold for each potential outlier. Using two data sets obtained from in situ sensors in rivers flowing into the Great Barrier Reef lagoon, Australia, we show that oddwater successfully identifies outliers involving abrupt changes in turbidity, conductivity, and river level, including sudden spikes, sudden isolated drops, and level shifts, while maintaining very low false detection rates. We have implemented the oddwater procedure in the open source R package oddwater.
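The three-step pipeline described in this abstract (statistical transformation, unsupervised outlier scoring, thresholding) can be sketched as below. This is not the oddwater package's actual implementation: the feature is a one-step difference of log-transformed values, the score is a k-nearest-neighbour distance, and a simple median-multiple cutoff stands in for the extreme-value-theory threshold used in the paper. All function names and parameters are illustrative.

```python
# Sketch of a feature-based outlier procedure in the spirit of oddwater:
# transform, score with an unsupervised method, then threshold the scores.
import math
from statistics import median

def features(series):
    """One-step differences of log-transformed values: abrupt changes stand out."""
    logs = [math.log(v) for v in series]
    return [logs[i] - logs[i - 1] for i in range(1, len(logs))]

def knn_scores(feats, k=3):
    """Outlier score = distance to the k-th nearest neighbour in feature space."""
    scores = []
    for i, x in enumerate(feats):
        dists = sorted(abs(x - y) for j, y in enumerate(feats) if j != i)
        scores.append(dists[k - 1])
    return scores

def flag_outliers(series, k=3, factor=10.0):
    feats = features(series)
    scores = knn_scores(feats, k)
    cutoff = factor * median(scores)  # crude stand-in for the EVT threshold
    # feature i describes the change from series[i] to series[i + 1]
    return [i + 1 for i, s in enumerate(scores) if s > cutoff]

# A turbidity-like series with one sudden isolated spike at index 5.
turbidity = [5.1, 5.0, 5.2, 5.1, 5.3, 40.0, 5.2, 5.1, 5.0, 5.2, 5.1, 5.3]
print(flag_outliers(turbidity))  # → [5, 6]
```

A sudden isolated spike produces two extreme changes (up then down), so both surrounding transitions are flagged; a level shift would instead produce a single extreme change, which is one way the transformed feature space separates the two anomaly types.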
A framework for automated anomaly detection in high frequency water-quality data from in situ sensors
Monitoring the water quality of rivers is increasingly conducted using automated in situ sensors, enabling timelier identification of unexpected values or trends. However, the data are confounded by anomalies caused by technical issues, for which the volume and velocity of data preclude manual detection. We present a framework for automated anomaly detection in high-frequency water-quality data from in situ sensors, using turbidity, conductivity and river level data collected from rivers flowing into the Great Barrier Reef. After identifying end-user needs and defining anomalies, we ranked anomaly importance and selected suitable detection methods. High priority anomalies included sudden isolated spikes and level shifts, most of which were classified correctly by regression-based methods such as autoregressive integrated moving average models. However, incorporation of multiple water-quality variables as covariates reduced performance due to complex relationships among variables. Classifications of drift and periods of anomalously low or high variability were more often correct when we applied mitigation, which replaces anomalous measurements with forecasts for further forecasting, but this inflated false positive rates. Feature-based methods also performed well on high priority anomalies and were similarly less proficient at detecting lower priority anomalies, resulting in high false negative rates. Unlike regression-based methods, however, all feature-based methods produced low false positive rates and have the benefit of not requiring training or optimization. Rule-based methods successfully detected a subset of lower priority anomalies, specifically impossible values and missing observations. We therefore suggest that a combination of methods will provide optimal performance in terms of correct anomaly detection, whilst minimizing false detection rates. 
Furthermore, our framework emphasizes the importance of communication between end-users and anomaly detection developers for optimal outcomes with respect to both detection performance and end-user application. To this end, our framework has high transferability to other types of high frequency time-series data and anomaly detection applications.
Forecasting: theory and practice
Forecasting has always been at the forefront of decision making and planning.
The uncertainty that surrounds the future is both exciting and challenging,
with individuals and organisations seeking to minimise risks and maximise
utilities. The large number of forecasting applications calls for a diverse set
of forecasting methods to tackle real-life challenges. This article provides a
non-systematic review of the theory and the practice of forecasting. We provide
an overview of a wide range of theoretical, state-of-the-art models, methods,
principles, and approaches to prepare, produce, organise, and evaluate
forecasts. We then demonstrate how such theoretical concepts are applied in a
variety of real-life contexts.
We do not claim that this review is an exhaustive list of methods and
applications. However, we wish that our encyclopedic presentation will offer a
point of reference for the rich work that has been undertaken over the last
decades, with some key insights for the future of forecasting theory and
practice. Given its encyclopedic nature, the intended mode of reading is
non-linear. We offer cross-references to allow the readers to navigate through
the various topics. We complement the theoretical concepts and applications
covered by large lists of free or open-source software implementations and
publicly-available databases.