3 research outputs found
The Influence of Global Constraints on Similarity Measures for Time-Series Databases
A time series consists of a series of values or events obtained over repeated
measurements in time. Analysis of time series represents and important tool in
many application areas, such as stock market analysis, process and quality
control, observation of natural phenomena, medical treatments, etc. A vital
component in many types of time-series analysis is the choice of an appropriate
distance/similarity measure. Numerous measures have been proposed to date, with
the most successful ones based on dynamic programming. Being of quadratic time
complexity, however, global constraints are often employed to limit the search
space in the matrix during the dynamic programming procedure, in order to speed
up computation. Furthermore, it has been reported that such constrained
measures can also achieve better accuracy. In this paper, we investigate two
representative time-series distance/similarity measures based on dynamic
programming, Dynamic Time Warping (DTW) and Longest Common Subsequence (LCS),
and the effects of global constraints on them. Through extensive experiments on
a large number of time-series data sets, we demonstrate how global constrains
can significantly reduce the computation time of DTW and LCS. We also show
that, if the constraint parameter is tight enough (less than 10-15% of
time-series length), the constrained measure becomes significantly different
from its unconstrained counterpart, in the sense of producing qualitatively
different 1-nearest neighbor graphs. This observation explains the potential
for accuracy gains when using constrained measures, highlighting the need for
careful tuning of constraint parameters in order to achieve a good trade-off
between speed and accuracy
Development of Predictive Analytics for Demand Forecasting and Inventory Management in Supply Chain using Machine Learning Techniques
Forecasting demand effectively and managing inventories efficiently are critical components of modern supply chain management. By understanding full scope of demand possibilities, businesses gain ability to fine-tune inventory levels, navigate situations involving stockouts and overstock, and move toward a more resilient and precise supply chain. This thesis focuses on strategies to enhance these critical functions.
We start with examining impact of customer segmentation on forecasting precision by introducing a novel cluster-based demand forecasting framework that harnesses ensemble learning techniques. Our results showcase the effectiveness of the clustered-ensembled approach with minimal forecast errors. However, the constraints related to data availability and segmentation indicate areas that warrant further investigation in future research.
The significance of demand accuracy becomes most apparent when we consider its impact on safety stock. In second objective, we explore multivariate time series forecasting for optimal safety stock and inventory management, utilizing deep learning models and a cost optimization framework. This strategy outperforms individual models, demonstrating enhanced forecasting accuracy and stability across diverse product domains. Calculating safety stock based on proposed demand prediction framework leads to optimized safety stock levels. This not only prevents costly stockouts but also minimizes surplus inventory, resulting in reduced overall holding costs and improved inventory efficiency.
Although the first two objectives provided optimized results, relying on point predictions to calculate safety stock is not ideal. Unlike traditional point forecasting, distribution forecasting aims to cover the entire range of potential demand outcomes, essentially creating a comprehensive map of possibilities. The third objective of this thesis introduces recurrent mixture density networks (RMDNs) for refined distribution demand forecasting and safety stock estimation. These innovative models consistently outperform traditional LSTM models, offering more precise stockout and overstock predictions. This approach not only reduces inventory costs but also enhances supply chain efficiency.
In summary, this thesis provides valuable insights and methodologies for businesses aiming to enhance demand forecasting accuracy and optimize inventory management practices in the retail industry. By leveraging customer segmentation, ensemble deep learning, and distribution forecasting techniques, organizations can enhance decision-making processes, reduce operational costs, and thrive in the dynamic landscape of supply chain operations
Uloga mera sličnosti u analizi vremenskih serija
The subject of this dissertation encompasses a comprehensive overview and analysis of the impact of Sakoe-Chiba global constraint on the most commonly used elastic similarity measures in the field of time-series data mining with a focus on classification accuracy. The choice of similarity measure is one of the most significant aspects of time-series analysis - it should correctly reflect the resemblance between the data presented in the form of time series. Similarity measures represent a critical component of many tasks of mining time series, including: classification, clustering, prediction, anomaly detection, and others. The research covered by this dissertation is oriented on several issues: 1. review of the effects of global constraints on the performance of computing similarity measures, 2. a detailed analysis of the influence of constraining the elastic similarity measures on the accuracy of classical classification techniques, 3. an extensive study of the impact of different weighting schemes on the classification of time series, 4. development of an open source library that integrates the main techniques and methods required for analysis and mining time series, and which is used for the realization of these experimentsPredmet istraživanja ove disertacije obuhvata detaljan pregled i analizu uticaja Sakoe-Chiba globalnog ograničenja na najčešće korišćene elastične mere sličnosti u oblasti data mining-a vremenskih serija sa naglaskom na tačnost klasifikacije. Izbor mere sličnosti jedan je od najvažnijih aspekata analize vremenskih serija - ona treba verno reflektovati sličnost između podataka prikazanih u obliku vremenskih serija. Mera sličnosti predstavlјa kritičnu komponentu mnogih zadataka mining-a vremenskih serija, uklјučujući klasifikaciju, grupisanje (eng. clustering), predviđanje, otkrivanje anomalija i drugih. Istraživanje obuhvaćeno ovom disertacijom usmereno je na nekoliko pravaca: 1. pregled efekata globalnih ograničenja na performanse računanja mera sličnosti, 2. detalјna analiza posledice ograničenja elastičnih mera sličnosti na tačnost klasifikacije klasičnih tehnika klasifikacije, 3. opsežna studija uticaj različitih načina računanja težina (eng. weighting scheme) na klasifikaciju vremenskih serija, 4. razvoj biblioteke otvorenog koda (Framework for Analysis and Prediction - FAP) koja će integrisati glavne tehnike i metode potrebne za analizu i mining vremenskih serija i koja je korišćena za realizaciju ovih eksperimenata.Predmet istraživanja ove disertacije obuhvata detaljan pregled i analizu uticaja Sakoe-Chiba globalnog ograničenja na najčešće korišćene elastične mere sličnosti u oblasti data mining-a vremenskih serija sa naglaskom na tačnost klasifikacije. Izbor mere sličnosti jedan je od najvažnijih aspekata analize vremenskih serija - ona treba verno reflektovati sličnost između podataka prikazanih u obliku vremenskih serija. Mera sličnosti predstavlja kritičnu komponentu mnogih zadataka mining-a vremenskih serija, uključujući klasifikaciju, grupisanje (eng. clustering), predviđanje, otkrivanje anomalija i drugih. Istraživanje obuhvaćeno ovom disertacijom usmereno je na nekoliko pravaca: 1. pregled efekata globalnih ograničenja na performanse računanja mera sličnosti, 2. detaljna analiza posledice ograničenja elastičnih mera sličnosti na tačnost klasifikacije klasičnih tehnika klasifikacije, 3. opsežna studija uticaj različitih načina računanja težina (eng. weighting scheme) na klasifikaciju vremenskih serija, 4. razvoj biblioteke otvorenog koda (Framework for Analysis and Prediction - FAP) koja će integrisati glavne tehnike i metode potrebne za analizu i mining vremenskih serija i koja je korišćena za realizaciju ovih eksperimenata