3 research outputs found

Detecting process duration drift using gamma mixture models in a left-truncated and right-censored environment

Author: Burke Kevin
Donnelly Mark
Khan Kashaf
McClean Sally
Yang Lingkai
Publication date: 12/06/2024

In a business context, process duration is the time customers spend between successive activities. This temporal perspective offers important insight into customer behaviour, highlights potential bottlenecks, and influences business management decisions. The distribution of these process durations often changes over time due to factors such as seasonality, emerging legislation, and changes to supply chains and customer demand. Referred to as concept drift, these variations pose challenges for robust process modelling, understanding, and refinement. Gamma mixture models are widely employed to model durations. The source data can, however, become left-truncated and right-censored within any specific observation window, thereby necessitating a (well-known) modification to the likelihood function. The approach reported in this paper applies this adapted likelihood across a series of observation windows, using the likelihood ratio test to identify duration changes (concept drift). Owing to its flexibility in modelling any duration distribution, the gamma mixture model was used, with the likelihood for the left-truncated and right-censored data optimized by the Nelder-Mead method. The number of gamma components was determined by the Bayesian information criterion. The proposed framework was validated on simulated exponential samples, leading to recommendations for its practical application. We then applied the methodology to three real-life event logs exhibiting diverse characteristics. Experimental results show the effectiveness of our approach in terms of data fitting, as compared to Kaplan-Meier curves, and in detecting instances of drift. This validation underscores the practical utility and reliability of our framework for dynamic business scenarios.
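
For readers unfamiliar with the likelihood modification mentioned in the abstract, the block below sketches the standard textbook form for left-truncated and right-censored durations under a K-component gamma mixture, together with one common way to set up the BIC and a two-window likelihood ratio statistic. The notation is an assumption introduced here, not taken from the paper: t_i is the observed duration, a_i is the time already elapsed when the observation window opens (0 if the case starts inside the window), and \delta_i is 1 if the case completes inside the window and 0 if it is censored at the window end. The paper's exact test construction may differ.

```latex
% Gamma mixture density and survival function (shape \alpha_k, rate \beta_k, weights \pi_k)
\begin{aligned}
f(t;\theta) &= \sum_{k=1}^{K} \pi_k \,\frac{\beta_k^{\alpha_k}}{\Gamma(\alpha_k)}\, t^{\alpha_k-1} e^{-\beta_k t},
\qquad \sum_{k=1}^{K}\pi_k = 1, \\
S(t;\theta) &= 1 - \int_0^{t} f(u;\theta)\,du .
\end{aligned}

% Log-likelihood for n left-truncated, right-censored observations (a_i, t_i, \delta_i)
\ell(\theta) = \sum_{i=1}^{n} \Big[ \delta_i \log f(t_i;\theta)
  + (1-\delta_i)\log S(t_i;\theta) - \log S(a_i;\theta) \Big]

% Model selection over K: a K-component mixture has 3K-1 free parameters
\mathrm{BIC}(K) = -2\,\ell(\hat\theta_K) + (3K-1)\log n

% Likelihood ratio statistic comparing separate fits on windows A and B
% with a single pooled fit (assumed setup, same K in all three fits)
\Lambda = 2\Big[ \ell_A(\hat\theta_A) + \ell_B(\hat\theta_B) - \ell_{A\cup B}(\hat\theta_{A\cup B}) \Big]
```

The term -\log S(a_i;\theta) conditions each case on having survived to the start of the window, which is what distinguishes this form from the purely right-censored likelihood. Under the null hypothesis of no drift and the usual regularity conditions, \Lambda in this assumed setup is compared against a chi-squared distribution with 3K-1 degrees of freedom.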

Ulster University's Research Portal

Learning correlations using the mixture-of-subsets model

Author: Agrawal R.
Aldous D. J.
Christopher Jermaine
Dhillon I. S.
Manas Somaiya
McLachlan G. J.
Nagesh H.
Sanjay Ranka
Yang J.
Publication venue: 'Association for Computing Machinery (ACM)'

Crossref