3,050 research outputs found
Categorical surrogation of agent‐based models: A comparative study of machine learning classifiers
Agent-based modelling has gained recognition in the last years because it provides a natural way to explore the behaviour of social systems. However, agent-based models usually have a considerable number of parameters that make it computationally prohibitive to explore the complete space of parameter combinations. A promising approach to overcome the computational constraints of agent-based models is the use of machine learning-based surrogates or metamodels, which can be used as efficient proxies of the original agent-based model. As the use of metamodels of agent-based simulations is still an incipient area of research, there are no guidelines on which algorithms are the most suitable candidates. In order to contribute to filling this gap, we conduct here a systematic comparative analysis to evaluate different machine learning-based approaches to agent-based model surrogation. A key innovation of our work is the focus on classification methods for categorical metamodeling, which is highly relevant because agent-based simulations are very often validated in a qualitative way. To analyse the performance of the classifiers we use three types of ndicators measures of correctness, efficiency, and robustness¿and compare their results for different datasets and sample sizes using an agent-based artificial market as a case study
Using Machine Learning as a Surrogate Model for Agent-Based Simulations
In this proof-of-concept work, we evaluate the performance of multiple machine-learning methods as surrogate models for use in the analysis of agent-based models (ABMs). Analysing agent-based modelling outputs can be challenging, as the relationships between input parameters can be non-linear or even chaotic even in relatively simple models, and each model run can require significant CPU time. Surrogate modelling, in which a statistical model of the ABM is constructed to facilitate detailed model analyses, has been proposed as an alternative to computationally costly Monte Carlo methods. Here we compare multiple machine-learning methods for ABM surrogate modelling in order to determine the approaches best suited as a surrogate for modelling the complex behaviour of ABMs. Our results suggest that, in most scenarios, artificial neural networks (ANNs) and gradient-boosted trees outperform Gaussian process surrogates, currently the most commonly used method for the surrogate modelling of complex computational models. ANNs produced the most accurate model replications in scenarios with high numbers of model runs, although training times were longer than the other methods. We propose that agent-based modelling would benefit from using machine-learning methods for surrogate modelling, as this can facilitate more robust sensitivity analyses for the models while also reducing CPU time consumption when calibrating and analysing the simulation
LIMEtree: Interactively Customisable Explanations Based on Local Surrogate Multi-output Regression Trees
Systems based on artificial intelligence and machine learning models should
be transparent, in the sense of being capable of explaining their decisions to
gain humans' approval and trust. While there are a number of explainability
techniques that can be used to this end, many of them are only capable of
outputting a single one-size-fits-all explanation that simply cannot address
all of the explainees' diverse needs. In this work we introduce a
model-agnostic and post-hoc local explainability technique for black-box
predictions called LIMEtree, which employs surrogate multi-output regression
trees. We validate our algorithm on a deep neural network trained for object
detection in images and compare it against Local Interpretable Model-agnostic
Explanations (LIME). Our method comes with local fidelity guarantees and can
produce a range of diverse explanation types, including contrastive and
counterfactual explanations praised in the literature. Some of these
explanations can be interactively personalised to create bespoke, meaningful
and actionable insights into the model's behaviour. While other methods may
give an illusion of customisability by wrapping, otherwise static, explanations
in an interactive interface, our explanations are truly interactive, in the
sense of allowing the user to "interrogate" a black-box model. LIMEtree can
therefore produce consistent explanations on which an interactive exploratory
process can be built
Bayesian calibration of stochastic agent based model via random forest
Agent-based models (ABM) provide an excellent framework for modeling
outbreaks and interventions in epidemiology by explicitly accounting for
diverse individual interactions and environments. However, these models are
usually stochastic and highly parametrized, requiring precise calibration for
predictive performance. When considering realistic numbers of agents and
properly accounting for stochasticity, this high dimensional calibration can be
computationally prohibitive. This paper presents a random forest based
surrogate modeling technique to accelerate the evaluation of ABMs and
demonstrates its use to calibrate an epidemiological ABM named CityCOVID via
Markov chain Monte Carlo (MCMC). The technique is first outlined in the context
of CityCOVID's quantities of interest, namely hospitalizations and deaths, by
exploring dimensionality reduction via temporal decomposition with principal
component analysis (PCA) and via sensitivity analysis. The calibration problem
is then presented and samples are generated to best match COVID-19
hospitalization and death numbers in Chicago from March to June in 2020. These
results are compared with previous approximate Bayesian calibration (IMABC)
results and their predictive performance is analyzed showing improved
performance with a reduction in computation
Promising and worth-to-try future directions for advancing state-of-the-art surrogates methods of agent-based models in social and health computational sciences
The execution and runtime performance of model-based analysis tools for
realistic large-scale ABMs (Agent-Based Models) can be excessively long. This
due to the computational demand exponentially proportional to the model size
(e.g. Population size) and the number of model parameters. Even the runtime of
a single simulation of a realistic ABM may demand huge computational resources
when attempting to employ realistic population size. The main aim of this
ad-hoc brief report is to highlight some of surrogate models that were adequate
and computationally less demanding for nonlinear dynamical models in various
modeling application areas.To the author knowledge, these methods have been
not, at least extensively, employed for ABMs within the field of (SHCS) Social
Health Computational Sciences, yet. Thus, they might be, but not necessarily,
useful in progressing state of the art for establishing surrogate models for
ABMs in the field of SHCS.Comment: 4 page
- …