We consider the problem of sequentially choosing observation regions along a line, with an aim of maximising the detection of events of interest. Such a problem may arise when monitoring the movements of endangered or migratory species, detecting crossings of a border, policing activities at sea, and in many other settings. In each case, the key operational challenge is to learn an allocation of surveillance resources which maximises successful detection of events of interest. We present a combinatorial multi-armed bandit model with Poisson rewards and a novel filtered feedback mechanism - arising from the failure to detect certain intrusions - where reward distributions are dependent on the actions selected. Our solution method is an upper confidence bound approach and we derive upper and lower bounds on its expected performance. We prove that the gap between these bounds is of constant order, and demonstrate empirically that our approach is more reliable in simulated problems than competing algorithms

Glazebrook, Kevin

Grant, James A.

Leslie, David S.

Letchford, Adam

Szechtman, Roberto

arXiv

The article of record as published may be found at https://doi.org/10.1016/j.ejor.2019.11.004Supplementary material associated with this article can be found, in the online version, at doi:10.1016/j.ejor.2019.11.004We consider the problem of sequentially choosing observation regions along a line, with an aim of maximising the detection of events of interest. Such a problem may arise when monitoring the movements of endangered or migratory species, detecting crossings of a border, policing activities at sea, and in many other settings. In each case, the key operational challenge is to learn an allocation of surveillance resources which maximises successful detection of events of interest. We present a combinatorial multiarmed bandit model with Poisson rewards and a novel filtered feedback mechanism arising from the failure to detect certain intrusions where reward distributions are dependent on the actions selected. Our solution method is an upper confidence bound approach and we derive upper and lower bounds on its expected performance. We prove that the gap between these bounds is of constant order, and demonstrate empirically that our approach is more reliable in simulated problems than competing algorithms.EPSRC funded EP/L015692/1 STOR-i Centre for Doctoral Training.EPSRC funded EP/L015692/1 STOR-i Centre for Doctoral Training

Grant, J.A.

Leslie, D.S.

Glazebrook, K.

Szechtman, R.

Letchford, A.N.

Calhoun, Institutional Archive of the Naval Postgraduate School

Adaptive policies for perimeter surveillance problems

The article of record as published may be found at https://doi.org/10.1016/j.ejor.2019.11.004We consider the problem of sequentially choosing observation regions along a line, with an aim of maximising the detection of events of interest. Such a problem may arise when monitoring the movements of
endangered or migratory species, detecting crossings of a border, policing activities at sea, and in many
other settings. In each case, the key operational challenge is to learn an allocation of surveillance resources which maximises successful detection of events of interest. We present a combinatorial multiarmed bandit model with Poisson rewards and a novel filtered feedback mechanism - arising from the
failure to detect certain intrusions - where reward distributions are dependent on the actions selected.
Our solution method is an upper confidence bound approach and we derive upper and lower bounds on
its expected performance. We prove that the gap between these bounds is of constant order, and demonstrate empirically that our approach is more reliable in simulated problems than competing algorithms.EPSRCEP/L015692/

Letchford, Adam N.

English

Lancaster E-Prints

https://calhoun.nps.edu/bitstream/handle/10945/65150/Adaptive_policies_for_perimeter_surveillance_problems.pdf?sequence=1

Adaptive policies for perimeter surveillance problems

Abstract

Similar works

Full text

Available Versions

Calhoun, Institutional Archive of the Naval Postgraduate School

Calhoun, Institutional Archive of the Naval Postgraduate School

Lancaster E-Prints