As the Industrial Internet of Things (IIoT) grows, systems are increasingly
being monitored by arrays of sensors returning time-series data at
ever-increasing 'volume, velocity and variety' (i.e. Industrial Big Data). An
obvious use for these data is real-time systems condition monitoring and
prognostic time to failure analysis (remaining useful life, RUL). (e.g. See
white papers by Senseye.io, and output of the NASA Prognostics Center of
Excellence (PCoE).) However, as noted by Agrawal and Choudhary 'Our ability to
collect "big data" has greatly surpassed our capability to analyze it,
underscoring the emergence of the fourth paradigm of science, which is
data-driven discovery.' In order to fully utilize the potential of Industrial
Big Data we need data-driven techniques that operate at scales that process
models cannot. Here we present a prototype technique for data-driven anomaly
detection to operate at industrial scale. The method generalizes to application
with almost any multivariate dataset based on independent ordinations of
repeated (bootstrapped) partitions of the dataset and inspection of the joint
distribution of ordinal distances.Comment: 9 pages; 11 figure