8 research outputs found

    Feature Extraction for Change-Point Detection using Stationary Subspace Analysis

    Full text link
    Detecting changes in high-dimensional time series is difficult because it involves the comparison of probability densities that need to be estimated from finite samples. In this paper, we present the first feature extraction method tailored to change point detection, which is based on an extended version of Stationary Subspace Analysis. We reduce the dimensionality of the data to the most non-stationary directions, which are most informative for detecting state changes in the time series. In extensive simulations on synthetic data we show that the accuracy of three change point detection algorithms is significantly increased by a prior feature extraction step. These findings are confirmed in an application to industrial fault monitoring.Comment: 24 pages, 20 figures, journal preprin

    FDR-control in multiscale change-point segmentation.

    No full text
    Fast multiple change-point segmentation methods, which additionally provide faithful statistical statements on the number, locations and sizes of the segments, have recently received great attention. In this paper, we propose a multiscale segmentation method, FDRSeg, which controls the false discovery rate (FDR) in the sense that the number of false jumps is bounded linearly by the number of true jumps. In this way, it adapts the detection power to the number of true jumps. We prove a non-asymptotic upper bound for its FDR in a Gaussian setting, which allows to calibrate the only parameter of FDRSeg properly. Moreover, we show that FDRSeg estimates change-point locations, as well as the signal, in a uniform sense at optimal minimax convergence rates up to a log-factor. The latter is w.r.t. Lp-risk, p≥1, over classes of step functions with bounded jump sizes and either bounded, or even increasing, number of change-points. FDRSeg can be efficiently computed by an accelerated dynamic program; its computational complexity is shown to be linear in the number of observations when there are many change-points. The performance of the proposed method is examined by comparisons with some state of the art methods on both simulated and real datasets. An R-package is available online

    Topics in Nonstationary Time Series Analysis

    Get PDF
    Several interesting applications in areas such as neuroscience, economics, finance and seismology have led to the collection nonstationary time series data wherein the statistical properties of the observed process change across time. The analysis of nonstationary time series data is an important and challenging task with useful applications. In comparison to stationarity, modeling temporal dependence in nonstationary time series is more non-trivial, and numerous methods have been proposed to tackle this problem. Stationarity in time series is more coveted than nonstationarity and many of the existing techniques attempt to transform the problem of nonstationarity to a stationary time series setting. Change point detection is one such method that attempts to find time points wherein the statistical properties of the time series changed. We develop a nonparametric method to detect multiple change points in multivariate piecewise stationary processes when the locations and number of change points are unknown. Based on a test statistic that measures differences in the spectral density matrices through the Lâ‚‚ norm, we sequentially identify points of local maxima in the test statistic and test for the significance of each of them being change points. In addition, the components responsible for the change in the covariance structure at each detected change point are identified. The asymptotic properties of the test for significant change points under the null and alternative hypothesis are derived. Another related method for handling nonstationarity is the recent technique of stationary subspace analysis (SSA) that aims at finding linear transformations of nonstationary processes that are stationary. We propose an SSA procedure for general multivariate second-order nonstationary processes. It relies on the asymptotic uncorrelatedness of the discrete Fourier transform of a stationary time series to define a measure of departure from stationarity; it is then minimized to find the stationary subspace. The dimension of the subspace is estimated using a sequential testing procedure and its asymptotic properties are discussed. We illustrate the broader applicability and better performance of our method in comparison to existing SSA methods through simulations and discuss an application in neuroeconomics. Here we apply our method to filter out noise in EEG brain signals from an economic choice task experiment. This improves prediction performance and more importantly reduces the number of trials needed from individuals in neuroeconomic experiments thereby aligning with the principle of simple and controlled designs in experimental and behavioral economics

    Topics in Nonstationary Time Series Analysis

    Get PDF
    Several interesting applications in areas such as neuroscience, economics, finance and seismology have led to the collection nonstationary time series data wherein the statistical properties of the observed process change across time. The analysis of nonstationary time series data is an important and challenging task with useful applications. In comparison to stationarity, modeling temporal dependence in nonstationary time series is more non-trivial, and numerous methods have been proposed to tackle this problem. Stationarity in time series is more coveted than nonstationarity and many of the existing techniques attempt to transform the problem of nonstationarity to a stationary time series setting. Change point detection is one such method that attempts to find time points wherein the statistical properties of the time series changed. We develop a nonparametric method to detect multiple change points in multivariate piecewise stationary processes when the locations and number of change points are unknown. Based on a test statistic that measures differences in the spectral density matrices through the Lâ‚‚ norm, we sequentially identify points of local maxima in the test statistic and test for the significance of each of them being change points. In addition, the components responsible for the change in the covariance structure at each detected change point are identified. The asymptotic properties of the test for significant change points under the null and alternative hypothesis are derived. Another related method for handling nonstationarity is the recent technique of stationary subspace analysis (SSA) that aims at finding linear transformations of nonstationary processes that are stationary. We propose an SSA procedure for general multivariate second-order nonstationary processes. It relies on the asymptotic uncorrelatedness of the discrete Fourier transform of a stationary time series to define a measure of departure from stationarity; it is then minimized to find the stationary subspace. The dimension of the subspace is estimated using a sequential testing procedure and its asymptotic properties are discussed. We illustrate the broader applicability and better performance of our method in comparison to existing SSA methods through simulations and discuss an application in neuroeconomics. Here we apply our method to filter out noise in EEG brain signals from an economic choice task experiment. This improves prediction performance and more importantly reduces the number of trials needed from individuals in neuroeconomic experiments thereby aligning with the principle of simple and controlled designs in experimental and behavioral economics

    Multiscale Change-Point Inference

    Full text link
    We introduce a new estimator SMUCE (simultaneous multiscale change-point estimator) for the change-point problem in exponential family regression. An unknown step function is estimated by minimizing the number of change-points over the acceptance region of a multiscale test at a level \alpha. The probability of overestimating the true number of change-points K is controlled by the asymptotic null distribution of the multiscale test statistic. Further, we derive exponential bounds for the probability of underestimating K. By balancing these quantities, \alpha will be chosen such that the probability of correctly estimating K is maximized. All results are even non-asymptotic for the normal case. Based on the aforementioned bounds, we construct asymptotically honest confidence sets for the unknown step function and its change-points. At the same time, we obtain exponential bounds for estimating the change-point locations which for example yield the minimax rate O(1/n) up to a log term. Finally, SMUCE asymptotically achieves the optimal detection rate of vanishing signals. We illustrate how dynamic programming techniques can be employed for efficient computation of estimators and confidence regions. The performance of the proposed multiscale approach is illustrated by simulations and in two cutting-edge applications from genetic engineering and photoemission spectroscopy
    corecore