700 research outputs found
Sequential Cross-Validated Bandwidth Selection Under Dependence and Anscombe-Type Extensions to Random Time Horizons
To detect changes in the mean of a time series, one may use previsible
detection procedures based on nonparametric kernel prediction smoothers which
cover various classic detection statistics as special cases. Bandwidth
selection, particularly in a data-adaptive way, is a serious issue and not well
studied for detection problems. To ensure data adaptation, we select the
bandwidth by cross-validation, but in a sequential way leading to a functional
estimation approach. This article provides the asymptotic theory for the method
under fairly weak assumptions on the dependence structure of the error terms,
which cover, e.g., GARCH() processes, by establishing (sequential)
functional central limit theorems for the cross-validation objective function
and the associated bandwidth selector. It turns out that the proof can be based
in a neat way on \cite{KurtzProtter1996}'s results on the weak convergence of
\ito integrals and a diagonal argument.
Our gradual change-point model covers multiple change-points in that it
allows for a nonlinear regression function after the first change-point
possibly with further jumps and Lipschitz continuous between those
discontinuities.
In applications, the time horizon where monitoring stops latest is often
determined by a random experiment, e.g. a first-exit stopping time applied to a
cumulated cost process or a risk measure, possibly stochastically dependent
from the monitored time series. Thus, we also study that case and establish
related limit theorems in the spirit of \citet{Anscombe1952}'s result. The
result has various applications including statistical parameter estimation and
monitoring financial investment strategies with risk-controlled early
termination, which are briefly discussed
Efficient estimation of generalized additive nonparametric regression models.
We define new procedures for estimating generalized additive nonparametric regression models that are more efficient than the Linton and Härdle (1996, Biometrika 83, 529–540) integration-based method and achieve certain oracle bounds. We consider criterion functions based on the Linear exponential family, which includes many important special cases. We also consider the extension to multiple parameter models like the gamma distribution and to models for conditional heteroskedasticity.
Estimation of multiple-regime regressions with least absolutes deviation
This paper considers least absolute deviations estimation of a regression model with multiple change points occurring at unknown times. Some asymptotic results, including rates of convergence and asymptotic distributions, for the estimated change points and the estimated regression coefficient are derived. Results are obtained without assuming that each regime spans a positive fraction of the sample size. In addition, the number of change points is allowed to grow as the sample size increases. Estimation of the number of change points is also considered. A feasible computational algorithm is developed. An application is also given, along with some monte carlo simulations.Multiple change points, multiple-regime regressions, least absolute deviation, asymptotic distribution
Input variable selection in time-critical knowledge integration applications: A review, analysis, and recommendation paper
This is the post-print version of the final paper published in Advanced Engineering Informatics. The published article is available from the link below. Changes resulting from the publishing process, such as peer review, editing, corrections, structural formatting, and other quality control mechanisms may not be reflected in this document. Changes may have been made to this work since it was submitted for publication. Copyright @ 2013 Elsevier B.V.The purpose of this research is twofold: first, to undertake a thorough appraisal of existing Input Variable Selection (IVS) methods within the context of time-critical and computation resource-limited dimensionality reduction problems; second, to demonstrate improvements to, and the application of, a recently proposed time-critical sensitivity analysis method called EventTracker to an environment science industrial use-case, i.e., sub-surface drilling.
Producing time-critical accurate knowledge about the state of a system (effect) under computational and data acquisition (cause) constraints is a major challenge, especially if the knowledge required is critical to the system operation where the safety of operators or integrity of costly equipment is at stake. Understanding and interpreting, a chain of interrelated events, predicted or unpredicted, that may or may not result in a specific state of the system, is the core challenge of this research. The main objective is then to identify which set of input data signals has a significant impact on the set of system state information (i.e. output). Through a cause-effect analysis technique, the proposed technique supports the filtering of unsolicited data that can otherwise clog up the communication and computational capabilities of a standard supervisory control and data acquisition system.
The paper analyzes the performance of input variable selection techniques from a series of perspectives. It then expands the categorization and assessment of sensitivity analysis methods in a structured framework that takes into account the relationship between inputs and outputs, the nature of their time series, and the computational effort required. The outcome of this analysis is that established methods have a limited suitability for use by time-critical variable selection applications. By way of a geological drilling monitoring scenario, the suitability of the proposed EventTracker Sensitivity Analysis method for use in high volume and time critical input variable selection problems is demonstrated.E
- …