667 research outputs found
Semiparametric regression analysis with missing response at random
We develop inference tools in a semiparametric partially linear regression model with missing response data. A class of estimators is defined that includes as special cases: a semiparametric regression imputation estimator, a marginal average estimator and a (marginal) propensity score weighted estimator. We show that any of our class of estimators is asymptotically normal. The three special estimators have the same asymptotic variance. They achieve the semiparametric efficiency bound in the homoskedastic Gaussian case. We show that the Jackknife method can be used to consistently estimate the asymptotic variance. Our model and estimators are defined with a view to avoid the curse of dimensionality, that severely limits the applicability of existing methods. The empirical likelihood method is developed. It is shown that when missing responses are imputed using the semiparametric regression method the empirical log-likelihood is asymptotically a scaled chi-square variable. An adjusted empirical log-likelihood ratio, which is asymptotically standard chi-square, is obtained. Also, a bootstrap empirical log-likelihood ratio is derived and its distribution is used to approximate that of the imputed empirical log-likelihood ratio. A simulation study is conducted to compare the adjusted and bootstrap empirical likelihood with the normal approximation based method in terms of coverage accuracies and average lengths of confidence intervals. Based on biases and standard errors, a comparison is also made by simulation between the proposed estimators and the related estimators.
Semiparametric Regression Analysis under Imputation for Missing Response Data
We develop inference tools in a semiparametric regression model with missing response data. A semiparametric regression imputation estimator, a marginal average estimator and a (marginal) propensity score weighted estimator are defined. All the estimators are proved to be asymptotically normal, with the same asymptotic variance. They achieve the semiparametric efficiency bound in the homoskedastic Gaussian case. We show that the Jackknife method can be used to consistently estimate the asymptotic variance. Our model and estimators are defined with a view to avoid the curse of dimensionality, and that severely limits the applicability of existing methods. The empirical likelihood method is developed. It is shown that when missing responses are imputed using the semiparametric regression method the empirical log-likelihood is asymptotically a scaled chi-square variable. An adjusted empirical log-likelihood ratio, which is asymptotically standard chi-square, is obtained. Also, a bootstrap empirical log-likelihood ratio is derived and its distribution is used to approximate that of the imputed empirical log-likelihood ratio. A simulation study is conducted to compare the adjusted and bootstrap empirical likelihood with the normal approximation-based method in terms of coverage accuracies and average lengths of confidence intervals. Based on biases and standard errors, a comparison is also made by simulation between the proposed estimators and the related estimators. Furthermore, a real data analysis is given to illustrate our methods.Asymptotic normality, empirical likelihood, semiparametric imputation.
Estimation of Partial Linear Error-in-Variables Models with Validation Data
AbstractConsider the partial linear models of the formY=Xτβ+g(T)+e, where thep-variate explanatoryXis erroneously measured, and bothTand the responseYare measured exactly. LetXbe the surrogate variable forXwith measurement error. Let the primary data set be that containing independent observations on (Y,X,T) and the validation data set be that containing independent observations on (X,X,T), where the exact observations onXmay be obtained by some expensive or difficult procedures for only a small subset of subjects enrolled in the study. In this paper, without specifying any structure equation and the distribution assumption ofXgivenX, a semiparametric method with the primary data is employed to obtain the estimators ofβandg(·) based on the least-squares criterion with the help of validation data. The proposed estimators are proved to be strongly consistent. The asymptotic representation and the asymptotic normality of the estimator ofβare derived, respectively. The rate of the weak consistency of the estimator ofg(·) is also obtained
Determination and estimation of optimal quarantine duration for infectious diseases with application to data analysis of COVID-19
Quarantine measure is a commonly used non-pharmaceutical intervention during
the outbreak of infectious diseases. A key problem for implementing quarantine
measure is to determine the duration of quarantine. In this paper, a policy
with optimal quarantine duration is developed. The policy suggests different
quarantine durations for every individual with different characteristic. The
policy is optimal in the sense that it minimizes the average quarantine
duration of uninfected people with the constraint that the probability of
symptom presentation for infected people attains the given value closing to 1.
The optimal solution for the quarantine duration is obtained and estimated by
some statistic methods with application to analyzing COVID-19 data
Empirical Likelihood Inference over Decentralized Networks
As a nonparametric statistical inference approach, empirical likelihood has
been found very useful in numerous occasions. However, it encounters serious
computational challenges when applied directly to the modern massive dataset.
This article studies empirical likelihood inference over decentralized
distributed networks, where the data are locally collected and stored by
different nodes. To fully utilize the data, this article fuses Lagrange
multipliers calculated in different nodes by employing a penalization
technique. The proposed distributed empirical log-likelihood ratio statistic
with Lagrange multipliers solved by the penalized function is asymptotically
standard chi-squared under regular conditions even for a divergent machine
number. Nevertheless, the optimization problem with the fused penalty is still
hard to solve in the decentralized distributed network. To address the problem,
two alternating direction method of multipliers (ADMM) based algorithms are
proposed, which both have simple node-based implementation schemes.
Theoretically, this article establishes convergence properties for proposed
algorithms, and further proves the linear convergence of the second algorithm
in some specific network structures. The proposed methods are evaluated by
numerical simulations and illustrated with analyses of census income and Ford
gobike datasets
Estimation and confidence bands of a conditional survival function with censoring indicators missing at random
AbstractThe nonparametric estimator of the conditional survival function proposed by Beran is a useful tool to evaluate the effects of covariates in the presence of random right censoring. However, censoring indicators of right censored data may be missing for different reasons in many applications. We propose some estimators of the conditional cumulative hazard and survival functions which allow to handle this situation. We also construct the likelihood ratio confidence bands for them and obtain their asymptotic properties. Simulation studies are used to evaluate the performances of the estimators and their confidence bands
Empirical likelihood for single-index varying-coefficient models
In this paper, we develop statistical inference techniques for the unknown
coefficient functions and single-index parameters in single-index
varying-coefficient models. We first estimate the nonparametric component via
the local linear fitting, then construct an estimated empirical likelihood
ratio function and hence obtain a maximum empirical likelihood estimator for
the parametric component. Our estimator for parametric component is
asymptotically efficient, and the estimator of nonparametric component has an
optimal convergence rate. Our results provide ways to construct the confidence
region for the involved unknown parameter. We also develop an adjusted
empirical likelihood ratio for constructing the confidence regions of
parameters of interest. A simulation study is conducted to evaluate the finite
sample behaviors of the proposed methods.Comment: Published in at http://dx.doi.org/10.3150/11-BEJ365 the Bernoulli
(http://isi.cbs.nl/bernoulli/) by the International Statistical
Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm
Empirical likelihood-based dimension reduction inference for linear error-in-responses models with validation study
In this paper, linear errors-in-response models are considered in the presence of validation data on the responses. A semiparametric dimension reduction technique is employed to define an estimator of Ø with asymptotic normality, the estimated empirical loglikelihoods and the adjusted empirical loglikelihoods for the vector of regression coefficients and linear combinations of the regression coefficients, respectively. The estimated empirical log-likelihoods are shown to be asymptotically distributed as weighted sums of independent Χ21 and the adjusted empirical loglikelihoods are proved to be asymptotically distributed as standard chi-squares, respectively. A simulation study is conducted to compare the proposed methods in terms of coverage accuracies and average lengths of the confidence intervals
Fitting the Smile Revisited: A Least Squares Kernel Estimator for the Implied Volatility Surface
Nonparametric methods for estimating the implied volatility surface or the implied volatility smile are very popular, since they do not impose a specific functional form on the estimate. Traditionally, these methods are two-step estimators. The first step requires to extract implied volatility data from observed option prices, in the second step the actual fitting algorithm is applied. These two-step estimators may be seriously biased when option prices are observed with measurement errors. Moreover, after the nonlinear transformation of the option prices the error distribution will be complicated and less tractable. In this study, we propose a one-step estimator for the implied volatility surface based on a least squares kernel smoother of the Black-Scholes formula. Consistency and the asymptotic distribution of the estimate are provided. We demonstrate the estimator using German DAX index option data to recover the smile and the implied volatility surface
- …