667 research outputs found

    Semiparametric regression analysis with missing response at random

    Get PDF
    We develop inference tools in a semiparametric partially linear regression model with missing response data. A class of estimators is defined that includes as special cases: a semiparametric regression imputation estimator, a marginal average estimator and a (marginal) propensity score weighted estimator. We show that any of our class of estimators is asymptotically normal. The three special estimators have the same asymptotic variance. They achieve the semiparametric efficiency bound in the homoskedastic Gaussian case. We show that the Jackknife method can be used to consistently estimate the asymptotic variance. Our model and estimators are defined with a view to avoid the curse of dimensionality, that severely limits the applicability of existing methods. The empirical likelihood method is developed. It is shown that when missing responses are imputed using the semiparametric regression method the empirical log-likelihood is asymptotically a scaled chi-square variable. An adjusted empirical log-likelihood ratio, which is asymptotically standard chi-square, is obtained. Also, a bootstrap empirical log-likelihood ratio is derived and its distribution is used to approximate that of the imputed empirical log-likelihood ratio. A simulation study is conducted to compare the adjusted and bootstrap empirical likelihood with the normal approximation based method in terms of coverage accuracies and average lengths of confidence intervals. Based on biases and standard errors, a comparison is also made by simulation between the proposed estimators and the related estimators.

    Semiparametric Regression Analysis under Imputation for Missing Response Data

    Get PDF
    We develop inference tools in a semiparametric regression model with missing response data. A semiparametric regression imputation estimator, a marginal average estimator and a (marginal) propensity score weighted estimator are defined. All the estimators are proved to be asymptotically normal, with the same asymptotic variance. They achieve the semiparametric efficiency bound in the homoskedastic Gaussian case. We show that the Jackknife method can be used to consistently estimate the asymptotic variance. Our model and estimators are defined with a view to avoid the curse of dimensionality, and that severely limits the applicability of existing methods. The empirical likelihood method is developed. It is shown that when missing responses are imputed using the semiparametric regression method the empirical log-likelihood is asymptotically a scaled chi-square variable. An adjusted empirical log-likelihood ratio, which is asymptotically standard chi-square, is obtained. Also, a bootstrap empirical log-likelihood ratio is derived and its distribution is used to approximate that of the imputed empirical log-likelihood ratio. A simulation study is conducted to compare the adjusted and bootstrap empirical likelihood with the normal approximation-based method in terms of coverage accuracies and average lengths of confidence intervals. Based on biases and standard errors, a comparison is also made by simulation between the proposed estimators and the related estimators. Furthermore, a real data analysis is given to illustrate our methods.Asymptotic normality, empirical likelihood, semiparametric imputation.

    Estimation of Partial Linear Error-in-Variables Models with Validation Data

    Get PDF
    AbstractConsider the partial linear models of the formY=Xτβ+g(T)+e, where thep-variate explanatoryXis erroneously measured, and bothTand the responseYare measured exactly. LetXbe the surrogate variable forXwith measurement error. Let the primary data set be that containing independent observations on (Y,X,T) and the validation data set be that containing independent observations on (X,X,T), where the exact observations onXmay be obtained by some expensive or difficult procedures for only a small subset of subjects enrolled in the study. In this paper, without specifying any structure equation and the distribution assumption ofXgivenX, a semiparametric method with the primary data is employed to obtain the estimators ofβandg(·) based on the least-squares criterion with the help of validation data. The proposed estimators are proved to be strongly consistent. The asymptotic representation and the asymptotic normality of the estimator ofβare derived, respectively. The rate of the weak consistency of the estimator ofg(·) is also obtained

    Determination and estimation of optimal quarantine duration for infectious diseases with application to data analysis of COVID-19

    Full text link
    Quarantine measure is a commonly used non-pharmaceutical intervention during the outbreak of infectious diseases. A key problem for implementing quarantine measure is to determine the duration of quarantine. In this paper, a policy with optimal quarantine duration is developed. The policy suggests different quarantine durations for every individual with different characteristic. The policy is optimal in the sense that it minimizes the average quarantine duration of uninfected people with the constraint that the probability of symptom presentation for infected people attains the given value closing to 1. The optimal solution for the quarantine duration is obtained and estimated by some statistic methods with application to analyzing COVID-19 data

    Empirical Likelihood Inference over Decentralized Networks

    Full text link
    As a nonparametric statistical inference approach, empirical likelihood has been found very useful in numerous occasions. However, it encounters serious computational challenges when applied directly to the modern massive dataset. This article studies empirical likelihood inference over decentralized distributed networks, where the data are locally collected and stored by different nodes. To fully utilize the data, this article fuses Lagrange multipliers calculated in different nodes by employing a penalization technique. The proposed distributed empirical log-likelihood ratio statistic with Lagrange multipliers solved by the penalized function is asymptotically standard chi-squared under regular conditions even for a divergent machine number. Nevertheless, the optimization problem with the fused penalty is still hard to solve in the decentralized distributed network. To address the problem, two alternating direction method of multipliers (ADMM) based algorithms are proposed, which both have simple node-based implementation schemes. Theoretically, this article establishes convergence properties for proposed algorithms, and further proves the linear convergence of the second algorithm in some specific network structures. The proposed methods are evaluated by numerical simulations and illustrated with analyses of census income and Ford gobike datasets

    Estimation and confidence bands of a conditional survival function with censoring indicators missing at random

    Get PDF
    AbstractThe nonparametric estimator of the conditional survival function proposed by Beran is a useful tool to evaluate the effects of covariates in the presence of random right censoring. However, censoring indicators of right censored data may be missing for different reasons in many applications. We propose some estimators of the conditional cumulative hazard and survival functions which allow to handle this situation. We also construct the likelihood ratio confidence bands for them and obtain their asymptotic properties. Simulation studies are used to evaluate the performances of the estimators and their confidence bands

    Empirical likelihood for single-index varying-coefficient models

    Full text link
    In this paper, we develop statistical inference techniques for the unknown coefficient functions and single-index parameters in single-index varying-coefficient models. We first estimate the nonparametric component via the local linear fitting, then construct an estimated empirical likelihood ratio function and hence obtain a maximum empirical likelihood estimator for the parametric component. Our estimator for parametric component is asymptotically efficient, and the estimator of nonparametric component has an optimal convergence rate. Our results provide ways to construct the confidence region for the involved unknown parameter. We also develop an adjusted empirical likelihood ratio for constructing the confidence regions of parameters of interest. A simulation study is conducted to evaluate the finite sample behaviors of the proposed methods.Comment: Published in at http://dx.doi.org/10.3150/11-BEJ365 the Bernoulli (http://isi.cbs.nl/bernoulli/) by the International Statistical Institute/Bernoulli Society (http://isi.cbs.nl/BS/bshome.htm

    Empirical likelihood-based dimension reduction inference for linear error-in-responses models with validation study

    Full text link
    In this paper, linear errors-in-response models are considered in the presence of validation data on the responses. A semiparametric dimension reduction technique is employed to define an estimator of Ø with asymptotic normality, the estimated empirical loglikelihoods and the adjusted empirical loglikelihoods for the vector of regression coefficients and linear combinations of the regression coefficients, respectively. The estimated empirical log-likelihoods are shown to be asymptotically distributed as weighted sums of independent Χ21 and the adjusted empirical loglikelihoods are proved to be asymptotically distributed as standard chi-squares, respectively. A simulation study is conducted to compare the proposed methods in terms of coverage accuracies and average lengths of the confidence intervals

    Fitting the Smile Revisited: A Least Squares Kernel Estimator for the Implied Volatility Surface

    Full text link
    Nonparametric methods for estimating the implied volatility surface or the implied volatility smile are very popular, since they do not impose a specific functional form on the estimate. Traditionally, these methods are two-step estimators. The first step requires to extract implied volatility data from observed option prices, in the second step the actual fitting algorithm is applied. These two-step estimators may be seriously biased when option prices are observed with measurement errors. Moreover, after the nonlinear transformation of the option prices the error distribution will be complicated and less tractable. In this study, we propose a one-step estimator for the implied volatility surface based on a least squares kernel smoother of the Black-Scholes formula. Consistency and the asymptotic distribution of the estimate are provided. We demonstrate the estimator using German DAX index option data to recover the smile and the implied volatility surface
    corecore