168 research outputs found

    Imputation of continuous variables missing at random using the method of simulated scores

    Get PDF
    For multivariate datasets with missing values, we present a procedure of statistical inference and state its "optimal" properties. Two main assumptions are needed: (1) data are missing at random (MAR); (2) the data generating process is a multivariate normal linear regression. Disentangling the problem of convergence of the iterative estimation/imputation procedure, we show that the estimator is a "method of simulated scores" (a particular case of McFadden's "method of simulated moments"); thus the estimator is equivalent to maximum likelihood if the number of replications is conveniently large, and the whole procedure can be considered an optimal parametric technique for imputation of missing data.Simulates scores; missing data; estimation/imputation; structural form; reduced form

    Poor identification and estimation problems in panel data models with random effects and autocorrelated errors

    Get PDF
    A dramatically large number of corner solutions occur when estimating by (Gaussian) maximum likelihood a simple model for panel data with random effects and autocorrelated errors. This can invalidate results of applications to panel data with a short time dimension, even in a correctly specified model. We explain this unpleasant effect (usually underestimated, almost ignored in the literature) showing that the expected log-likelihood is nearly flat, thus rising problems of poor identification.panel data, maximum likelihood, identification.

    Negative variance estimates in panel data models

    Get PDF
    Negative values for estimated variances can arise in a panel data context. Empirical and theoretical literature dismisses the problem as not serious and a practical solution is to replace negative variances by its boundary value, i.e. zero. While this is not a concern when the individual variance components is "small" with respect to idiosyncratic variance component (making it indistinguishable from zero in practice), we claim that a negative estimated variance can also arise with a "large" individual variance component, when the orthogonality condition between the individual effects and regressors fails. Estimation problems are considered in the (feasible) generalized least squares and maximum likelihood frameworks.Panel data, random effect estimation, negative variances, maximum likelihood

    Identification of linear panel data models when instruments are not available

    Get PDF
    One of the major virtues of panel data models is the possibility to control for unobserved and unobservable heterogeneity at the unit (individual, firm, sector...) level, even when this is correlated with the variables included on the right hand side of the equation. By assuming an additive error structure, identification of the model parameters spans from transformations of the data that wipe out the individual component. We propose an alternative identification strategy, where the equation of interest is embedded in a structural system that properly accounts for the endogeneity of the variables on the right hand side (without distinguishing correlation with the individual component or the idiosyncratic term). We show that, under certain conditions, the system is identified even in the case where no exogenous variable is available, due to the presence of cross-equation restrictions. Estimation of the model parameters can rely on an iterated Zellner-type estimator, with remarkable performance gains over traditional GMM approaches.panel data, identification, cross-equation restrictions

    Moment Conditions and Neglected Endogeneity in Panel Data Models

    Get PDF
    This paper develops a new moment condition for estimation of linear panel data models. When added to the set of instruments devised by Anderson, Hsiao (1981, 1982) for the dynamic model, the proposed approach can outperform the GMM methods customarily employed for estimation. The proposal builds on the properties of the iterated GLS, that, contrary to conventional wisdom, can lead to a consistent estimator in particular cases where endogeneity of the explanatory variables is neglected. The targets achieved are a reduction in the number of moment conditions and a better performance over the most widely adopted techniques.panel data, dynamic model, GMM estimation, endogeneity

    Autocorrelation and masked heterogeneity in panel data models estimated by maximum likelihood

    Get PDF
    In a panel data model with random effects, when autocorrelation in the error is considered, (Gaussian) maximum likelihood estimation produces a dramatically large number of corner solutions: the variance of the random effect appears (incorrectly) to be zero, and a larger autocorrelation is (incorrectly) assigned to the idiosyncratic component. Thus heterogeneity could (incorrectly) be lost in applications to panel data with customarily available time dimension, even in a correctly specified model. The problem occurs in linear as well as nonlinear models. This paper aims at pointing out how serious this problem can be (largely neglected by the panel data literature). A set of Monte Carlo experiments is conducted to highlight its relevance, and we explain this unpleasant effect showing that, along a direction, the expected log-likelihood is nearly flat. We also provide two examples of applications with corner solutions.Panel data, autocorrelation, random effects, maximum likelihood, expected log-likelihood

    A sĆ­ncope das postĆ“nicas mediais na variedade urbana do portuguĆŖs de MoƧambique: uma anĆ”lise sociolinguĆ­stica

    Get PDF
    IndisponĆ­ve

    Imputation of continuous variables missing at random using the method of simulated scores

    Get PDF
    For multivariate datasets with missing values, we present a procedure of statistical inference and state its "optimal" properties. Two main assumptions are needed: (1) data are missing at random (MAR); (2) the data generating process is a multivariate normal linear regression. Disentangling the problem of convergence of the iterative estimation/imputation procedure, we show that the estimator is a "method of simulated scores" (a particular case of McFadden's "method of simulated moments"); thus the estimator is equivalent to maximum likelihood if the number of replications is conveniently large, and the whole procedure can be considered an optimal parametric technique for imputation of missing data

    Simulation-based estimation of Tobit model with random effects

    Get PDF
    The estimation of limited dependent variable panel data models usually involves objective functions in which integrals appear without a closed form solution: this is the case of the panel data Tobit model with random effects. Recently, simulation methods have shown to be useful in the inference process, as they offer methods to approximate such integrals (Laroque, Salanie, 1989; GouriĀ“eroux, Monfort, 1991, 1993; Hajivassiliou, McFadden, 1998; Mealli, Rampichini, 1999; Inkmann, 2000). Although the asymptotic performances of such methods are known and their application has been successfully undertaken, more precise ideas on their finite sample performance and computational efficiency is still needed. In this paper we propose to use the method of indirect inference, using different auxiliary models, and the simulated maximum likelihood to estimate these models. We use a panel data Tobit model with a simple correlation structure in the unobservables (i.e. a one-factor structure), but the model could be easily extended. Using both simulated and real data, we show the perfomances of the proposed methods in finite samples. The application on real data is concerned with a model of female labour supply

    Simulation-based estimation of Tobit model with random effects

    Get PDF
    The estimation of limited dependent variable panel data models usually involves objective functions in which integrals appear without a closed form solution: this is the case of the panel data Tobit model with random effects. Recently, simulation methods have shown to be useful in the inference process, as they offer methods to approximate such integrals (Laroque, Salanie, 1989; GouriĀ“eroux, Monfort, 1991, 1993; Hajivassiliou, McFadden, 1998; Mealli, Rampichini, 1999; Inkmann, 2000). Although the asymptotic performances of such methods are known and their application has been successfully undertaken, more precise ideas on their finite sample performance and computational efficiency is still needed. In this paper we propose to use the method of indirect inference, using different auxiliary models, and the simulated maximum likelihood to estimate these models. We use a panel data Tobit model with a simple correlation structure in the unobservables (i.e. a one-factor structure), but the model could be easily extended. Using both simulated and real data, we show the perfomances of the proposed methods in finite samples. The application on real data is concerned with a model of female labour supply
    • ā€¦
    corecore