41 research outputs found
Semi-Parametric Empirical Best Prediction for small area estimation of unemployment indicators
The Italian National Institute for Statistics regularly provides estimates of
unemployment indicators using data from the Labor Force Survey. However, direct
estimates of unemployment incidence cannot be released for Local Labor Market
Areas. These are unplanned domains defined as clusters of municipalities; many
are out-of-sample areas and the majority is characterized by a small sample
size, which render direct estimates inadequate. The Empirical Best Predictor
represents an appropriate, model-based, alternative. However, for non-Gaussian
responses, its computation and the computation of the analytic approximation to
its Mean Squared Error require the solution of (possibly) multiple integrals
that, generally, have not a closed form. To solve the issue, Monte Carlo
methods and parametric bootstrap are common choices, even though the
computational burden is a non trivial task. In this paper, we propose a
Semi-Parametric Empirical Best Predictor for a (possibly) non-linear mixed
effect model by leaving the distribution of the area-specific random effects
unspecified and estimating it from the observed data. This approach is known to
lead to a discrete mixing distribution which helps avoid unverifiable
parametric assumptions and heavy integral approximations. We also derive a
second-order, bias-corrected, analytic approximation to the corresponding Mean
Squared Error. Finite sample properties of the proposed approach are tested via
a large scale simulation study. Furthermore, the proposal is applied to
unit-level data from the 2012 Italian Labor Force Survey to estimate
unemployment incidence for 611 Local Labor Market Areas using auxiliary
information from administrative registers and the 2011 Census
Semiparametric M-quantile Regression for count data
Lung cancer incidence over 2005–2010 for 326 Local Authority Districts in England is investigated by ecological regression. Motivated from mis-specification of a Negative Binomial additive model, a semiparametric Negative Binomial M-quantile regression model is introduced. The additive part relates to those univariate or bivariate smoothing components, which are included in the model to capture nonlinearities in the predictor or to account for spatial dependence. All such components are estimated using penalized splines. The results show the capability of the semiparametric Negative Binomial M-quantile regression model to handle data with a strong spatial structure
Bayesian Ideas in Survey Sampling: The Legacy of Basu
Survey sampling and, more generally, Official Statistics are experiencing an
important renovation time. On one hand, there is the need to exploit the
huge information potentiality that the digital revolution made available in
terms of data. On the other hand, this process occurred simultaneously with
a progressive deterioration of the quality of classical sample surveys, due
to a decreasing willingness to participate and an increasing rate of missing
responses. The switch from survey-based inference to a hybrid system involv-
ing register-based information has made more stringent the debate and the
possible resolution of the design-based versus model-based approaches con-
troversy. In this new framework, the use of statistical models seems unavoid-
able and it is today a relevant part of the official statistician toolkit. Models
are important in several different contexts, from Small area estimation to
non sampling error adjustment, but they are also crucial for correcting bias
due to over and undercoverage of administrative data, in order to prevent
potential selection bias, and to deal with different definitions and/or errors in
the measurement process of the administrative sources. The progressive shift
from a design-based to a model-based approach in terms of super-population
is a matter of fact in the practice of the National Statistical Institutes. How-
ever, the introduction of Bayesian ideas in official statistics still encounters
difficulties and resistance. In this work, we attempt a non-systematic review
of the Bayesian development in this area and try to highlight the extra ben-
efit that a Bayesian approach might provide. Our general conclusion is that,
while the general picture is today clear and most of the basic topics of survey
sampling can be easily rephrased and tackled from a Bayesian perspective,
much work is still necessary for the availability of a ready-to-use platform
of Bayesian survey sampling in the presence of complex sampling design,
non-ignorable missing data patterns, and large datasets
Multinomial logistic estimation in dual frame surveys
We consider estimation techniques from dual frame surveys in the case of estimation of proportions when the variable of interest has multinomial outcomes. We propose to describe the joint distribution of the class indicators by a multinomial logistic model. Logistic generalized regression estimators and model calibration estimators are introduced for class frequencies in a population. Theoretical asymptotic properties of the proposed estimators are shown and discussed. Monte Carlo experiments are also carried out to compare the efficiency of the proposed procedures for finite size samples and in the presence of different sets of auxiliary variables. The simulation studies indicate that the multinomial logistic formulation yields better results than the classical estimators that implicitly assume individual linear models for the variables. The proposed methods are also applied in an attitude survey
Estimation techniques for ordinal data in multiple frame surveys with complex sampling designs
Surveys usually include questions where individuals must select one in a series of possible options that can be sorted. On the other hand, multiple frame surveys are becoming a widely used method to decrease bias due to undercoverage of the target population. In this work, we propose statistical techniques for handling ordinal data coming from a multiple frame survey using complex sampling designs and auxiliary information. Our aim is to estimate proportions when the variable of interest has ordinal outcomes. Two estimators are constructed following model‐assisted generalised regression and model calibration techniques. Theoretical properties are investigated for these estimators. Simulation studies with different sampling procedures are considered to evaluate the performance of the proposed estimators in finite size samples. An application to a real survey on opinions towards immigration is also included.Ministerio de Economía y CompetitividadConsejería de Economía, Innovación, Ciencia y EmpleoPRIN-SURWE
Bedside sonography assessment of extravascular lung water increase after major pulmonary resection in non-small cell lung cancer patients
Background: Extra vascular lung water (EVLW) following pulmonary resection increases due to fluid infusion and rises in capillary surface and permeability of the alveolar capillary membranes. EVLW increase clinically correlates to pulmonary oedema and it may generate impairments of gas exchanges and acute lung injury. An early and reliable assessment of postoperative EVLW, especially following major pulmonary resection, is useful in terms of reducing the risk of postoperative complications. The currently used methods, though satisfying these criteria, tend to be invasive and cumbersome and these factors might limit its use. The presence and burden of EVLW has been reported to correlate with sonographic B-line artefacts (BLA) assessed by lung ultrasound (LUS). This observational study investigated if bedside LUS could detect EVLW increases after major pulmonary resection. Due to the clinical association between EVLW increase and impairment of gas exchange, secondary aims of the study included investigating for associations between any observed EVLW increases and both respiratory ratio (PaO2/FiO2) and fluid retention, measured by brain natriuretic peptide (BNP). Methods: Overall, 74 major pulmonary resection patients underwent bedside LUS before surgery and at postoperative days 1 and 4, in the inviolate hemithorax which were divided into four quadrants. BLA were counted with a four-level method. The respiratory ratio PaO2/FiO2 and fluid retention were both assessed. Results: BLA resulted being increased at postoperative day 1 (OR 9.25; 95% CI, 5.28-16.20; P<0.0001 vs. baseline), and decreased at day 4 (OR 0.50; 95% CI, 0.31-0.80; P=0.004 vs. day 1). Moreover, the BLA increase was associated with both increased BNP (OR 1.005; 95% CI, 1.003-1.008; P<0.0001) and body weight (OR 1.040; 95% CI, 1.008-1.073; P=0.015). Significant inverse correlations were observed between the BLA values and the PaO2/FiO2 respiratory ratios. Conclusions: Our results suggest that LUS, due to its non-invasiveness, affordability and capacity to detect increases in EVLW, might be useful in better managing postoperative patients
Population Empirical Likelihood Estimation in Dual Frame Surveys
Dual frame surveys are a device to reduce the costs derived from data
collection in surveys and improve coverage for the whole target population. Since
their introduction, in the 1960’s, dual frame surveys have gained much attention
and several estimators have been formulated based on a number of different approaches. In this work, we propose new dual frame estimators based on the population empirical likelihood method originally proposed by Chen and Kim (2014) and
using both the dual and the single frame approach. The extension of the proposed
methodology to more than two frame surveys is also sketched. The performance
of the proposed estimators in terms of relative bias and relative mean squared
error is tested through simulation experiments. These experiments indicate that
the proposed estimators yield better results than other likelihood-based estimators
proposed in the literature.Ministerio de Economía y Competitividad of Spai