4 research outputs found
Accounting for self-protective responses in randomized response data from a social security survey using the zero-inflated Poisson model
In 2004 the Dutch Department of Social Affairs conducted a survey to assess
the extent of noncompliance with social security regulations. The survey was
conducted among 870 recipients of social security benefits and included a
series of sensitive questions about regulatory noncompliance. Due to the
sensitive nature of the questions the randomized response design was used.
Although randomized response protects the privacy of the respondent, it is
unlikely that all respondents followed the design. In this paper we introduce a
model that allows for respondents displaying self-protective response behavior
by consistently giving the nonincriminating response, irrespective of the
outcome of the randomizing device. The dependent variable denoting the total
number of incriminating responses is assumed to be generated by the application
of randomized response to a latent Poisson variable denoting the true number of
rule violations. Since self-protective responses result in an excess of
observed zeros in relation to the Poisson randomized response distribution,
these are modeled as observed zero-inflation. The model includes predictors of
the Poisson parameters, as well as predictors of the probability of
self-protective response behavior.Comment: Published in at http://dx.doi.org/10.1214/07-AOAS135 the Annals of
Applied Statistics (http://www.imstat.org/aoas/) by the Institute of
Mathematical Statistics (http://www.imstat.org
Refinement of the extended crosswise model with a number sequence randomizer : evidence from three different studies in the UK
The Extended Crosswise Model (ECWM) is a randomized response model with neutral response categories, relatively simple instructions, and the availability of a goodness-of-fit test. This paper refines this model with a number sequence randomizer that virtually precludes the possibility to give evasive responses. The motivation for developing this model stems from a strategic priority of WADA (World Anti-Doping Agency) to monitor the prevalence of doping use by elite athletes. For this model we derived a maximum likelihood estimator that allows for binary logistic regression analysis. Three studies were conducted on online platforms with a total of over 6, 000 respondents; two on controlled substance use and one on compliance with COVID-19 regulations in the UK during the first lockdown. The results of these studies are promising. The goodness-of-fit tests showed little to no evidence for response biases, and the ECWM yielded higher prevalence estimates than direct questions for sensitive questions, and similar ones for non-sensitive questions. Furthermore, the randomizer with the shortest number sequences yielded the smallest response error rates on a control question with known prevalence
The analysis of randomized response “ever” and “last year” questions: a non-saturated multinomial model
Randomized response (RR) is a well-known interview technique designed to eliminate evasive response bias that arises from asking sensitive questions. The most frequently asked questions in RR are either whether respondents were “ever” carriers of the sensitive characteristic, or whether they were carriers in a recent period, for instance, “last year”. The present paper proposes a design in which both questions are asked, and derives a multinomial model for the joint analysis of these two questions. Compared to the separate analyses with the binomial model, the model makes a useful distinction between last year and former carriers of the sensitive characteristic, it is more efficient in estimating the prevalence of last year carriers, and it has a degree of freedom that allows for a goodness-of-fit test. Furthermore, it is easily extended to a multinomial logistic regression model to investigate the effects of covariates on the prevalence estimates. These benefits are illustrated in two studies on the use of anabolic androgenic steroids in the Netherlands, one using Kuk and one using both the Kuk and forced response. A salient result of our analyses is that the multinomial model provided ample evidence of response biases in the forced response condition
The multidimensional randomized response design: Estimating different aspects of the same sensitive behavior
The conventional randomized response design is unidimensional in the sense that it measures a single dimension of a sensitive attribute, like its prevalence, frequency, magnitude, or duration. This paper introduces a multidimensional design characterized by categorical questions that each measure a different aspect of the same sensitive attribute. The benefits of the multidimensional design are (i) a substantial gain in power and efficiency, and the potential to (ii) evaluate the goodness-of-fit of the model, and (iii) test hypotheses about evasive response biases in case of a misfit. The method is illustrated for a two-dimensional design measuring both the prevalence and the magnitude of social security fraud