129 research outputs found

    Bayesian nonparametric models for spatially indexed data of mixed type

    Get PDF
    We develop Bayesian nonparametric models for spatially indexed data of mixed type. Our work is motivated by challenges that occur in environmental epidemiology, where the usual presence of several confounding variables that exhibit complex interactions and high correlations makes it difficult to estimate and understand the effects of risk factors on health outcomes of interest. The modeling approach we adopt assumes that responses and confounding variables are manifestations of continuous latent variables, and uses multivariate Gaussians to jointly model these. Responses and confounding variables are not treated equally as relevant parameters of the distributions of the responses only are modeled in terms of explanatory variables or risk factors. Spatial dependence is introduced by allowing the weights of the nonparametric process priors to be location specific, obtained as probit transformations of Gaussian Markov random fields. Confounding variables and spatial configuration have a similar role in the model, in that they only influence, along with the responses, the allocation probabilities of the areas into the mixture components, thereby allowing for flexible adjustment of the effects of observed confounders, while allowing for the possibility of residual spatial structure, possibly occurring due to unmeasured or undiscovered spatially varying factors. Aspects of the model are illustrated in simulation studies and an application to a real data set

    Health Information and Health Outcomes: An Application of the Regression Discontinuity Design to the 1995 UK Contraceptive Pill Scare Case

    Get PDF
    This paper provides a general formulation of the regression discontinuity (RD) design and shows its general applicability to many epidemiological problems. It then applies the RD method to estimate the effects of the 1995 pill scare in the UK, using individual birth records and aggregate monthly statistics. The results show that, following the announce- ment of the health warning on the þird generation" pill, conception rates increased by about 7%, with a 9% increase in abortion rates and a 6-7% rise in birth rates. No e®ect was found on still births, very low birth weight, sex ratios, or average birth weight. There is evidence of a slight increase in the rates of low birth weight births and multiple births and of a considerable reduction in the rate of births with congenital anomalies. Hetero- geneity by mother's age and social class is very pronounced, with most of the e®ects being experienced by women aged less than 25 and of lower socioeconomic status

    Classification Loss Function for Parameter Ensembles in Bayesian Hierarchical Models

    Full text link
    Parameter ensembles or sets of point estimates constitute one of the cornerstones of modern statistical practice. This is especially the case in Bayesian hierarchical models, where different decision-theoretic frameworks can be deployed to summarize such parameter ensembles. The estimation of these parameter ensembles may thus substantially vary depending on which inferential goals are prioritised by the modeller. In this note, we consider the problem of classifying the elements of a parameter ensemble above or below a given threshold. Two threshold classification losses (TCLs) --weighted and unweighted-- are formulated. The weighted TCL can be used to emphasize the estimation of false positives over false negatives or the converse. We prove that the weighted and unweighted TCLs are optimized by the ensembles of unit-specific posterior quantiles and posterior medians, respectively. In addition, we relate these classification loss functions on parameter ensembles to the concepts of posterior sensitivity and specificity. Finally, we find some relationships between the unweighted TCL and the absolute value loss, which explain why both functions are minimized by posterior medians.Comment: Submitted to Probability and Statistics Letter

    Quantile regression with aggregated data

    Get PDF
    Administrative data can contain a wealth of information for empirical research. Just to cite two examples, administrative data on schools can be used to study pupils’ educational attainments while hospital data can be useful for health research. However, access to administrative information is often restricted to aggregated data and this can lead to biased results. The estimation bias caused by using aggregated rather than individual data is known as the ecological bias. In this paper we consider for the first time this issue in the context of quantile regressions. We show how data can be aggregated to obtain unbiased estimation of quantile regressions with categorical covariates and how the bias can be reduced when researchers are interested to estimate quantile regression where some of the covariates are continuous

    Diagram-based Analysis of Causal Systems (DACS): elucidating inter-relationships between determinants of acute lower respiratory infections among children in sub-Saharan Africa.

    Get PDF
    Effective interventions require evidence on how individual causal pathways jointly determine disease. Based on the concept of systems epidemiology, this paper develops Diagram-based Analysis of Causal Systems (DACS) as an approach to analyze complex systems, and applies it by examining the contributions of proximal and distal determinants of childhood acute lower respiratory infections (ALRI) in sub-Saharan Africa
    • …
    corecore