7 research outputs found

    Bayesian Analysis of Doubly Inflated Poisson Regression for Correlated Count Data: Application to DMFT Data

    Get PDF
    Outcome variables in clinical studies sometimes include count data with inflation in two points (usually zero and k (k>0)). Doubly inflated models can be adopted for modeling these types of data. In statistical modeling, the association among subjects due to longitudinal or cluster study designs is considered by random effects models. In this article, we proposed a doubly inflated random effects model using the Bayesian approach for correlated count data with inflation in two values, and compared this model with Bayesian zero-inflated Poisson and Bayesian Poisson models. The parameters’ estimates by these models were obtained by Markov Chain Monte Carlo method using OpenBUGS software. Bayesian models were compared using the deviance information criterion. To this end, we utilized the total number of decayed, missed, and filled teeth of 12-year-old children and also conducted a simulation study.  Results of real data and the simulation study revealed that the proposed model is fitted better than previous models.&nbsp

    Bayesian Variable Selection for Multivariate Spatially Varying Coefficient Regression

    Get PDF
    Physical activity has many well-documented health benefits for cardiovascular fitness and weight control. For pregnant women, the American College of Obstetricians and Gynecologists currently recommends 30 minutes of moderate exercise on most, if not all, days; however, very few pregnant women achieve this level of activity. Traditionally, studies have focused on examining individual or interpersonal factors to identify predictors of physical activity. There is a renewed interest in whether characteristics of the physical environment in which we live and work may also influence physical activity levels. We consider one of the first studies of pregnant women that examines the impact of characteristics of the built environment on physical activity levels. Using a socioecologic framework, we study the associations between physical activity and several factors including personal characteristics, meteorological/air quality variables, and neighborhood characteristics for pregnant women in four counties of North Carolina. We simultaneously analyze six types of physical activity and investigate cross-dependencies between these activity types. Exploratory analysis suggests that the associations are different in different regions. Therefore we use a multivariate regression model with spatially-varying regression coefficients. This model includes a regression parameter for each covariate at each spatial location. For our data with many predictors, some form of dimension reduction is clearly needed. We introduce a Bayesian variable selection procedure to identify subsets of important variables. Our stochastic search algorithm determines the probabilities that each covariate’s effect is null, non-null but constant across space, and spatially-varying. We found that individual level covariates had a greater influence on women’s activity levels than neighborhood environmental characteristics, and some individual level covariates had spatially-varying associations with the activity levels of pregnant women

    Marginalized two-part models for semicontinuous data with application to medical costs

    Get PDF
    In health services research, it is common to encounter semicontinuous data characterized by a point mass at zero followed by a right-skewed continuous distribution with positive support. Examples include health expenditures, in which the zeros represent a subpopulation of patients who do not use health services, while the continuous distribution describes the level of expenditures among health services users. Semicontinuous data are typically analyzed using two-part mixture models that separately model the probability of health services use and the distribution of positive expenditures among users. However, because the second part conditions on a nonzero response, conventional two-part models do not provide a marginal interpretation of covariate effects on the overall population of health service users and non-users, even though this is often of greatest interest to investigators. Here, we propose a marginalized two-part model that yields more interpretable effect estimates in two-part models by parameterizing the model in terms of the marginal mean. This model maintains many of the important features of conventional two-part models, such as capturing zero-inflation and skewness, but allows investigators to examine covariate effects on the overall marginal mean, a target of primary interest in many applications. Using a simulation study, we examine properties of the maximum likelihood estimators from this model. We illustrate the approach by evaluating the effect of a behavioral weight loss intervention on health care expenditures in the Veterans Affairs (VA) health care system. We then extend this marginalized two-part model to clustered or longitudinal data structures by incorporating random effects. This longitudinal marginalized two-part model is fit following a fully Bayesian approach with non-informative or weakly informative prior distributions, and we illustrate it by analyzing the effect of a copayment increase in the VA health system. Finally, using simulation studies, we compare the performance of the marginalized two-part model to commonly used one-part generalized linear models (GLMs) fit via quasi-likelihood estimation over a range of simulated data scenarios with varying percentages of zero-valued observations.Doctor of Public Healt

    Statistical Modelling of Breastfeeding Data

    Get PDF
    This thesis addresses some key methodological problems in statistical modelling of breastfeeding data. Meta-analysis techniques were used to analyse aggregated breastfeeding data. Generalised linear mixed model and an extended Cox model were used with time-varying exposures to analyse longitudinal and time-to-event breastfeeding data, respectively. Shared frailty models were applied to correlated breastfeeding duration data controlling for heterogeneity. A novel two-part mixed-effects model was proposed for modelling clustered time-to-event breastfeeding data with clumping at zero

    Augmented Mixed Beta Regression Models For Periodontal Proportion Data

    No full text
    Continuous (clustered) proportion data often arise in various domains of medicine and public health where the response variable of interest is a proportion (or percentage) quantifying disease status for the cluster units, ranging between zero and one. However, because of the presence of relatively disease-free as well as heavily diseased subjects in any study, the proportion values can lie in the interval [0,1]. While beta regression can be adapted to assess covariate effects in these situations, its versatility is often challenged because of the presence/excess of zeros and ones because the beta support lies in the interval (0,1). To circumvent this, we augment the probabilities of zero and one with the beta density, controlling for the clustering effect. Our approach is Bayesian with the ability to borrow information across various stages of the complex model hierarchy and produces a computationally convenient framework amenable to available freeware. The marginal likelihood is tractable and can be used to develop Bayesian case-deletion influence diagnostics based on q-divergence measures. Both simulation studies and application to a real dataset from a clinical periodontology study quantify the gain in model fit and parameter estimation over other ad hoc alternatives and provide quantitative insight into assessing the true covariate effects on the proportion responses. © 2014 John Wiley & Sons, Ltd.332137593771UL1TR000114; NIH; National Institutes of HealthKieschnick, R., McCullough, B.D., Regression analysis of variates observed on (0,1): percentages, proportions and fractions (2003) Statistical Modelling, 3 (3), pp. 193-213Branscum, A.J., Johnson, W.O., Thurmond, M.C., Bayesian beta regression: applications to household expenditure data and genetic distance between foot-and-mouth disease viruses (2007) Australian & New Zealand Journal of Statistics, 49 (3), pp. 287-301Song, P.X.K., Tan, M., Marginal models for longitudinal continuous proportional data (2000) Biometrics, 56 (2), pp. 496-502Qiu, Z., Song, P.X.K., Tan, M., Simplex mixed-effects models for longitudinal proportional data (2008) Scandinavian Journal of Statistics, 35 (4), pp. 577-596Aitchison, J., The Statistical Analysis of Compositional Data (1982) Journal of the Royal Statistical Society. Series B (Methodological), 44 (2), pp. 139-177Ferrari, S., Cribari-Neto, F., Beta regression for modelling rates and proportions (2004) Journal of Applied Statistics, 31 (7), pp. 799-815Johnson, N., Kotz, S., Balakrishnan, N., (1994) Continuous Univariate Distributions, 2. , John Wiley & Sons: New YorkFernandes, J., Salinas, C., London, S., Wiegand, R., Hill, E., Slate, E., Grewal, J., Lopes-Virella, M., Prevalence of periodontal disease in Gullah African American diabetics (2006) Journal of Dental Research, 85, p. 997Smithson, M., Verkuilen, J., A better lemon squeezer? Maximum-likelihood regression with beta-distributed dependent variables (2006) Psychological Methods, 11 (1), p. 54Lachos, V.H., Bandyopadhyay, D., Dey, D.K., Linear and nonlinear mixed-effects models for censored HIV viral loads using normal/independent distributions (2011) Biometrics, 67 (4), pp. 1594-1604Verkuilen, J., Smithson, M., Mixed and mixture regression models for continuous bounded responses using the beta distribution (2012) Journal of Educational and Behavioral Statistics, 37 (1), pp. 82-113Figueroa-Zuiga, J., Arellano-Valle, R.B., Ferrari, S.L., Mixed beta regression: a bayesian perspective (2013) Computational Statistics & Data Analysis, 61, pp. 137-147Spiegelhalter, D.J., Best, N.G., Carlin, B.P., van der Linde, A., Bayesian measures of model complexity and fit (2002) Journal of the Royal Statistical Society-Series B, 64 (4), pp. 583-639Hatfield, L.A., Boye, M.E., Hackshaw, M.D., Carlin, B.P., Multilevel Bayesian models for survival times and longitudinal patient-reported outcomes with many zeros (2012) Journal of the American Statistical Association, 107, pp. 875-885Cook, R.D., Assessment of local influence (1986) Journal of the Royal Statistical Society, Series B, 48, pp. 133-169Thomas, A., OHara, B., Ligges, U., Sturtz, S., Making BUGS open (2006) R News, 6 (1), pp. 12-17Simas, A., Barreto-Souza, W., Rocha, A., Improved estimators for a general class of beta regression models (2010) Computational Statistics and Data Analysis, 54 (2), pp. 348-366Zeileis, A., Cribari-Neto, F., Grn, B., Beta regression in R (2010) Journal of Statistical Software, 34 (2), pp. 1-24Ospina, R., Ferrari, S., Inflated beta distributions (2010) Statistical Papers, 51 (1), pp. 111-126Dunson, D., Commentary: practical advantages of Bayesian analysis of epidemiologic data (2001) American Journal of Epidemiology, 153 (12), p. 1222Gelman, A., Prior distributions for variance parameters in hierarchical models (2006) Bayesian Analysis, 1 (3), pp. 515-534Thomas, A., O'Hara, B., Ligges, U., Sturtz, S., Making BUGS Open (2006) R News, 6 (1), pp. 12-17Cowles, M.K., Carlin, B.P., Markov chain Monte Carlo convergence diagnostics: a comparative review (1996) Journal of the American Statistical Association, 91 (434), pp. 883-904Carlin, B., Louis, T., (2008) Bayesian Methods for Data Analysis (Texts in Statistical Science), , Chapman and Hall/CRC: New YorkRaftery, A., Newton, M., Satagopan, J., Krivitsky, P., Estimating the integrated likelihood via posterior simulation using the harmonic mean identity (with discussion) (2007) Bayesian Statistics 8, 8, pp. 1-45. , In, Vol., Berger J, Dawid A, Heckerman D, Smith A, West M (eds). Oxford University Press: London, UKCeleux, G., Forbes, F., Robert, C.P., Titterington, D.M., Deviance information criteria for missing data models (2006) Bayesian Analysis, 1 (4), pp. 651-673Gelman, A., Carlin, J.B., Stern, H.S., Rubin, D., (2004) Bayesian Data Analysis, , Chapman & Hall/CRC: Boca Raton, FLCook, R.D., Weisberg, S., (1982) Residuals and Influence in Regression, , Chapman & Hall/CRC: Boca Raton, FLCsisz, I., Information-type measures of difference of probability distributions and indirect observations (1967) Studia Scientiarum Mathematicarum Hungarica, 2, pp. 299-318Weiss, R., An approach to Bayesian sensitivity analysis (1996) Journal of the Royal Statistical Society. Series B (Methodological), 58 (4), pp. 739-750Lachos, V.H., Castro, L.M., Dey, D.K., Bayesian inference in nonlinear mixed-effects models using normal independent distributions (2013) Computational Statistics & Data Analysis, 64, pp. 237-252Peng, F., Dey, D.K., Bayesian analysis of outlier problems using divergence measures (1995) The Canadian Journal of Statistics, 23, pp. 199-213Johnson-Spruill, I., Hammond, P., Davis, B., McGee, Z., Louden, D., Health of Gullah families in South Carolina with type 2 diabetes: diabetes self-management analysis from project sugar (2009) The Diabetes Educator, 35 (1), pp. 117-123Bandyopadhyay, D., Reich, B.J., Slate, E.H., Bayesian modeling of multivariate spatial binary data with applications to dental caries (2009) Statistics in Medicine, 28 (28), pp. 3492-3508Lachenbruch, P.A., Analysis of data with excess zeros (2002) Statistical Methods in Medical Research, 11 (4), pp. 297-302Ghosh, P., Albert, P.S., A Bayesian analysis for longitudinal semicontinuous data with an application to an acupuncture clinical trial (2009) Computational Statistics & Data Analysis, 53 (3), pp. 699-706Zhang, P., Qiu, Z., Fu, Y., Song, P.X.K., Robust transformation mixed-effects models for longitudinal continuous proportional data (2009) Canadian Journal of Statistics, 37 (2), pp. 266-28
    corecore