5,756 research outputs found

    Component-Based Evaluation using GLMM

    Get PDF

    Contribution of common and rare variants to bipolar disorder susceptibility in extended pedigrees from population isolates.

    Get PDF
    Current evidence from case/control studies indicates that genetic risk for psychiatric disorders derives primarily from numerous common variants, each with a small phenotypic impact. The literature describing apparent segregation of bipolar disorder (BP) in numerous multigenerational pedigrees suggests that, in such families, large-effect inherited variants might play a greater role. To identify roles of rare and common variants on BP, we conducted genetic analyses in 26 Colombia and Costa Rica pedigrees ascertained for bipolar disorder 1 (BP1), the most severe and heritable form of BP. In these pedigrees, we performed microarray SNP genotyping of 838 individuals and high-coverage whole-genome sequencing of 449 individuals. We compared polygenic risk scores (PRS), estimated using the latest BP1 genome-wide association study (GWAS) summary statistics, between BP1 individuals and related controls. We also evaluated whether BP1 individuals had a higher burden of rare deleterious single-nucleotide variants (SNVs) and rare copy number variants (CNVs) in a set of genes related to BP1. We found that compared with unaffected relatives, BP1 individuals had higher PRS estimated from BP1 GWAS statistics (P = 0.001 ~ 0.007) and displayed modest increase in burdens of rare deleterious SNVs (P = 0.047) and rare CNVs (P = 0.002 ~ 0.033) in genes related to BP1. We did not observe rare variants segregating in the pedigrees. These results suggest that small-to-moderate effect rare and common variants are more likely to contribute to BP1 risk in these extended pedigrees than a few large-effect rare variants

    A review of R-packages for random-intercept probit regression in small clusters

    Get PDF
    Generalized Linear Mixed Models (GLMMs) are widely used to model clustered categorical outcomes. To tackle the intractable integration over the random effects distributions, several approximation approaches have been developed for likelihood-based inference. As these seldom yield satisfactory results when analyzing binary outcomes from small clusters, estimation within the Structural Equation Modeling (SEM) framework is proposed as an alternative. We compare the performance of R-packages for random-intercept probit regression relying on: the Laplace approximation, adaptive Gaussian quadrature (AGQ), Penalized Quasi-Likelihood (PQL), an MCMC-implementation, and integrated nested Laplace approximation within the GLMM-framework, and a robust diagonally weighted least squares estimation within the SEM-framework. In terms of bias for the fixed and random effect estimators, SEM usually performs best for cluster size two, while AGQ prevails in terms of precision (mainly because of SEM's robust standard errors). As the cluster size increases, however, AGQ becomes the best choice for both bias and precision

    The Estimation of Item Response Models with the lmer Function from the lme4 Package in R

    Get PDF
    In this paper we elaborate on the potential of the lmer function from the lme4 package in R for item response (IRT) modeling. In line with the package, an IRT framework is described based on generalized linear mixed modeling. The aspects of the framework refer to (a) the kind of covariates -- their mode (person, item, person-by-item), and their being external vs. internal to responses, and (b) the kind of effects the covariates have -- fixed vs. random, and if random, the mode across which the effects are random (persons, items). Based on this framework, three broad categories of models are described: Item covariate models, person covariate models, and person-by-item covariate models, and within each category three types of more specific models are discussed. The models in question are explained and the associated lmer code is given. Examples of models are the linear logistic test model with an error term, differential item functioning models, and local item dependency models. Because the lme4 package is for univariate generalized linear mixed models, neither the two-parameter, and three-parameter models, nor the item response models for polytomous response data, can be estimated with the lmer function.

    An improved mosquito electrocuting trap that safely reproduces epidemiologically relevant metrics of mosquito human-feeding behaviours as determined by human landing catch

    Get PDF
    Background: Reliable quantification of mosquito host—seeking behaviours is required to determine the efficacy of vector control methods. For malaria, the gold standard approach remains the risky human landing catch (HLC). Here compare the performance of an improved prototype of the mosquito electrocuting grid trap (MET) as a safer alternative with HLC for measuring malaria vector behaviour in Dar es Salaam, Tanzania. Methods: Mosquito trapping was conducted at three sites within Dar es Salaam representing a range of urbanicity over a 7-month period (December 2012–July 2013, 168 sampling nights). At each site, sampling was conducted in a block of four houses, with two houses being allocated to HLC and the other to MET on each night of study. Sampling was conducted both indoors and outdoors (from 19:00 to 06:00 each night) at all houses, with trapping method (HLC and MET) being exchanged between pairs of houses at each site using a crossover design. Results: The MET caught significantly more Anopheles gambiae sensu lato than the HLC, both indoors (RR [95 % confidence interval (CI)]) = 1.47 [1.23–1.76], P < 0.0001 and outdoors = 1.38 [1.14–1.67], P < 0.0001). The sensitivity of MET compared with HLC did not detectably change over the course of night for either An. gambiae s.l. (OR [CI]) = 1.01 [0.94–1.02], P = 0.27) or Culex spp. (OR [CI]) = 0.99 [0.99–1.0], P = 0.17) indoors and declined only slightly outdoors: An. gambiae s.l. (OR [CI]) = 0.92 [0.86–0.99], P = 0.04), and Culex spp. (OR [CI]) = 0.99 [0.98–0.99], P = 0.03). MET-based estimates of the proportions of mosquitoes caught indoors (P i ) or during sleeping hours (P fl ), as well as the proportion of human exposure to bites that would otherwise occurs indoors (π i ), were statistically indistinguishable from those based on HLC for An. gambiae s.l. (P = 0.43, 0.07 and 0.48, respectively) and Culex spp. (P = 0.76, 0.24 and 0.55, respectively). Conclusions: This improved MET prototype is highly sensitive tool that accurately quantifies epidemiologically-relevant metrics of mosquito biting densities, behaviours and human exposure distribution

    Monte Carlo likelihood inference for missing data models

    Full text link
    We describe a Monte Carlo method to approximate the maximum likelihood estimate (MLE), when there are missing data and the observed data likelihood is not available in closed form. This method uses simulated missing data that are independent and identically distributed and independent of the observed data. Our Monte Carlo approximation to the MLE is a consistent and asymptotically normal estimate of the minimizer θ∗\theta^* of the Kullback--Leibler information, as both Monte Carlo and observed data sample sizes go to infinity simultaneously. Plug-in estimates of the asymptotic variance are provided for constructing confidence regions for θ∗\theta^*. We give Logit--Normal generalized linear mixed model examples, calculated using an R package.Comment: Published at http://dx.doi.org/10.1214/009053606000001389 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Hierarchical models for semi-competing risks data with application to quality of end-of-life care for pancreatic cancer

    Full text link
    Readmission following discharge from an initial hospitalization is a key marker of quality of health care in the United States. For the most part, readmission has been used to study quality of care for patients with acute health conditions, such as pneumonia and heart failure, with analyses typically based on a logistic-Normal generalized linear mixed model. Applying this model to the study readmission among patients with increasingly prevalent advanced health conditions such as pancreatic cancer is problematic, however, because it ignores death as a competing risk. A more appropriate analysis is to imbed such studies within the semi-competing risks framework. To our knowledge, however, no comprehensive statistical methods have been developed for cluster-correlated semi-competing risks data. In this paper we propose a novel hierarchical modeling framework for the analysis of cluster-correlated semi-competing risks data. The framework permits parametric or non-parametric specifications for a range of model components, including baseline hazard functions and distributions for key random effects, giving analysts substantial flexibility as they consider their own analyses. Estimation and inference is performed within the Bayesian paradigm since it facilitates the straightforward characterization of (posterior) uncertainty for all model parameters including hospital-specific random effects. The proposed framework is used to study the risk of readmission among 5,298 Medicare beneficiaries diagnosed with pancreatic cancer at 112 hospitals in the six New England states between 2000-2009, specifically to investigate the role of patient-level risk factors and to characterize variation in risk across hospitals that is not explained by differences in patient case-mix
    • …
    corecore