65 research outputs found

    Separate regression modelling of the Gaussian and Exponential components of an EMG response from respiratory physiology.

    Get PDF
    If Y1 \sim N(\mu ;\sigma^2) and Y2 \sim Exp(\nu), with Y1 independent of Y2, then their sum Y = Y1 +Y2 follows an Exponentially Modified Gaussian (EMG) distribution. In many applications it is of interest to model the two components separately, in order to investigate their (possibly) different important predictors. We show how this can be done through a GAMLSS with EMG response, and apply this separate regression modelling strategy to a dataset on lung function variables from the SAPALDIA cohort study

    Networks as mediating variables: a Bayesian latent space approach

    Get PDF
    The use of network analysis to investigate social structures has recently seen a rise due to the high availability of data and the numerous insights it can provide into different fields. Most analyses focus on the topological characteristics of networks and the estimation of relationships between the nodes. We adopt a different perspective by considering the whole network as a random variable conveying the effect of an exposure on a response. This point of view represents a classical mediation setting, where the interest lies in estimating the indirect effect, that is, the effect propagated through the mediating variable. We introduce a latent space model mapping the network into a space of smaller dimension by considering the hidden positions of the units in the network. The coordinates of each node are used as mediators in the relationship between the exposure and the response. We further extend mediation analysis in the latent space framework by using Generalised Linear Models instead of linear ones, as previously done in the literature, adopting an approach based on derivatives to obtain the effects of interest. A Bayesian approach allows us to get the entire distribution of the indirect effect, generally unknown, and compute the corresponding highest density interval, which gives accurate and interpretable bounds for the mediated effect. Finally, an application to social interactions among a group of adolescents and their attitude toward substance use is presented

    Using Zero-inflated Models to Analyze Environmental Data Sets with Many Zeroes

    Get PDF
    L’analisi di dati di conteggio pu`o essere talvolta complessa a causa di un numero di zeri superiore a quello atteso sotto il modello Poissoniano, che rappresenta l’assunzione standard per la modellazione di questo tipo di dati. Obbiettivo primario della comunicazione `e quello di impiegare modelli alternativi a quello di Poisson, che contemplino la possibilit`a di trattare esplicitamente questo eccesso di zeri, per valutare eventuali differenze in termini di bont`a di adattamento e di stima dei parametri regressivi.Vengono discussi modelli Zero Inflated Posson (ZIP), Zero Inflated Negative Binomial (ZINB) e Hurdle Poisson (HP) e applicati a due insiemi di dati ambientali reali con un elevato numero di zeri

    An ensemble approach to short‐term forecast of COVID‐19 intensive care occupancy in Italian regions

    Get PDF
    The availability of intensive care beds during the COVID‐19 epidemic is crucial to guarantee the best possible treatment to severely affected patients. In this work we show a simple strategy for short‐term prediction of COVID‐19 intensive care unit (ICU) beds, that has proved very effective during the Italian outbreak in February to May 2020. Our approach is based on an optimal ensemble of two simple methods: a generalized linear mixed regression model, which pools information over different areas, and an area‐specific nonstationary integer autoregressive methodology. Optimal weights are estimated using a leave‐last‐out rationale. The approach has been set up and validated during the first epidemic wave in Italy. A report of its performance for predicting ICU occupancy at regional level is included

    Nowcasting COVID-19 incidence indicators during the Italian first outbreak

    Get PDF
    A novel parametric regression model is proposed to fit incidence data typically collected during epidemics. The proposal is motivated by real-time monitoring and short-term forecasting of the main epidemiological indicators within the first outbreak of COVID-19 in Italy. Accurate short-term predictions, including the potential effect of exogenous or external variables are provided. This ensures to accurately predict important characteristics of the epidemic (e.g., peak time and height), allowing for a better allocation of health resources over time. Parameter estimation is carried out in a maximum likelihood framework. All computational details required to reproduce the approach and replicate the results are provided.publishedVersio

    Covid‐19 in Italy: Modelling, communications, and collaborations

    Get PDF
    When Covid-19 arrived in Italy in early 2020, a group of statisticians came together to provide tools to make sense of the unfolding epidemic and to counter misleading media narratives. Here, members of StatGroup-19 reflect on their work to dat

    Heterogeneity of obesity-asthma association disentangled by latent class analysis, the SAPALDIA cohort

    Get PDF
    Abstract Although evidence for the heterogeneity of asthma accumulated, consensus for definitions of asthma phenotypes is still lacking. Obesity may have heterogeneous effects on various asthma phenotypes. We aimed to distinguish asthma phenotypes by latent class analysis and to investigate their associations with different obesity parameters in adults using a population-based Swiss cohort (SAPALDIA). We applied latent class analysis to 959 self-reported asthmatics using information on disease activity, atopy, and age of onset. Associations with obesity were examined by multinomial logistic regression, after adjustments for age, sex, smoking status, educational level, and study centre. Body mass index, percent body fat, waist hip ratio, waist height ratio, and waist circumference were used as obesity measure. Four asthma classes were identified, including persistent multiple symptom-presenting asthma (n = 122), symptom-presenting asthma (n = 290), symptom-free atopic asthma (n = 294), and symptom-free non-atopic asthma (n = 253). Obesity was positively associated with symptom-presenting asthma classes but not with symptom-free ones. Percent body fat showed the strongest association with the persistent multiple symptom-presenting asthma. We observed heterogeneity of associations with obesity across asthma classes, indicating different asthma aetiologies

    Generalized symmetry models for hypercubic concordance tables.

    No full text
    Frequency data obtained classifying a sample of 'units' by the same categorical variable repeatedly over 'components', can be arranged in a hypercubic concordance table (h.c.t.). This kind of data naturally arises in a number of different areas such as longitudinal studies, studies using matched and clustered data, item-response analysis, agreement analysis. In spite of the substantial diversity of the mechanisms that can generate them, data arranged in a h.c.t. can ail be analyzed via models of symmetry and quasi-symmetry, which exploit the special structure of the h.c.t. The paper extends the definition of such models to any dimension, introducing the class of generalized symmetry models, which provides a unified framework for inference on categorical data that can be represented in a h.c.t.. Within this framework it is possible to derive the common structure which underlies these models and clarify their meaning;their usefulness in applied work is illustrated by a re-analysis of two real example

    A penalized approach for the bivariate ordered logistic model with applications to social and medical data

    Get PDF
    Bivariate ordered logistic models (BOLMs) are appealing to jointly model the marginal distribution of two ordered responses and their association, given a set of covariates. When the number of categories of the responses increases, the number of global odds ratios to be estimated also increases, and estimation gets problematic. In this work we propose a non-parametric approach for the maximum likelihood (ML) estimation of a BOLM, wherein penalties to the differences between adjacent row and column effects are applied. Our proposal is then compared to the Goodman and Dale models. Some simulation results as well as analyses of two real data sets are presented and discussed

    A matrix-valued Bernoulli distribution

    Get PDF
    Matrix-valued distributions are used in continuous multivariate analysis to model sample data matrices of continuous measurements; their use seems to be neglected for binary, or more generally categorical, data. In this paper we propose a matrix-valued Bernoulli distribution, based on the log-linear representation introduced by Cox (1972) for the Multivariate Bernoulli distribution with correlated components
    corecore