246 research outputs found

    Pooling multiple imputations when the sample happens to be the population

    Get PDF
    Current pooling rules for multiply imputed data assume infinite populations. In some situations this assumption is not feasible as every unit in the population has been observed, potentially leading to over-covered population estimates. We simplify the existing pooling rules for situations where the sampling variance is not of interest. We compare these rules to the conventional pooling rules and demonstrate their use in a situation where there is no sampling variance. Using the standard pooling rules in situations where sampling variance should not be considered, leads to overestimation of the variance of the estimates of interest, especially when the amount of missingness is not very large. As a result, populations estimates are over-covered, which may lead to a loss of statistical power. We conclude that the theory of multiple imputation can be extended to the situation where the sample happens to be the population. The simplified pooling rules can be easily implemented to obtain valid inference in cases where we have observed essentially all units and in simulation studies addressing the missingness mechanism only.Comment: 6 pages, 1 figure, 1 tabl

    Broken Stick Model for Irregular Longitudinal Data

    Get PDF
    Many longitudinal studies collect data that have irregular observation times, often requiring the application of linear mixed models with time-varying outcomes. This paper presents an alternative that splits the quantitative analysis into two steps. The first step converts irregularly observed data into a set of repeated measures through the broken stick model. The second step estimates the parameters of scientific interest from the repeated measurements at the subject level. The broken stick model approximates each subject's trajectory by a series of connected straight lines. The breakpoints, specified by the user, divide the time axis into consecutive intervals common to all subjects. Specification of the model requires just three variables: time, measurement and subject. The model is a special case of the linear mixed model, with time as a linear B-spline and subject as the grouping factor. The main assumptions are: Subjects are exchangeable, trajectories between consecutive breakpoints are straight, random effects follow a multivariate normal distribution, and unobserved data are missing at random. The R package brokenstick v2.5.0 offers tools to calculate, predict, impute and visualize broken stick estimates. The package supports two optimization methods, including options to constrain the variance-covariance matrix of the random effects. We demonstrate six applications of the model: Detection of critical periods, estimation of the time-to-time correlations, profile analysis, curve interpolation, multiple imputation and personalized prediction of future outcomes by curve matching

    Π’ΠΊΠ»Π°Π΄ ΠΈΠ½Ρ‚Π΅Π»Π»ΠΈΠ³Π΅Π½Ρ†ΠΈΠΈ Π² исслСдованиС ΡΠΎΡ†ΠΈΠ°Π»ΡŒΠ½ΠΎΠΉ памяти

    Get PDF
    An important goal of growth monitoring is to identify genetic disorders, diseases or other conditions that manifest themselves through an abnormal growth. The two main conditions that can be detected by height monitoring are Turner's syndrome and growth hormone deficiency. Conditions or risk factors that can be detected by monitoring weight or body mass index include hypernatremic dehydration, celiac disease, cystic fibrosis and obesity. Monitoring infant head growth can be used to detect macrocephaly, developmental disorder and ill health in childhood. This paper describes statistical methods to obtain evidence-based referral criteria in growth monitoring. The referral criteria that we discuss are based on either anthropometric measurement(s) at a fixed age using (1) a Centile or a Standard Deviation Score, (2) a Standard Deviation corrected for parental height, (3) a Likelihood Ratio Statistic and (4) an ellipse, or on multiple measurements over time using (5) a growth rate and (6) a growth curve model. We review the potential uses of these methods, and outline their strengths and limitations

    ΠœΠΈΡ€ΠΎΠ²ΠΎΠΉ ΠΎΠΏΡ‹Ρ‚ создания ΠΈ функционирования свободных экономичСских Π·ΠΎΠ½ ΠΈ возмоТности Π΅Π³ΠΎ использования Π² Π£ΠΊΡ€Π°ΠΈΠ½Π΅

    Get PDF
    БСйчас Π² ΠΌΠΈΡ€Π΅ Ρ„ΡƒΠ½ΠΊΡ†ΠΈΠΎΠ½ΠΈΡ€ΡƒΠ΅Ρ‚, ΠΏΠΎ Ρ€Π°Π·Π½Ρ‹ΠΌ Π΄Π°Π½Π½Ρ‹ΠΌ, ΠΎΡ‚ 400 Π΄ΠΎ 2000 свободных экономичСских Π·ΠΎΠ½. Π’ΠΏΠ΅Ρ€Π²Ρ‹Π΅ Π‘Π­Π— Π±Ρ‹Π»ΠΈ созданы Π² БША ΠΏΠΎ Π°ΠΊΡ‚Ρƒ 1934 Π³. Π² Π²ΠΈΠ΄Π΅ Π·ΠΎΠ½ внСшнСй Ρ‚ΠΎΡ€Π³ΠΎΠ²Π»ΠΈ. ЦСлью ΠΈΡ… Π±Ρ‹Π»Π° активизация Π²Π½Π΅ΡˆΠ½Π΅Ρ‚ΠΎΡ€Π³ΠΎΠ²ΠΎΠΉ Π΄Π΅ΡΡ‚Π΅Π»ΡŒΠ½ΠΎΡΡ‚ΠΈ посрСдством использования эффСктивных ΠΌΠ΅Ρ…Π°Π½ΠΈΠ·ΠΌΠΎΠ² сниТСния Ρ‚Π°ΠΌΠΎΠΆΠ΅Π½Π½Ρ‹Ρ… ΠΈΠ·Π΄Π΅Ρ€ΠΆΠ΅ΠΊ. ΠŸΡ€ΠΈ этом Π³Π»Π°Π²Π½Ρ‹ΠΌ ΠΎΠ±Ρ€Π°Π·ΠΎΠΌ ΠΏΡ€Π΅Π΄ΠΏΠΎΠ»Π°Π³Π°Π»ΠΎΡΡŒ сокращСниС ΠΈΠΌΠΏΠΎΡ€Ρ‚Π½Ρ‹Ρ… Ρ‚Π°Ρ€ΠΈΡ„ΠΎΠ² Π½Π° Π΄Π΅Ρ‚Π°Π»ΠΈ ΠΈ ΠΊΠΎΠΌΠΏΠΎΠ½Π΅Π½Ρ‚Ρ‹ для производства Π°Π²Ρ‚ΠΎΠΌΠΎΠ±ΠΈΠ»Π΅ΠΉ. Π’ Π·ΠΎΠ½Ρ‹ внСшнСй Ρ‚ΠΎΡ€Π³ΠΎΠ²Π»ΠΈ Π±Ρ‹Π»ΠΈ ΠΏΡ€Π΅Π²Ρ€Π°Ρ‰Π΅Π½Ρ‹ склады, Π΄ΠΎΠΊΠΈ, аэропорты. ΠŸΡ€Π΅Π΄ΠΏΡ€ΠΈΡΡ‚ΠΈΡ, Π΄Π΅ΠΉΡΡ‚Π²ΡƒΡŽΡ‰ΠΈΠ΅ Π² ΡƒΠΊΠ°Π·Π°Π½Π½Ρ‹Ρ… Π·ΠΎΠ½Π°Ρ…, Π²Ρ‹Π²ΠΎΠ΄ΠΈΠ»ΠΈΡΡŒ ΠΈΠ·-ΠΏΠΎΠ΄ Ρ‚Π°ΠΌΠΎΠΆΠ΅Π½Π½ΠΎΠ³ΠΎ контроля Π² БША, Ссли ΠΈΠΌΠΏΠΎΡ€Ρ‚ΠΈΡ€ΡƒΠ΅ΠΌΡ‹Π΅ Π² Π·ΠΎΠ½Ρƒ Ρ‚ΠΎΠ²Π°Ρ€Ρ‹ Π·Π°Ρ‚Π΅ΠΌ Π½Π°ΠΏΡ€Π°Π²Π»ΡΠ»ΠΈΡΡŒ Π² Ρ‚Ρ€Π΅Ρ‚ΡŒΡŽ страну. Π’Π°ΠΌΠΎΠΆΠ΅Π½Π½Ρ‹Π΅ ΠΈΠ·Π΄Π΅Ρ€ΠΆΠΊΠΈ сниТались ΠΈ Ρ‚ΠΎΠ³Π΄Π°, ΠΊΠΎΠ³Π΄Π° Π² Π·ΠΎΠ½Π΅ ΠΎΡΡƒΡ‰Π΅ΡΡ‚Π²Π»ΡΠ»Π°ΡΡŒ "Π΄ΠΎΠ²ΠΎΠ΄ΠΊΠ°" ΠΏΡ€ΠΎΠ΄ΡƒΠΊΡ†ΠΈΠΈ Ρ„ΠΈΡ€ΠΌ БША для ΠΏΠΎΡΠ»Π΅Π΄ΡƒΡŽΡ‰Π΅Π³ΠΎ экспорта. Если ΠΆΠ΅ Ρ‚ΠΎΠ²Π°Ρ€Ρ‹ ΠΈΠ· Π·ΠΎΠ½Ρ‹ шли Π² БША, ΠΎΠ½ΠΈ Π² ΠΎΠ±ΡΠ·Π°Ρ‚Π΅Π»ΡŒΠ½ΠΎΠΌ порядкС ΠΏΡ€ΠΎΡ…ΠΎΠ΄ΠΈΠ»ΠΈ всС Ρ‚Π°ΠΌΠΎΠΆΠ΅Π½Π½Ρ‹Π΅ ΠΏΡ€ΠΎΡ†Π΅Π΄ΡƒΡ€Ρ‹, прСдусмотрСнныС Π·Π°ΠΊΠΎΠ½ΠΎΠ΄Π°Ρ‚Π΅Π»ΡŒΡΡ‚Π²ΠΎΠΌ страны

    mice: Multivariate Imputation by Chained Equations in R

    Get PDF
    The R package mice imputes incomplete multivariate data by chained equations. The software mice 1.0 appeared in the year 2000 as an S-PLUS library, and in 2001 as an R package. mice 1.0 introduced predictor selection, passive imputation and automatic pooling. This article documents mice, which extends the functionality of mice 1.0 in several ways. In mice, the analysis of imputed data is made completely general, whereas the range of models under which pooling works is substantially extended. mice adds new functionality for imputing multilevel data, automatic predictor selection, data handling, post-processing imputed values, specialized pooling routines, model selection tools, and diagnostic graphs. Imputation of categorical data is improved in order to bypass problems caused by perfect prediction. Special attention is paid to transformations, sum scores, indices and interactions using passive imputation, and to the proper setup of the predictor matrix. mice can be downloaded from the Comprehensive R Archive Network. This article provides a hands-on, stepwise approach to solve applied incomplete data problems

    Evaluation and prediction of individual growth trajectories

    Get PDF
    Background: Conventional growth charts offer limited guidance to track individual growth. Aim: To explore new approaches to improve the evaluation and prediction of individual growth trajectories. Subjects and methods: We generalise the conditional SDS gain to multiple historical measurements, using the Cole correlation model to find correlations at exact ages, the sweep operator to find regression weights and a specified longitudinal reference. We explain the various steps of the methodology and validate and demonstrate the method using empirical data from the SMOCC study with 1985 children measured during ten visits at ages 0–2years. Results: The method performs according to statistical theory. We apply the method to estimate the referral rates for a given screening policy. We visualise the child’s trajectory as an adaptive growth chart featuring two new graphical elements: amplitude (for evaluation) and flag (for prediction). The relevant calculations take about 1 millisecond per child. Conclusion: Longitudinal references capture the dynamic nature of child growth. The adaptive growth chart for individual monitoring works with exact ages, corrects for regression to the mean, has a known distribution at any pair of ages and is fast. We recommend the method for evaluating and predicting individual child growth

    Global Scales for Early Development v1.0: Technical report

    Get PDF

    Looking back at the Gifi system of nonlinear multivariate analysis

    No full text
    Gifi was the nom de plume for a group of researchers led by Jan de Leeuw at the University of Leiden. Between 1970 and 1990 the group produced a stream of theoretical papers and computer programs in the area of nonlinear multivariate analysis that were very innovative. In an informal way this paper discusses the so-called Gifi system of nonlinear multivariate analysis, that entails homogeneity analysis (which is closely related to multiple correspondence analysis) and generalizations. The history is discussed, giving attention to the scientific philosophy of this group, and links to machine learning are indicated

    Looking Back at the Gifi System of Nonlinear Multivariate Analysis

    Get PDF
    Gifi was the nom de plume for a group of researchers led by Jan de Leeuw at the University of Leiden. Between 1970 and 1990 the group produced a stream of theoretical papers and computer programs in the area of nonlinear multivariate analysis that were very innovative. In an informal way this paper discusses the so-called Gifi system of nonlinear multivariate analysis, that entails homogeneity analysis (which is closely related to multiple correspondence analysis) and generalizations. The history is discussed, giving attention to the scientific philosophy of this group, and links to machine learning are indicated
    • …
    corecore