Search CORE

30 research outputs found

State of the Multiple Imputation Software

Author: Yucel Recai M.
Publication venue: 'Foundation for Open Access Statistic'
Publication date: 01/12/2011
Field of study

Owing to its practicality as well as strong inferential properties, multiple imputation has been increasingly popular in the analysis of incomplete data. Methods that are not only computationally elegant but also applicable in wide spectrum of statistical incomplete data problems have also been increasingly implemented in a numerous computing environments. Unfortunately, however, the speed of this development has not been replicated in reaching to "sophisticated" users. While the researchers have been quite successful in developing the underlying software, documentation in a style that would be most reachable to the greater scientific society has been lacking. The main goal of this special volume is to close this gap by articles that illustrate these software developments. Here I provide a brief history of multiple imputation and relevant software and highlight the contents of the contributions. Potential directions for the future of the software development is also provided

Directory of Open Access Journals

PubMed Central

Journal of Statistical Software

A likelihood-based approach to mixed modeling with ambiguity in cluster identifiers

Author: Andrea S. Foulkes
Chiu
Demidenko
Dempster
Diggle
Excoffier
Fitzmaurice
Foulkes
Foulkes
Foulkes
Foulkes
Jamshidian
Jennrich
Laird
Laird
Lake
Lin
Lindstrom
Little
McCulloch
Meng
Muthen
Pinheiro
Recai Yucel
Sanchez
Satten
Schafer
Tzeng
Verbeke
Verbeke
Vonesh
Wolfinger
Xiaohong Li
Publication venue: Oxford University Press
Publication date: 01/01/2008
Field of study

This manuscript describes a novel, linear mixed-effects model–fitting technique for the setting in which correlated data indicators are not completely observed. Mixed modeling is a useful analytical tool for characterizing genotype–phenotype associations among multiple potentially informative genetic loci. This approach involves grouping individuals into genetic clusters, where individuals in the same cluster have similar or identical multilocus genotypes. In haplotype-based investigations of unrelated individuals, corresponding cluster assignments are unobservable since the alignment of alleles within chromosomal copies is not generally observed. We derive an expectation conditional maximization approach to estimation in the mixed modeling setting, where cluster assignments are ambiguous. The approach has broad relevance to the analysis of data with missing correlated data identifiers. An example is provided based on data arising from a cohort of human immunodeficiency virus type-1–infected individuals at risk for antiretroviral therapy–associated dyslipidemia

Crossref

ScholarWorks@UMass Amherst

PubMed Central

Longitudinal Analysis of Cepstral Peak Prominence in Children

Author: Elizabeth Heller Murray
Recai Yucel
Publication venue: Open Science Framework
Publication date: 12/04/2024
Field of study

Objectives: To evaluate whether the acoustic measure of cepstral peak prominence changes during typical development in children 2–7. Methods: Data were retrospectively analyzed from the Arizona Child Acoustic Database Repository in this longitudinal cohort study. The Repository contains longitudinal data recordings from 63 total children between 2–7 years of age. Thirty-one children met the inclusion criteria for the current analysis (at least five time points of usable speech data, no history of speech or language difficulties, no significant dysphonia, and were monolingual speakers of American English). Cepstral peak prominence measures were calculated in Praat for each child, at each timepoint. Additional acoustic measures of vocal fundamental frequency, vocal intensity, and stimuli length were also calculated. These measures were chosen as previous work has shown they may impact cepstral peak prominence values. Results: Linear mixed-effects regression models examined the relationship between cepstral peak prominence and age, after controlling for vocal fundamental frequency, vocal intensity, and stimuli length. Within-participant effects of age were found, indicating a trajectory change in which cepstral peak prominence increases with age in this population. This positive relationship between a cepstral peak prominence and age was nonlinear, with a steeper slope between age and cepstral peak prominence after five years of age. Conclusions: This is the first study to examine the typical developmental trajectory of cepstral peak prominence children between 2–7 years, a critical period of vocal development. Cepstral peak prominence increased with age, suggesting an increase in periodicity of vocal fold vibration that coincides with the significant vocal fold structural changes occurring during this time. Outcomes present important normative information on vocal development, essential for effectively understanding the difference between what vocal changes are part of normative development and what changes indicate a voice disorder

OSF Preprints

Impact of non-normal random effects on inference by multiple imputation: A simulation assessment

Author: Demirtas Hakan
Yucel Recai M.
Publication venue
Publication date
Field of study

Multivariate extensions of well-known linear mixed-effects models have been increasingly utilized in inference by multiple imputation in the analysis of multilevel incomplete data. The normality assumption for the underlying error terms and random effects plays a crucial role in simulating the posterior predictive distribution from which the multiple imputations are drawn. The plausibility of this normality assumption on the subject-specific random effects is assessed. Specifically, the performance of multiple imputation created under a multivariate linear mixed-effects model is investigated on a diverse set of incomplete data sets simulated under varying distributional characteristics. Under moderate amounts of missing data, the simulation study confirms that the underlying model leads to a well-calibrated procedure with negligible biases and actual coverage rates close to nominal rates in estimates of the regression coefficients. Estimation quality of the random-effect variance and association measures, however, are negatively affected from both the misspecification of the random-effect distribution and number of incompletely-observed variables. Some of the adverse impacts include lower coverage rates and increased biases.

Research Papers in Economics

Imputation of Binary Treatment Variables With Measurement Error in Administrative Data

Author: Yucel Recai M.
Zaslavsky Alan M.
Publication venue
Publication date
Field of study

Research Papers in Economics

Muslim/Non-Muslim Locational Attainment in Philadelphia: A New Fault Line in Residential Inequality?

Author: Colleen E. Wynn
Joseph Gibbons
Recai Yucel
Samantha Friedman
Publication venue: OSF
Publication date: 29/01/2019
Field of study

This study examines Muslim/non-Muslim disparities in locational attainment. We pooled data from the 2004, 2006, and 2008 waves of the Public Health Management Corporation’s Southeastern Pennsylvania Household Survey. These data contain respondents’ religious identities and are geocoded at the census-tract level, allowing us to merge American Community Survey data and examine neighborhood-level outcomes to gauge respondents’ locational attainment. Net of controls, our multivariate analyses reveal that among blacks and nonblacks, Muslims live in neighborhoods that have significantly lower shares of whites and greater representations of blacks. Among blacks, Muslims are significantly less likely to reside in suburbs, relative to non-Muslims. The Muslim disadvantages for blacks and nonblacks in neighborhood poverty and neighborhood median income, however, become insignificant. Our results provide support for the tenets of the spatial assimilation and place stratification models and suggest that Muslim/non-Muslim disparities in locational attainment comprise a new fault line in residential stratification

OSF Preprints

Recommended from our members

An Expectation Maximization Approach to Estimate Malaria Haplotype Frequencies in Multiply Infected Children

Author: Foulkes Andrea S
Li Xiaohong
Rich Stephen M.
Yucel Recai M.
Publication venue: SelectedWorks
Publication date: 06/01/2012
Field of study

Characterizing genetic variability in the human pathogenic Plasmodium species, the group of parasites that cause Malaria, may have broad global health implications. Specifically, discerning the combinations of mutations that lead to viable parasites and the population level frequencies of these clonal sequences will allow for targeted vaccine development and individualized treatment choices. This presents an analytical challenge, however, since haplotypic phase (i.e. the alignment of bases on a single DNA strand) is generally unobservable in multiply infected individuals. This manuscript describes an expectation maximization (EM) approach to maximum likelihood estimation of haplotype frequencies in this missing data setting. The approach is applied to a cohort of N=341 malaria infected children in Uganda, Cameroon and Sudan to characterize regional differences. A simulation study is also presented to characterize method performance and assess sensitivity to distributional assumptions

ScholarWorks@UMass Amherst

An Expectation Maximization Approach to Estimate Malaria Haplotype Frequencies in Multiply Infected Children

Author: Foulkes Andrea S
Li Xiaohong
Rich Stephen M.
Yucel Recai M.
Publication venue
Publication date
Field of study

Research Papers in Economics