Search CORE

10 research outputs found

R script analysing variants described in the paper

Author: Bruce S. Weir (185248)
Jan Graffelman (226494)
Publication venue
Publication date: 17/01/2018
Field of study

The file contains R code that loads the data from the R package HardyWeinberg. Functions are called that analyse the data and reproduce the tables given in the paper

Dryad Digital Repository (Duke University)

The Francis Crick Institute

Simulation results.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Overall percentage of missing data, percentage of SNPs with missings, probabilities of missingness for the three genotypes and the root mean squared error (RSME) for the inbreeding coefficient () when missings are discarded, imputed by MICE or imputed by IMPUTE2, under MCAR and MNAR.</p

The Francis Crick Institute

Intensity plot of a G/T polymorphism for 146 individuals.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Missing values (NA, 33% of the data) indicated by black crosses occur mainly at the boundaries of homozygotes and heterozygotes.</p

The Francis Crick Institute

Scatter plot of inbreeding coefficients for 1070 non-monomorphic SNPs with missings obtained by multiple imputation (MICE) and single imputation (IMPUTE2).

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Scatter plot of inbreeding coefficients for 1070 non-monomorphic SNPs with missings obtained by multiple imputation (MICE) and single imputation (IMPUTE2).</p

The Francis Crick Institute

Estimation of inbreeding coefficients by multiple imputation and by omitting missings.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Left panel: using allele intensities only. Right panel: using allele intensities and covariate SNPs in LD (complete and incomplete) with . Symbols indicate the result of two significance tests: a test for HWP discarding missings and a test for HWP with imputation of missings. Circles: SNPs with both tests non-significant; Diamonds: SNPs with both tests significant; Upward triangles: SNPs with a significant chi-square test when missings are omitted, but an insignificant test when missings are imputed. Downward triangles: SNPs with a non-significant chi-square test when missings are omitted, but a significant test when missings are imputed.</p

The Francis Crick Institute

Significance tests of equal mean intensities for missing and non-missing genotyping results.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Number and percentage of significance tests are given for 140 non-monomorphic SNPs with between 10 and 50% missing values (). Results are given for tests with and without homocedasticity assumption ( is the intensity variance of the completely observed genotypes, is the intensity variance of the missing genotypes).</p

The Francis Crick Institute

Ternary plots of m = 50 imputed data set for the G/T polymorphism of Figure 1.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Curves in the ternary plots indicate the HW parabola, and the limits of the 95% acceptance region of a test for HWP. Left panel: imputed data sets with allele intensities as covariates (model 3). Right panel: imputed data sets with allele intensities and 1 covariate SNP (model 5).</p

The Francis Crick Institute

Number of imputed SNPs, number and percentage of significant SNPs with missings imputed, mean, median and maximum of the fraction of missing information () for multinomial logit models with five different sets of predictors.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

The last column (% reversal) indicates the percentage of SNPs whose test results changed status (from significant to non-significant or the reverse) in comparison with a test omitting missings.</p

The Francis Crick Institute

Ternary plots and Q-Q plots for Hardy-Weinberg proportions.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Curves in the ternary plots indicate the HW parabola, and the limits of the 95% acceptance region of a test for HWP. Top row plots are for 545 fully observed SNPs. Bottom row plots are for 140 SNPs with 10 to 50% missings (missings were discarded in these plots). The Q-Q plots show two lines, a solid reference line and an estimate of the linear tendency in the cloud of points (dashed).</p

The Francis Crick Institute

Inbreeding coefficients, confidence intervals, p-values and missing data statistics (relative increase in variance (), and fraction of missing information ()) for multiple imputation with different multinomial logit models, and for single imputation with IMPUTE2.

Author: Jan Graffelman (226494)
Milagros Sánchez (504611)
Samantha Cook (345876)
Victor Moreno (748751)
Publication venue
Publication date
Field of study

Inbreeding coefficients, confidence intervals, p-values and missing data statistics (relative increase in variance (), and fraction of missing information ()) for multiple imputation with different multinomial logit models, and for single imputation with IMPUTE2.</p

The Francis Crick Institute

R script analysing variants described in the paper

Simulation results.

Intensity plot of a G/T polymorphism for 146 individuals.

Scatter plot of inbreeding coefficients for 1070 non-monomorphic SNPs with missings obtained by multiple imputation (MICE) and single imputation (IMPUTE2).

Estimation of inbreeding coefficients by multiple imputation and by omitting missings.

Significance tests of equal mean intensities for missing and non-missing genotyping results.

Ternary plots of <i>m</i> = 50 imputed data set for the G/T polymorphism of Figure 1.

Number of imputed SNPs, number and percentage of significant SNPs with missings imputed, mean, median and maximum of the fraction of missing information () for multinomial logit models with five different sets of predictors.

Ternary plots and Q-Q plots for Hardy-Weinberg proportions.

Inbreeding coefficients, confidence intervals, <i>p</i>-values and missing data statistics (relative increase in variance (), and fraction of missing information ()) for multiple imputation with different multinomial logit models, and for single imputation with IMPUTE2.