721 research outputs found

PredictABEL: an R package for the assessment of risk prediction models

Author: A. Cecile J. W. Janssens
AC Janssens
Cornelia M. van Duijn
DW Hosmer
EW Steyerberg
JA Hanley
JM Seddon
K McGeechan
MA Hlatky
MJ Khoury
MJ Pencina
NJ Nagelkerke
NR Cook
Suman Kundu
YS Aulchenko
Yurii S. Aulchenko
Publication venue: Springer Netherlands
Publication date: 01/01/2011

The rapid identification of genetic markers for multifactorial diseases from genome-wide association studies is fuelling interest in investigating the predictive ability and health care utility of genetic risk models. Various measures are available for the assessment of risk prediction models, each addressing a different aspect of performance and utility. We developed PredictABEL, a package in R that covers descriptive tables, measures and figures that are used in the analysis of risk prediction studies, such as measures of model fit, predictive ability and clinical utility, as well as risk distributions, calibration plots and receiver operating characteristic plots. Tables and figures are saved as separate files in a user-specified format, including publication-quality EPS and TIFF formats. All figures are available in a ready-made layout, but they can be customized to the preferences of the user. The package has been developed for the analysis of genetic risk prediction studies, but can also be used for studies that only include non-genetic risk factors. PredictABEL is freely available at the websites of GenABEL (http://www.genabel.org) and CRAN (http://cran.r-project.org/).

Crossref

Springer - Publisher Connector

PubMed Central

EUR Research Repository

Erasmus University Digital Repository

Applying different genomic evaluation approaches on QTLMAS2010 dataset

Author: AR Gilmour
AW George
BJ Hayes
CS Haley
G Hadjipavlou
G Seaton
Javad Nadaf
N Amin
N Cameron
R Pong-Wong
RE Kass
Ricardo Pong-Wong
THE Meuwissen
YS Aulchenko
Publication venue: BioMed Central
Publication date: 01/01/2011

Crossref

Springer - Publisher Connector

PubMed Central

Edinburgh Research Explorer

Heritability of fasting glucose levels in a young genetically isolated population

Author: Aulchenko YS
Duijn Cornelia
Oostra Ben
Pols Huib
Rivadeneira Fernando
Santos RLP
Zillikens M.C.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2006

EUR Research Repository

Genetic factors influence the clustering of depression among individuals with lower socioeconomic status

Author: Aulchenko YS
Choy WC
Claes SJ
Duijn Cornelia
Janssens Cecile
Lopez Leon S (Sandra)
Mackenbach Johan
Oostra Ben
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2009

EUR Research Repository

Association between Type 2 Diabetes Loci and Measures of Fatness

Author: AL Gloyn
Ben A. Oostra
Cornelia M. van Duijn
D Altshuler
E Zeggini
JE Cecil
LJ Scott
LM Pardo
M Ghoussaini
M. Carola Zillikens
Michael Nicholas Weedon
Peter Henneman
Pieter J. Snijders
PJ Campbell
R Sladek
S Cauchi
S Wild
SA Bacanu
SF Grant
Slavica Pecioska
SS Deeb
TM Frayling
YS Aulchenko
Yurii S. Aulchenko
Publication venue: Public Library of Science
Publication date: 01/01/2010

Background: Type 2 diabetes (T2D) is a metabolic disorder characterized by disturbances of carbohydrate, fat and protein metabolism and insulin resistance. The majority of T2D patients are obese and obesity by itself may be a cause of insulin resistance. Our aim was to evaluate whether the recently identified T2D risk alleles are associated with human measures of fatness as characterized with Dual Energy X-ray Absorptiometry (DEXA). Methodology/Principal Findings: Genotypes and phenotypes of approximately 3,000 participants from the cross-sectional ERF study were analyzed. Nine single nucleotide polymorphisms (SNPs) in CDKN2AB, CDKAL1, FTO, HHEX, IGF2BP2, KCNJ11, PPARG, SLC30A8 and TCF7L2 were genotyped. We used linear regression to study the association of individual SNPs and the combined allelic risk score with body mass index (BMI), fat mass index (FMI), fat percentage (FAT), waist circumference (WC) and waist to hip ratio (WHR). Significant association was observed between rs8050136 (FTO) and BMI (p = 0.003), FMI (p = 0.007) and WC (p = 0.03); the association with fat percentage was borderline significant (p = 0.053). No other SNPs, alone or combined in a risk score, demonstrated significant association with the measures of fatness. Conclusions/Significance: Of the recently identified T2D risk variants, only the risk variant of the FTO gene (rs8050136) showed statistically significant association with BMI, FMI, and WC.
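
The regression of a fatness measure on a combined allelic risk score can be illustrated with a minimal sketch. It assumes an unweighted score (a simple count of risk alleles over SNPs coded 0/1/2) and a single-predictor least-squares fit; the abstract does not state how the score was weighted or which covariates were used, so both choices, the function names, and the toy data are illustrative assumptions rather than the study's actual method.

```python
from statistics import mean


def allelic_risk_score(genotypes):
    """Unweighted score (assumption): total risk-allele count over SNPs coded 0/1/2."""
    return sum(genotypes)


def simple_linear_regression(x, y):
    """Least-squares fit of y = intercept + slope * x for one predictor."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, my - slope * mx


# Hypothetical toy data: one row per participant, one column per SNP.
genotype_rows = [[0, 1, 2], [1, 1, 1], [2, 2, 2], [0, 0, 0], [2, 1, 0]]
scores = [allelic_risk_score(row) for row in genotype_rows]
# Synthetic outcome constructed for illustration only (not study data).
bmi = [20.0 + 0.5 * s for s in scores]
slope, intercept = simple_linear_regression(scores, bmi)
```

A real analysis would add covariates and a significance test for the slope; this sketch only shows how the score enters the regression as a single predictor.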

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Erasmus University Digital Repository

Rapid and robust association mapping of expression quantitative trait loci

Author: Alex C Lam
Chris S Haley
Dirk-Jan de Koning
E Boerwinkle
GR Abecasis
JD Storey
M Morley
Michael Schouten
T Pastinen
YS Aulchenko
Yurii S Aulchenko
Publication venue: Springer Science and Business Media LLC

Crossref

ParallABEL: an R library for generalized parallelization of genome-wide association studies

Author: F Dudbridge
G Vera
H Mishima
J Hill
K Misawa
L Ma
LA Hindorff
NM Laird
Pichaya Tandayya
R Ihaka
RM Plenge
Surakameth Mahasirimongkol
TA Pearson
Unitsa Sangket
Wasun Chantratita
YS Aulchenko
Yurii S Aulchenko
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Genome-Wide Association (GWA) analysis is a powerful method for identifying loci associated with complex traits and drug response. Parts of GWA analyses, especially those involving thousands of individuals and consuming hours to months, will benefit from parallel computation. It is arduous to acquire the programming skills needed to correctly partition and distribute data, control and monitor tasks on clustered computers, and merge output files. Results: Most components of GWA analysis can be divided into four groups based on the types of input data and statistical outputs. The first group contains statistics computed for a particular Single Nucleotide Polymorphism (SNP), or trait, such as SNP characterization statistics or association test statistics. The input data of this group includes the SNPs/traits. The second group concerns statistics characterizing an individual in a study, for example, the summary statistics of genotype quality for each sample. The input data of this group includes individuals. The third group consists of pair-wise statistics derived from analyses between each pair of individuals in the study, for example genome-wide identity-by-state or genomic kinship analyses. The input data of this group includes pairs of SNPs/traits. The final group concerns pair-wise statistics derived for pairs of SNPs, such as the linkage disequilibrium characterisation. The input data of this group includes pairs of individuals. We developed the ParallABEL library, which utilizes the Rmpi library, to parallelize these four types of computations. The ParallABEL library is not only aimed at GenABEL, but may also be employed to parallelize various GWA packages in R. The data set from the North American Rheumatoid Arthritis Consortium (NARAC), which includes 2,062 individuals genotyped for 545,080 SNPs, was used to measure ParallABEL performance. Almost perfect speed-up was achieved for many types of analyses. For example, the computing time for the identity-by-state matrix was linearly reduced from approximately eight hours to one hour when ParallABEL employed eight processors. Conclusions: Executing genome-wide association analysis using the ParallABEL library on a computer cluster is an effective way to boost performance and simplify the parallelization of GWA studies. ParallABEL is a user-friendly parallelization of GenABEL.
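
The partition, distribute, and merge pattern described for the first group (per-SNP statistics) can be sketched as follows. This is not the ParallABEL or Rmpi API: the function names, the allele-frequency statistic, and the use of a Python thread pool in place of an MPI cluster are all assumptions made to keep the example self-contained.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(n_items, n_workers):
    """Split range(n_items) into n_workers contiguous, near-equal chunks."""
    base, extra = divmod(n_items, n_workers)
    chunks, start = [], 0
    for worker in range(n_workers):
        size = base + (1 if worker < extra else 0)
        chunks.append(range(start, start + size))
        start += size
    return chunks


def allele_frequency(genotypes):
    """Frequency of the counted allele for one SNP coded 0/1/2 (None = missing)."""
    called = [g for g in genotypes if g is not None]
    return sum(called) / (2 * len(called)) if called else float("nan")


def per_snp_statistics(snp_rows, n_workers=4):
    """Compute a per-SNP statistic chunk by chunk, then merge in original order."""
    chunks = partition(len(snp_rows), n_workers)

    def work(chunk):
        return [allele_frequency(snp_rows[i]) for i in chunk]

    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        parts = list(pool.map(work, chunks))  # map preserves chunk order
    return [value for part in parts for value in part]


# Hypothetical toy data: one row per SNP, one column per individual.
toy_snps = [[0, 1, 2], [2, 2, None], [0, 0, 0]]
frequencies = per_snp_statistics(toy_snps, n_workers=2)
```

Threads are used here only for portability; they would not give the near-linear speed-up reported in the abstract, which relies on separate processors. The same chunking idea applies to the per-individual group, while the pair-wise groups would partition index pairs instead of single indices.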

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Erasmus University Digital Repository

ProbABEL package for genome-wide association analysis of imputed data

Author: A Zeileis
AL Price
B Devlin
B Servin
BN Howie
CA Anderson
CE McCulloch
Cornelia M van Duijn
D Clayton
GR Abecasis
H White
J Marchini
J Yu
JM Vink
K Estrada
K Hao
LA Hindorff
LM Pardo
M Perez-Enciso
Maksim V Struchalin
N Amin
NL Heard-Costa
OM Woodward
S Purcell
TI Axenovich
W Kim
WM Chen
Y Li
Y Li
YS Aulchenko
Yurii S Aulchenko
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Over the last few years, genome-wide association (GWA) studies became a tool of choice for the identification of loci associated with complex traits. Currently, imputed single nucleotide polymorphism (SNP) data are frequently used in GWA analyses. Correct analysis of imputed data calls for the implementation of specific methods which take genotype imputation uncertainty into account. Results: We developed the ProbABEL software package for the analysis of genome-wide imputed SNP data and quantitative, binary, and time-till-event outcomes under linear, logistic, and Cox proportional hazards models, respectively. For quantitative traits, the package also implements a fast two-step mixed model-based score test for association in samples with differential relationships, facilitating analysis in family-based studies, studies performed in human genetically isolated populations and outbred animal populations. Conclusions: The ProbABEL package provides a fast, efficient way to analyze imputed data in a genome-wide context and will facilitate future identification of complex trait loci.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Erasmus University Digital Repository

Are your covariates under control? How normalization can re-introduce covariate effects

Author: A Ronald
AE Locke
Angelica Ronald
B Peng
B Servin
CMA Haworth
E Feingold
Frank Dudbridge
K Wang
LV Wain
Oliver Pain
SE Jones
TM Beasley
WD Berry
YS Aulchenko
Publication venue: Springer Science and Business Media LLC
Publication date: 15/05/2017

Many statistical tests rely on the assumption that the residuals of a model are normally distributed. Rank-based inverse normal transformation (INT) of the dependent variable is one of the most popular approaches to satisfy the normality assumption. Studies regularly adjust for covariates and then normalize the residuals. This study investigated the effect of regressing covariates against the dependent variable and then applying rank-based INT to the residuals. The correlation between the dependent variable and covariates at each stage of processing was assessed. An alternative approach, applying rank-based INT to the dependent variable before regressing out covariates, was also tested. Analyses based on both simulated and real data examples demonstrated that applying rank-based INT to the dependent variable residuals after regressing out covariates re-introduces a linear correlation between the dependent variable and covariates in almost all situations. This will increase type-1 errors and reduce power. Our proposed alternative approach, where rank-based INT was applied prior to controlling for covariate effects, gave residuals that were normally distributed and linearly uncorrelated with covariates. This approach is therefore recommended.
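
The recommended ordering (rank-based INT first, covariate adjustment second) can be sketched as below. The Blom offset c = 3/8, average ranks for ties, the single-covariate least-squares adjustment, and all function names are assumptions for illustration; the abstract does not specify these details.

```python
from statistics import NormalDist, mean


def rank_int(values, c=3.0 / 8.0):
    """Rank-based inverse normal transformation; ties receive their average rank."""
    n = len(values)
    order = sorted(range(n), key=lambda i: values[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        j = i
        while j + 1 < n and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2.0 + 1.0  # 1-based average rank of the tie group
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    normal = NormalDist()
    return [normal.inv_cdf((r - c) / (n - 2.0 * c + 1.0)) for r in ranks]


def residualize(y, x):
    """Residuals from a least-squares regression of y on one covariate x."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    return [yi - my - slope * (xi - mx) for xi, yi in zip(x, y)]


def int_then_adjust(y, x):
    """Apply rank-based INT to y first, then regress out the covariate."""
    return residualize(rank_int(y), x)


# Hypothetical toy data: a skewed outcome and one covariate.
outcome = [2.0, 9.5, 1.0, 4.0, 30.0]
covariate = [1.0, 2.0, 3.0, 4.0, 5.0]
adjusted = int_then_adjust(outcome, covariate)
```

Because the regression is the last step, the returned residuals are linearly uncorrelated with the covariate by construction, which is the property the abstract highlights. Reversing the order, `rank_int(residualize(y, x))`, would correspond to the approach the study cautions against.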

Crossref

LSHTM Research Online

Birkbeck Institutional Research Online

King's Research Portal

Leicester Research Archive