Search CORE

26 research outputs found

The power and statistical behaviour of allele-sharing statistics when applied to models with two disease loci

Author: Collins Andrew
Feng Bing-Jian
Shugart Yin Y.
Publication venue
Publication date: 01/12/2002
Field of study

We have evaluated the power for detecting a common trait determined by two loci, using seven statistics, of which five are implemented in the computer program SimWalk2, and two are implemented in GENEHUNTER. Unlike most previous reports which involve evaluations of the power of allele-sharing statistics for a single disease locus, we have used a simulated data set of general pedigrees in which a two-locus disease is segregating and evaluated several nonparametric linkage statistics implemented in the two programs. We found that the power for detecting linkage using the S(all) statistic in GENEHUNTER (GH, version 2.1), implemented as statistic E in SimWalk2 (version 2.82), is different in the two. The P values associated with statistic E output by SimWalk2 are consistently more conservative than those from GENEHUNTER except when the underlying model includes heterogeneity at a level of 50% where the P values output are very comparable. On the other hand, when the thresholds are determined empirically under the null hypothesis, S(all) in GENEHUNTER and statistic E have similar powe

Southampton (e-Prints Soton)

Application of family-based association testing to assess the genotype-phenotype association involved in complex traits using single-nucleotide polymorphisms

Author: Guo Mitchell
Shugart Yin Y
Wang Ming-Hsi
Publication venue: BioMed Central
Publication date: 01/12/2005
Field of study

BACKGROUND: We used the FBAT (family-based association test) software to test for association between 300 individual single-nucleotide polymorphisms and P1 (a latent trait of Kofendred Personality Disorder) in 100 simulated replicates of the Aipotu population. Using the Genetic Analysis Workshop 14 dataset, we calculated the power of FBAT to detect linkage disequilibrium on chromosome 3 (D2). Also, we calculated the false-positive rate on chromosome 1, which contains a true locus (D1) but no linkage disequilibrium was simulated between the trait and all the surrounding single-nucleotide polymorphisms. RESULTS: We were able to detect the associations between phenotype P1 and three adjacent markers B03T3056 (average p-value = 0.0002), B03T3057 (average p-value = 0.00072), and B03T3058 (average p-value = 0.0038) with power of 98%, 87%, 71% on chromosome 3, respectively. The overall false positve rate to detect association was 0.06 on chromosome 1. CONCLUSION: The power to detect a significant association in 100 nuclear families affected with the latent trait of Kofendred Personality Disorder by using FBAT was reasonable (based on 100 replicates). In the future, we will compare the performance of FBAT with alternative approaches, such as using FBAT-generalized estimating equations methods to test for association in families affected with complex traits

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Explore Bristol Research

Application of the propensity score in a covariate-based linkage analysis of the Collaborative Study on the Genetics of Alcoholism

Author: Bailey-Wilson Joan E
Doan Betty Q
Frangakis Constantine E
Shugart Yin Y
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: Covariate-based linkage analyses using a conditional logistic model as implemented in LODPAL can increase the power to detect linkage by minimizing disease heterogeneity. However, each additional covariate analyzed will increase the degrees of freedom for the linkage test, and therefore can also increase the type I error rate. Use of a propensity score (PS) has been shown to improve consistently the statistical power to detect linkage in simulation studies. Defined as the conditional probability of being affected given the observed covariate data, the PS collapses multiple covariates into a single variable. This study evaluates the performance of the PS to detect linkage evidence in a genome-wide linkage analysis of microsatellite marker data from the Collaborative Study on the Genetics of Alcoholism. Analytical methods included nonparametric linkage analysis without covariates, with one covariate at a time including multiple PS definitions, and with multiple covariates simultaneously that corresponded to the PS definitions. Several definitions of the PS were calculated, each with increasing number of covariates up to a maximum of five. To account for the potential inflation in the type I error rates, permutation based p-values were calculated. RESULTS: Results suggest that the use of individual covariates may not necessarily increase the power to detect linkage. However the use of a PS can lead to an increase when compared to using all covariates simultaneously. Specifically, PS3, which combines age at interview, sex, and smoking status, resulted in the greatest number of significant markers identified. All methods consistently identified several chromosomal regions as significant, including loci on chromosome 2, 6, 7, and 12. CONCLUSION: These results suggest that the use of a propensity score can increase the power to detect linkage for a complex disease such as alcoholism, especially when multiple important covariates can be used to predict risk and thereby minimize linkage heterogeneity. However, because the PS is calculated as a conditional probability of being affected, it does require the presence of observed covariate data on both affected and unaffected individuals, which may not always be available in real data sets

Crossref

Springer - Publisher Connector

PubMed Central

Explore Bristol Research

knnAUC: an open-source R package for detecting nonlinear dependence between one continuous variable and one binary variable

Author: Hao Meng
Jin Li
Li Yi
Liu Jie
Liu Xiaoyu
Ma Yanyun
Shugart Yin Y
Wang Jiucun
Wang Yi
Xiong Momiao
Yuan Zhenghong
Zhou Weichen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2018
Field of study

Abstract Background Testing the dependence of two variables is one of the fundamental tasks in statistics. In this work, we developed an open-source R package (knnAUC) for detecting nonlinear dependence between one continuous variable X and one binary dependent variables Y (0 or 1). Results We addressed this problem by using knnAUC (k-nearest neighbors AUC test, the R package is available at https://sourceforge.net/projects/knnauc/ ). In the knnAUC software framework, we first resampled a dataset to get the training and testing dataset according to the sample ratio (from 0 to 1), and then constructed a k-nearest neighbors algorithm classifier to get the yhat estimator (the probability of y = 1) of testy (the true label of testing dataset). Finally, we calculated the AUC (area under the curve of receiver operating characteristic) estimator and tested whether the AUC estimator is greater than 0.5. To evaluate the advantages of knnAUC compared to seven other popular methods, we performed extensive simulations to explore the relationships between eight different methods and compared the false positive rates and statistical power using both simulated and real datasets (Chronic hepatitis B datasets and kidney cancer RNA-seq datasets). Conclusions We concluded that knnAUC is an efficient R package to test non-linear dependence between one continuous variable and one binary dependent variable especially in computational biology area.https://deepblue.lib.umich.edu/bitstream/2027.42/146514/1/12859_2018_Article_2427.pd

Directory of Open Access Journals

Deep Blue Documents

Efficient test for nonlinear dependence of two continuous variables

Author: A Gretton
AC Aitken
B Li
B Murrell
B Stroustrup
CF Dietrich
CGAR Network
D Albanese
D Reshef
DN Reshef
DN Reshef
DS Burke
F Galton
FE Croxton
GE Wilding
GJ Székely
GJ Székely
H Kirikoshi
H Scheffe
Hongbao Cao
J Cohen
J Jiang
JB Kinney
JL Myers
K Pearson
L Grosse
L Tierney
Li Jin
MG Kendall
Momiao Xiong
MR Kosorok
MZ Dieter
N Lockyer
NS Altman
P Good
P Huber
PS Horn
R Natrajan
SA Ha
SJ Devlin
T Zhang
WS Cleveland
Y Tanaka
Yi Li
Yi Wang
Yin Yao Shugart
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Evidence for contribution of common genetic variants within chromosome 8p21.2-8p21.1 to restricted and repetitive behaviors in autism spectrum disorders

Author: A Smallwood
AP Boyle
B Goldman
B Howie
Benjamin Ackerman
BJ Crespi
BN Howie
BP Coe
BS Abrahams
C Bertocchi
C Betancur
C Cheng
C Gillberg
C Lord
C Lord
D Ma
D Spiker
David Saffen
DH Geschwind
DH Geschwind
DH Geschwind
DS Cannon
E Fombonne
ER Martin
G Bellot
G Tang
GD Fischbach
GH Dunteman
H Hagberg
HG Kim
HM Ozgen
Hui Gao
I Bieche
JC Lambert
JI Tracy
JK Lowe
JL Stone
JM Berg
JM Silverman
K Papanikolaou
K Wang
K Whalley
K Xia
KS Lam
LA Weiss
M Elsabbagh
M Lewis
M Zhu
MD Fallin
ML Cuccaro
N Lacy de
N Ludemann
N Ohkawa
O Delaneau
OJ Veatch
P Szatmari
P Szatmari
PA Holmans
Q Yang
R Anney
R Anney
R Tabares-Seisdedos
S Numata
S Ribich
SL Bishop
SL Bishop
SR Leekam
SS Jeste
T Koide
TE Galesloot
Team RDC
V Hus
VW Hu
VW Hu
W Huang da
W Huang da
Wei Guo
WP Mandy
X Wang
X Zhou
XQ Liu
Y Benjamini
Y Shao
Y Taniyama
Yin Yao Shugart
YS Kim
Yu Tao
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Average -value of 100 replicates for each SNP on chromosome 1 (A) and chromosome 3 (B)

Author: Ming-Hsi Wang (73728)
Mitchell Guo (73729)
Yin Y Shugart (28972)
Publication venue
Publication date
Field of study

Copyright information:Taken from "Application of family-based association testing to assess the genotype-phenotype association involved in complex traits using single-nucleotide polymorphisms"BMC Genetics 2005;6(Suppl 1):S68-S68.Published online 30 Dec 2005PMCID:PMC1866714.</p

The Francis Crick Institute

New insights into the genetic mechanism of IQ in Autism Spectrum Disorders

Author: Hai-De eQin
Harold Zhaokun Wang
Jack eSamuels
Wei eGuo
Yin Y Shugart
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2013
Field of study

Autism spectrum disorders (ASD) is a complex condition with a number of underlying subtypes with different symptoms and presumably different genetic causes. One important difference between these sub-phenotypes is cognition. Some forms of ASD such as Asperger’s keep cognitive development intact while the majority does not. In this study we explored genetic factors that might account for this difference. Using a case-control study based on IQ status in 1657 ASD probands, we analyzed both common and rare variants provided by the Autism Genome Project consortium via dbGAP, identified a set of genes, among them SYT1L and KIAA0319L, that are strongly associated with IQ within a population of ASD patients

Directory of Open Access Journals

PubMed Central

Frontiers - Publisher Connector

The impact of redefining affection status for alcoholism on affected- sib-pair analysis

Author: Chang C. J.
Collins Andrew
Fann Cathy S.J.
Lachman Herb
Shugart Yin Y.
Publication venue: 'Wiley'
Publication date: 01/01/1999
Field of study

The analysis of a complex disease such as alcohol dependence requires a more precise definition of affection status. Collaborative Study on the Genetics of Alcoholism (COGA) provided a variety of qualitative and quantitative measures as well as genotype information, in addition to two criteria of affection status. To identify two groups of phenotypically 'more homogeneous' individuals among alcoholics (COGA criterion), we redefined affection status by using cluster analysis and classification and regression tree, incorporating some important covariates such as event related potentials, monoamine oxidase B activity, status of smoking, age of onset, three variables of personality assessed with the Tridimensional Personality Questionnaire and three latent class variables. With redefined affection status, we repeated nonparametric analysis by three sib pair analysis programs (SIBPAL, SIBPAIR, and BETA) using nine candidate DNA markers identified by Reich et al. [1998] and Long et al. [1998]. The goals of our analysis are 1) to confirm previous results for these nine markers with redefined affection status and 2) to compare the performance from these three programs.</p

Southampton (e-Prints Soton)