32 research outputs found

    Interrogating population structure and its impact on association tests

    Get PDF
    We found from our analysis of the Genetic Analysis Workshop 17 data that the population structure of the 697 unrelated individuals was an important confounding factor for association studies, even if it was not explicitly considered when simulating the phenotypes. We uncovered structures beyond the reported ethnicities and found ample evidence of phenotype–population structure associations. The first 10 principal components of the genotype data of the 697 individuals demonstrated much stronger associations with Q1, Q2, and the disease than did the individuals’ ethnicities. In addition, we observed that population structure was a confounding factor for the Q1-gene association when identifying the significant genes both with and without adjusting for the causal single-nucleotide polymorphisms, the ethnicities, and the principal components. Many false discoveries remained after adjusting for the causal single-nucleotide polymorphisms. Adjusting for the principal components appeared more effective than did adjusting for ethnicity in terms of preventing false discoveries. This analysis was performed with knowledge of the causal loci

    Estimating heritability using family and unrelated individuals data

    Get PDF
    For the family data from Genetic Analysis Workshop 17, we obtained heritability estimates of quantitative traits Q1 and Q4 using the ASSOC program in the S.A.G.E. software package. ASSOC is a family-based method that estimates heritability through the estimation of variance components. The covariate-adjusted mean heritability was 0.650 for Q1 and 0.745 for Q4. For the unrelated individuals data, we estimated the heritability of Q1 as the proportion of total variance that can be accounted for by all single-nucleotide polymorphisms under an additive model. We examined a novel ordinary least-squares method, a naïve restricted maximum-likelihood method, and a calibrated restricted maximum-likelihood method. We applied the different methods to all 200 replicates for Q1. We observed that the ordinary least-squares method yielded many estimates outside the interval [0, 1]. The restricted maximum-likelihood estimates were more stable than the ordinary least-squares estimates. The naïve restricted maximum-likelihood method yielded an average estimate of 0.462 ± 0.1, and the calibrated restricted maximum-likelihood method yielded an average of 0.535 ± 0.121. Our results demonstrate discrepancies in heritability estimates using the family data and the unrelated individuals data

    Genetic Background of Patients from a University Medical Center in Manhattan: Implications for Personalized Medicine

    Get PDF
    Background: The rapid progress currently being made in genomic science has created interest in potential clinical applications; however, formal translational research has been limited thus far. Studies of population genetics have demonstrated substantial variation in allele frequencies and haplotype structure at loci of medical relevance and the genetic background of patient cohorts may often be complex. Methods and Findings: To describe the heterogeneity in an unselected clinical sample we used the Affymetrix 6.0 gene array chip to genotype self-identified European Americans (N = 326), African Americans (N = 324) and Hispanics (N = 327) from the medical practice of Mount Sinai Medical Center in Manhattan, NY. Additional data from US minority groups and Brazil were used for external comparison. Substantial variation in ancestral origin was observed for both African Americans and Hispanics; data from the latter group overlapped with both Mexican Americans and Brazilians in the external data sets. A pooled analysis of the African Americans and Hispanics from NY demonstrated a broad continuum of ancestral origin making classification by race/ethnicity uninformative. Selected loci harboring variants associated with medical traits and drug response confirmed substantial within-and between-group heterogeneity. Conclusion: As a consequence of these complementary levels of heterogeneity group labels offered no guidance at the individual level. These findings demonstrate the complexity involved in clinical translation of the results from genome-wide association studies and suggest that in the genomic era conventional racial/ethnic labels are of little value.National Heart Lung and Blood Institute (NHLBI/NIH)[RO1 HL53353]Andrea and Charles Bronfman Philantropie

    Testing gene-environment interactions in gene-based association studies

    Get PDF
    Gene-based and single-nucleotide polymorphism (SNP) set association studies provide an important complement to SNP analysis. Kernel-based nonparametric regression has recently emerged as a powerful and flexible tool for this purpose. Our goal is to explore whether this approach can be extended to incorporate and test for interaction effects, especially for genes containing rare variant SNPs. Here, we construct nonparametric regression models that can be used to include a gene-environment interaction effect under the framework of the least-squares kernel machine and examine the performance of the proposed method on the Genetic Analysis Workshop 17 unrelated individuals data set. Two hundred simulated replicates were used to explore the power for detecting interaction. We demonstrate through a genome scan of the quantitative phenotype Q1 that the simulated gene-environment interaction effect in the data can be detected with reasonable power by using the least-squares kernel machine method

    On the sensitivity of the one-sided t test to covariance misspecification

    No full text
    Sensitivity analysis stands in contrast to diagnostic testing in that sensitivity analysis aims to answer the question of whether it matters that a nuisance parameter is non-zero, whereas a diagnostic test ascertains explicitly if the nuisance parameter is different from zero. In this paper, we introduce and derive the finite sample properties of a sensitivity statistic measuring the sensitivity of the t statistic to covariance misspecification. Unlike the earlier work by Banerjee and Magnus [A. Banerjee, J.R. Magnus, On the sensitivity of the usual t- and F-tests to covariance misspecification, Journal of Econometrics 95 (2000) 157-176] on the sensitivity of the F statistic, the theorems derived in the current paper hold under both the null and alternative hypotheses. Also, in contrast to Banerjee and Magnus' [see the above cited reference] results on the F test, we find that the decision to accept the null using the OLS based one-sided t test is not necessarily robust against covariance misspecification and depends much on the underlying data matrix. Our results also indicate that autocorrelation does not necessarily weaken the power of the OLS based t test.AR(1) Linear regression MA(1) Power Rule of thumb Sensitivity Size

    On the sensitivity of the one-sided t test to covariance misspecification

    Get PDF
    AbstractSensitivity analysis stands in contrast to diagnostic testing in that sensitivity analysis aims to answer the question of whether it matters that a nuisance parameter is non-zero, whereas a diagnostic test ascertains explicitly if the nuisance parameter is different from zero. In this paper, we introduce and derive the finite sample properties of a sensitivity statistic measuring the sensitivity of the t statistic to covariance misspecification. Unlike the earlier work by Banerjee and Magnus [A. Banerjee, J.R. Magnus, On the sensitivity of the usual t- and F-tests to covariance misspecification, Journal of Econometrics 95 (2000) 157–176] on the sensitivity of the F statistic, the theorems derived in the current paper hold under both the null and alternative hypotheses. Also, in contrast to Banerjee and Magnus’ [see the above cited reference] results on the F test, we find that the decision to accept the null using the OLS based one-sided t test is not necessarily robust against covariance misspecification and depends much on the underlying data matrix. Our results also indicate that autocorrelation does not necessarily weaken the power of the OLS based t test
    corecore