417 research outputs found

    Pedigree and genotype errors in the Framingham Heart Study

    Get PDF
    The pedigree and genotype data from the Framingham Heart Study were examined for errors. Errors in 21 of 329 pedigrees were detected with the program PREST, and of these the errors in 16 pedigrees were resolved. Genotyping errors were then detected with SIMWALK2. Five Mendelian errors were found following the pedigree corrections. Double-recombinant errors were more common, with 142 being detected at mistyping probabilities of 0.25 or greater

    Constrained multivariate association with longitudinal phenotypes

    Get PDF
    The incorporation of longitudinal data into genetic epidemiological studies has the potential to provide valuable information regarding the effect of time on complex disease etiology. Yet, the majority of research focuses on variables collected from a single time point. This aim of this study was to test for main effects on a quantitative trait across time points using a constrained maximum-likelihood measured genotype approach. This method simultaneously accounts for all repeat measurements of a phenotype in families. We applied this method to systolic blood pressure (SBP) measurements from three time points using the Genetic Analysis Workshop 19 (GAW19) whole-genome sequence family simulated data set and 200 simulated replicates. Data consisted of 849 individuals from 20 extended Mexican American pedigrees. Comparisons were made among 3 statistical approaches: (a) constrained, where the effect of a variant or gene region on the mean trait value was constrained to be equal across all measurements; (b) unconstrained, where the variant or gene region effect was estimated separately for each time point; and (c) the average SBP measurement from three time points. These approaches were run for nine genetic variants with known effect sizes (\u3e0.001) for SBP variability and a known gene-centric kernel (MAP4)-based test under the GAW19 simulation model across 200 replicates

    Smoothing of the bivariate LOD score for non-normal quantitative traits

    Get PDF
    Variance component analysis provides an efficient method for performing linkage analysis for quantitative traits. However, type I error of variance components-based likelihood ratio testing may be affected when phenotypic data are non-normally distributed (especially with high values of kurtosis). This results in inflated LOD scores when the normality assumption does not hold. Even though different solutions have been proposed to deal with this problem with univariate phenotypes, little work has been done in the multivariate case. We present an empirical approach to adjust the inflated LOD scores obtained from a bivariate phenotype that violates the assumption of normality. Using the Collaborative Study on the Genetics of Alcoholism data available for the Genetic Analysis Workshop 14, we show how bivariate linkage analysis with leptokurtotic traits gives an inflated type I error. We perform a novel correction that achieves acceptable levels of type I error

    Genetic signal maximization using environmental regression

    Get PDF
    Joint analyses of correlated phenotypes in genetic epidemiology studies are common. However, these analyses primarily focus on genetic correlation between traits and do not take into account environmental correlation. We describe a method that optimizes the genetic signal by accounting for stochastic environmental noise through joint analysis of a discrete trait and a correlated quantitative marker. We conducted bivariate analyses where heritability and the environmental correlation between the discrete and quantitative traits were calculated using Genetic Analysis Workshop 17 (GAW17) family data. The resulting inverse value of the environmental correlation between these traits was then used to determine a new β coefficient for each quantitative trait and was constrained in a univariate model. We conducted genetic association tests on 7,087 nonsynonymous SNPs in three GAW17 family replicates for Affected status with the β coefficient fixed for three quantitative phenotypes and compared these to an association model where the β coefficient was allowed to vary. Bivariate environmental correlations were 0.64 (± 0.09) for Q1, 0.798 (± 0.076) for Q2, and −0.169 (± 0.18) for Q4. Heritability of Affected status improved in each univariate model where a constrained β coefficient was used to account for stochastic environmental effects. No genome-wide significant associations were identified for either method but we demonstrated that constraining β for covariates slightly improved the genetic signal for Affected status. This environmental regression approach allows for increased heritability when the β coefficient for a highly correlated quantitative covariate is constrained and increases the genetic signal for the discrete trait

    Strategy and model building in the fourth dimension: a null model for genotype × age interaction as a Gaussian stationary stochastic process

    Get PDF
    BACKGROUND: Using univariate and multivariate variance components linkage analysis methods, we studied possible genotype × age interaction in cardiovascular phenotypes related to the aging process from the Framingham Heart Study. RESULTS: We found evidence for genotype × age interaction for fasting glucose and systolic blood pressure. CONCLUSIONS: There is polygenic genotype × age interaction for fasting glucose and systolic blood pressure and quantitative trait locus × age interaction for a linkage signal for systolic blood pressure phenotypes located on chromosome 17 at 67 cM

    Linkage disequilibrium across two different single-nucleotide polymorphism genome scans

    Get PDF
    Linkage disequilibrium (LD) content was calculated for the Genetic Analysis Workshop 14 Affymetrix and Illumina single-nucleotide polymorphism (SNP) genome scans of the Collaborative Study on the Genetics of Alcoholism samples. Pair-wise LD was measured as both D' and r(2 )on 505 pedigree founder individuals. The r(2 )estimates were then used to correct the multipoint identity by descent matrix (MIBD) calculation to account for LD and LOD scores on chromosomes 3 and 18 were calculated for COGA's ttdt3 electrophysiological trait using those MIBDs. Extensive LD was observed throughout both marker sets, and it was higher in Affymetrix's more dense SNP map. However, SNP density did not solely account for Affymetrix's higher LD. MIBD estimation procedures assume linkage equilibrium to construct genotypes of non-genotyped pedigree founder individuals, and dense SNP genotyping maps are likely to contain moderate to high LD between markers. LOD score plots calculated after correction for LD followed the same general pattern as uncorrected ones. Since in our study almost half of the pedigree founders were genotyped, it is possible that LD had a minor impact on the LOD scores. Caution should probably be taken when using high density SNP maps when many non-genotyped founders are present in the study pedigrees

    Data for GAW20: Genome-Wide DNA Sequence Variation and Epigenome-Wide DNA Methylation Before and After Fenofibrate Treatment in a Family Study of Metabolic Phenotypes

    Get PDF
    GAW20 provided participants with an opportunity to comprehensively examine genetic and epigenetic variation among related individuals in the context of drug treatment response. GAW20 used data from 188 families (N = 1105) participating in the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN) study (clinicaltrials.gov identifier NCT00083369), which included CD4+ T-cell DNA methylation at 463,995 cytosine-phosphate-guanine (CpG) sites measured before and after a 3-week treatment with fenofibrate, single-nucleotide variation at 906,600 loci, metabolic syndrome components ascertained before and after the drug intervention, and relevant covariates. All GOLDN participants were of European descent, with an average age of 48 years. In addition, approximately half were women and approximately 40% met the diagnostic criteria for metabolic syndrome. Unique advantages of the GAW20data set included longitudinal (3 weeks apart) measurements of DNA methylation, the opportunity to explore the contributions of both genotype and DNA methylation to the interindividual variability in drug treatment response, and the familial relationships between study participants. The principal disadvantage of GAW20/GOLDN data was the spurious correlation between batch effects and fenofibrate effects on methylation, which arose because the pre- and posttreatment methylation data were generated and normalized separately, and any attempts to remove time-dependent technical artifacts would also remove biologically meaningful changes brought on by fenofibrate. Despite this limitation, the GAW20 data set offered informative, multilayered omics data collected in a large population-based study of common disease traits, which resulted in creative approaches to integration and analysis of inherited human variation

    Do rare variant genotypes predict common variant genotypes?

    Get PDF
    The synthetic association hypothesis proposes that common genetic variants detectable in genome-wide association studies may reflect the net phenotypic effect of multiple rare polymorphisms distributed broadly within the focal gene rather than, as often assumed, the effect of common functional variants in high linkage disequilibrium with the focal marker. In a recent study, Dickson and colleagues demonstrated synthetic association in simulations and in two well-characterized, highly polymorphic human disease genes. The converse of this hypothesis is that rare variant genotypes must be correlated with common variant genotypes often enough to make the phenomenon of synthetic association possible. Here we used the exome genotype data provided for Genetic Analysis Workshop 17 to ask how often, how well, and under what conditions rare variant genotypes predict the genotypes of common variants within the same gene. We found nominal evidence of correlation between rare and common variants in 21-30% of cases examined for unrelated individuals; this rate increased to 38-44% for related individuals, underscoring the segregation that underlies synthetic association

    Effect of genotype × alcoholism interaction on linkage analysis of an alcoholism-related quantitative phenotype

    Get PDF
    Studies have shown that genetic and environmental factors and their interactions affect several alcoholism phenotypes. Genotype × alcoholism (G×A) interaction refers to the environmental (alcoholic and non-alcoholic) influences on the autosomal genes contributing to variation in an alcoholism-related quantitative phenotype. The purpose of this study was to examine the effects of G×A interaction on the detection of linkage for alcoholism-related phenotypes. We used phenotypic and genotypic data from the Collaborative Study on the Genetics of Alcoholism relating to 1,388 subjects as part of Genetic Analysis Workshop 14 problem 1. We analyzed the MXDRNK phenotype to detect G×A interaction using SOLAR. Upon detecting significant interaction, we conducted variance-component linkage analyses using microsatellite marker data. For maximum number of drinks per a 24 hour period, the highest LODs were observed on chromosomes 1, 4, and 13 without G×A interaction. Interaction analysis yielded four regions on chromosomes 1, 4, 13, and 15. On chromosome 4, a maximum LOD of 1.5 at the same location as the initial analysis was obtained after incorporating G×A interaction effects. However, after correcting for extra parameters, the LOD score was reduced to a corrected LOD of 1.1, which is similar to the LOD observed in the non-interaction analysis. Thus, we see little differences in LOD scores, while some linkage regions showed large differences in the magnitudes of estimated quantitative trait loci heritabilities between the alcoholic and non-alcoholic groups. These potential hints of differences in genetic effect may influence future analyses of variants under these linkage peaks
    • …
    corecore