868 research outputs found
Host genetics and population structure effects on parasitic disease
Host genetic factors exert significant influences on differential susceptibility to many infectious diseases. In addition, population structure of both host and parasite may influence disease distribution patterns. In this study, we assess the effects of population structure on infectious disease in two populations in which host genetic factors influencing susceptibility to parasitic disease have been extensively studied. The first population is the Jirel population of eastern Nepal that has been the subject of research on the determinants of differential susceptibility to soil-transmitted helminth infections. The second group is a Brazilian population residing in an area endemic for Trypanosoma cruzi infection that has been assessed for genetic influences on differential disease progression in Chagas disease. For measures of Ascaris worm burden, within-population host genetic effects are generally more important than host population structure factors in determining patterns of infectious disease. No significant influences of population structure on measures associated with progression of cardiac disease in individuals who were seropositive for T. cruzi infection were found
Estimating and testing direct genetic effects in directed acyclic graphs using estimating equations
In genetic association studies, it is important to distinguish direct and indirect genetic effects in order to build truly functional models. For this purpose, we consider a directed acyclic graph setting with genetic variants, primary and intermediate phenotypes, and confounding factors. In order to make valid statistical inference on direct genetic effects on the primary phenotype, it is necessary to consider all potential effects in the graph, and we propose to use the estimating equations method with robust Huber-White sandwich standard errors. We evaluate the proposed causal inference based on estimating equations (CIEE) method and compare it with traditional multiple regression methods, the structural equation modeling method, and sequential G-estimation methods through a simulation study for the analysis of (completely observed) quantitative traits and time-to-event traits subject to censoring as primary phenotypes. The results show that CIEE provides valid estimators and inference by successfully removing the effect of intermediate phenotypes from the primary phenotype and is robust against measured and unmeasured confounding of the indirect effect through observed factors. All other methods except the sequential G-estimation method for quantitative traits fail in some scenarios where their test statistics yield inflated type I errors. In the analysis of the Genetic Analysis Workshop 19 dataset, we estimate and test genetic effects on blood pressure accounting for intermediate gene expression phenotypes. The results show that CIEE can identify genetic variants that would be missed by traditional regression analyses. CIEE is computationally fast, widely applicable to different fields, and available as an R package
Genetic Variation at the FTO Locus Influences RBL2 Gene Expression
OBJECTIVE - Genome-wide association studies that compare the statistical association between thousands of DNA variations and a human trait have detected 958 loci across 127 different diseases and traits. However, these statistical associations only provide evidence for genomic regions likely to harbor a causal gene(s) and do not directly identify such genes. We combined gene variation and expression data in a human cohort to identify causal genes. RESEARCH DESIGN AND METHODS - Global gene transcription activity was obtained for each individual in a large human cohort (n = 1,240). These quantitative transcript data were tested for correlation with genotype data generated from the same individuals to identify gene expression patterns influenced by the variants. RESULTS - Variant rs8050136 lies within intron 1 of the FTO gene on chromosome 16 and marks a locus strongly associated with type 2 diabetes and obesity and widely replicated across many populations. We report that genetic variation at this locus does not influence FTO gene expression levels (P = 0.38), but is strongly correlated with expression of RBL2 (P = 2.7 × 10-5), ~270,000 base pairs distant to FTO. CONCLUSIONS - These data suggest that variants at FTO influence RBL2 gene expression at large genetic distances. This observation underscores the complexity of human transcriptional regulation and highlights the utility of large human cohorts in which both genetic variation and global gene expression data are available to identify disease genes. Expedient identification of genes mediating the effects of genome-wide association study - identified loci will enable mechanism-of-action studies and accelerate understanding of human disease processes under genetic influence. © 2010 by the American Diabetes Association
Finding potential cis-regulatory loci using allele-specific chromatin accessibility as weights in a kernel-based variance component test
We present a novel approach to detect potential cis-acting regulatory loci that combines the functional potential, an empirical DNase-seq based estimate of the allele-specificity of DNase-I hypersensitivity sites, with kernel-based variance component association analyses against expression phenotypes. To test our method we used public ENCODE whole genome DNase-I sequencing data, from a single sample, to estimate the functional potentials of the subset of 10,552 noncoding heterozygous single-nucleotide polymorphisms (SNPs) that were also present in the Genetic Analysis Workshop 19 (GAW19) family-based data set. We then built two covariance kernels, one nonweighted and one weighted by the functional potentials, and conducted kernel-based variance component association analyses against the 20,527 transcript expression phenotypes in the GAW19 family-based data set. We found signals of potential cis-regulatory effects, that surpassed the Bonferroni significance threshold, for ten transcripts. Stepwise removal of the cis-located SNPs from the weighted kernel lead to the disappearance of the association signal from our top transcript hit. We found compelling evidence of allele-specific cis-regulation for four transcripts using both kernels, and our results agree with previous research that suggests the involvement of specific cis-located variants in the regulation of their neighboring gene
Phenotypic, genetic, and genome-wide structure in the metabolic syndrome
BACKGROUND: Insulin resistance, obesity, dyslipidemia, and high blood pressure characterize the metabolic syndrome. In an effort to explore the utility of different multivariate methods of data reduction to better understand the genetic influences on the aggregation of metabolic syndrome phenotypes, we calculated phenotypic, genetic, and genome-wide LOD score correlation matrices using five traits (total cholesterol, high density lipoprotein cholesterol, triglycerides, systolic blood pressure, and body mass index) from the Framingham Heart Study data set prepared for the Genetic Analysis Workshop 13, clinic visits 10 and 1 for the original and offspring cohorts, respectively. We next applied factor analysis to summarize the relationship between these phenotypes. RESULTS: Factors generated from the genetic correlation matrix explained the most variation. Factors extracted using the other matrices followed a different pattern and suggest distinct effects. CONCLUSIONS: Given these results, different methods of multivariate data reduction may provide unique clues on the clustering of this complex syndrome
Risk Factors for Soil-Transmitted Helminth Infections during the First 3 Years of Life in the Tropics; Findings from a Birth Cohort.
Background: Soil-transmitted helminths (STH) infect more than 2 billion humans worldwide, causing significant morbidity in children. There are few data on the epidemiology and risk factors for infection in pre-school children. To investigate risk factors for infection in early childhood, we analysed data prospectively collected in the ECUAVIDA birth cohort in Ecuador.
Methods and Findings:
Children were recruited at birth and followed up to 3 years of age with periodic collection of stool samples that were examined microscopically for STH parasites. Data on social, demographic, and environmental risk factors were collected from the mother at time of enrolment. Associations between exposures and detection of STH infections were analysed by multivariable logistic regression. Data were analysed from 1,697 children for whom a stool sample was obtained at 3 years. 42.3% had at least one STH infection in the first 3 years of life and the most common infections were caused by A. lumbricoides (33.2% of children) and T. trichiura (21.2%). Hookworm infection was detected in 0.9% of children. Risk of STH infection was associated with factors indicative of poverty in our study population such as Afro-Ecuadorian ethnicity and low maternal educational level. Maternal STH infections during pregnancy were strong risk factors for any childhood STH infection, infections with either A. lumbricoides or T. trichiura, and early age of first STH infection. Children of mothers with moderate to high infections intensities with A. lumbricoides were most at risk.
Conclusions:
Our data show high rates of infection with STH parasites during the first 3 years of life in an Ecuadorian birth cohort, an observation that was strongly associated with maternal STH infections during pregnancy. The targeted treatment of women of childbearing age, in particular before pregnancy, with anthelmintic drugs could offer a novel approach to the prevention of STH infections in pre-school children
Genetic signal maximization using environmental regression
Joint analyses of correlated phenotypes in genetic epidemiology studies are common. However, these analyses primarily focus on genetic correlation between traits and do not take into account environmental correlation. We describe a method that optimizes the genetic signal by accounting for stochastic environmental noise through joint analysis of a discrete trait and a correlated quantitative marker. We conducted bivariate analyses where heritability and the environmental correlation between the discrete and quantitative traits were calculated using Genetic Analysis Workshop 17 (GAW17) family data. The resulting inverse value of the environmental correlation between these traits was then used to determine a new β coefficient for each quantitative trait and was constrained in a univariate model. We conducted genetic association tests on 7,087 nonsynonymous SNPs in three GAW17 family replicates for Affected status with the β coefficient fixed for three quantitative phenotypes and compared these to an association model where the β coefficient was allowed to vary. Bivariate environmental correlations were 0.64 (± 0.09) for Q1, 0.798 (± 0.076) for Q2, and −0.169 (± 0.18) for Q4. Heritability of Affected status improved in each univariate model where a constrained β coefficient was used to account for stochastic environmental effects. No genome-wide significant associations were identified for either method but we demonstrated that constraining β for covariates slightly improved the genetic signal for Affected status. This environmental regression approach allows for increased heritability when the β coefficient for a highly correlated quantitative covariate is constrained and increases the genetic signal for the discrete trait
Genome-wide linkage scan for loci influencing plasma triglyceride levels
We conducted a genome-wide linkage scan to detect loci that influence the levels of fasting triglycerides in plasma. Fasting triglyceride levels were available at 4 time points (visits), 2 pre- and 2 post-fenofibrate intervention. Multipoint identity-by-descent (MIBD) matrices were derived from genotypes using IBDLD. Variance-component linkage analyses were then conducted using SOLAR (Sequential Oligogenic Linkage Analysis Routines). We found evidence of linkage (logarithm of odds [LOD] ≥3) at 5 chromosomal regions with triglyceride levels in plasma. The highest LOD scores were observed for linkage to the estimated genetic value (additive genetic component) of the log-normalized triglyceride levels in plasma. Our results suggest that a chromosome 10 locus at 37 cM (LODpre = 3.01, LODpost = 3.72) influences fasting triglyceride levels in plasma regardless of the fenofibrate intervention, and that loci in chromosomes 1 at 170 cM and 4 at 24 cM ceases to affect the triglyceride levels when fenofibrate is present, while the regions in chromosomes 6 at 136 to 162 cM and 11 at 39 to 40 cM appear to influence triglyceride levels in response to fenofibrate
- …