90 research outputs found
Quantifying Ascertainment Bias and Determining Proxy Ancestral Alleles in Human Genome-Wide Polymorphic Data for Use in the Determination of Human Demographic History
Thesis advisor: Gabor T. MarthThesis advisor: Eric F. TsungMy work is part of an effort in Dr. Gabor Marth's population genetics lab to extend the work of Marth's 2004 Genetics paper "The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations" by applying its methods to new datasets. My contribution toward this end has been to create computer code (in Perl and Bash) to quantify ascertainment bias and determine proxy ancestral alleles in human genome-wide polymorphic data for post-doctoral fellow Dr. Eric Tsung's use in the determination of human demographic history. The final results of my efforts will be part of a poster by Dr. Tsung (with myself as a second author) displayed at the 2007 Biology of Genomes Symposium at Cold Spring Harbor Laboratory in Cold Spring Harbor, New York. Our goal is to turn that poster into a paper (on which I will be an author) for submission for publication in a major scientific research periodical and which will also be available in the future at http://bioinformatics.bc.edu/marthlab/ascertainmentancestral/.Thesis (BS) — Boston College, 2007.Submitted to: Boston College. College of Arts and Sciences.Discipline: Biology.Discipline: College Honors Program
Genetic contributions to obesity and related complex traits in the Cebu Longitudinal Health and Nutrition Survey
Obesity represents a major world health problem across socio-economic strata. Characterizing known candidate genes and identifying novel genes contributing to obesity and other related heritable risk factors may aid our understanding of its complex etiology and provide new treatment targets. Based in the Philippines, a country experiencing a rapid obesogenic transition, the Cebu Longitudinal Health and Nutrition Survey (CLHNS) is a useful population cohort in which to study the genetics of obesity. For more than 20 years since 1983, researchers have followed a cohort of Filipino mothers and their offspring from the metropolitan area of Cebu, collecting a wide variety of anthropometric, environmental, and dietary phenotypes. In 2005, blood samples were collected from all the participants, allowing the measurement of multiple plasma biomarkers, the extraction of genomic DNA, and the genetic analysis of complex traits. In this dissertation, I describe my genetic studies of three categories of obesity-related complex traits in the CLHNS: body size (body mass index (BMI), weight, and waist circumference), metabolic (adiponectin), and physiological (age of menarche). I first performed a genome-wide association (GWA) study of body size traits in Filipinos to determine the contribution of common genetic variants to those traits. I replicated three well-known BMI loci (BDNF, MC4R, and FTO) and further observed evidence of longitudinal changes in the effects of those genes. Next, I sought to establish the putative causal variant(s) underlying a haplotype at the ADIPOQ gene identified in a previous CLHNS GWA study to be strongly associated with lower circulating plasma adiponectin level. I identified a population-specific missense variant (R221S) that affected the original measurement of the phenotype and explained the observed GWA signal. Next, to identify novel biology underlying the etiology of central adiposity, I performed a meta-analysis of GWA studies of waist circumference (WC) in European individuals from the Genetic Investigation of Anthropometric Traits (GIANT) Consortium. Findings from this large-scale study may suggest other candidate loci for future genetic study in the CLHNS. I observed evidence of novel associations specific to WC and not other related anthropometric traits, such as BMI, height, and waist-hip ratio. These signals included the genes NLRP3, which is part of the obesity-related inflammasome complex, and IRS1, which is previously associated with body fat percentage and an adverse metabolic profile. Finally, to explore the relationships between several obesity genes and age of menarche, I performed replication and mediation analyses in female offspring from the CLHNS. I observed that the menarche signal at LIN28B previously reported in Europeans replicated in Filipinos. While this association appeared not to be mediated by childhood BMI, the results for several obesity genes were inconclusive. Together, these studies further our understanding of the genetic architecture of several obesity traits, and suggest many promising biological candidates for further genetic and molecular study.Doctor of Philosoph
A comprehensive SNP and indel imputability database
Motivation: Genotype imputation has become an indispensible step in genome-wide association studies (GWAS). Imputation accuracy, directly influencing downstream analysis, has shown to be improved using re-sequencing-based reference panels; however, this comes at the cost of high computational burden due to the huge number of potentially imputable markers (tens of millions) discovered through sequencing a large number of individuals. Therefore, there is an increasing need for access to imputation quality information without actually conducting imputation. To facilitate this process, we have established a publicly available SNP and indel imputability database, aiming to provide direct access to imputation accuracy information for markers identified by the 1000 Genomes Project across four major populations and covering multiple GWAS genotyping platforms
Genome-Wide Association Study of Anthropometric Traits and Evidence of Interactions With Age and Study Year in Filipino Women
Increased values of multiple adiposity-related anthropometric traits are important risk factors for many common complex diseases. We performed a genome-wide association (GWA) study for four quantitative traits related to body size and adiposity (body mass index [BMI], weight, waist circumference, and height) in a cohort of 1,792 adult Filipino women from the Cebu Longitudinal Health and Nutrition Survey. This is the first GWA study of anthropometric traits in Filipinos, a population experiencing a rapid transition into a more obesogenic environment. In addition to identifying suggestive evidence of additional SNP association signals (P < 10−5), we replicated (P < 0.05, same direction of additive effect) associations previously reported in European populations of both BMI and weight with MC4R and FTO, of BMI with BDNF, and of height with EFEMP1, ZBTB38, and NPPC, but none with waist circumference. We also replicated loci reported in Japanese or Korean populations as associated with BMI (OTOL1) and height (HIST1H1PS2, C14orf145, GPC5). A difference in local linkage disequilibrium between European and Asian populations suggests a narrowed association region for BDNF, while still including a proposed functional non-synonymous amino acid substitution variant (rs6265, Val66Met). Finally, we observed significant evidence (P < 0.0042) for age-by-genotype interactions influencing BMI for rs17782313 (MC4R) and rs9939609 (FTO), and for a study year-by-genotype interaction for rs4923461 (BDNF). Our results show that several genetic risk factors are associated with anthropometric traits in Filipinos and provide further insight into the effects of BDNF, FTO, and MC4R on BMI
Genetic association with lipids in Filipinos: waist circumference modifies an APOA5 effect on triglyceride levels
Blood levels of lipoprotein cholesterol and triglycerides (TGs) are highly heritable and are major risk factors for cardiovascular disease (CVD). Approximately 100 lipid-associated loci have been identified in populations of European ancestry. We performed a genome-wide association study of lipid traits in 1,782 Filipino women from the Cebu Longitudinal Health and Nutrition Survey, and tested for evidence of interactions with waist circumference. We conducted additional association and interaction analyses in 1,719 of their young adult offspring. Genome-wide significant associations (P < 5 × 10−8) were detected at APOE for low density lipoprotein cholesterol and total cholesterol, and at APOA5 for TGs. Suggestive associations (P < 10−6) were detected at GCKR for TGs, and at CETP and TOM1 for high density lipoprotein cholesterol. Our data also supported the existence of allelic heterogeneity at APOA5, CETP, LIPC, and APOE. The secondary signal (Gly185Cys) at APOA5 exhibited a single nucleotide polymorphism (SNP)-by-waist circumference interaction affecting TGs (Pinteraction = 1.6 × 10−4), manifested by stronger SNP effects as waist circumference increased. These findings provide the first evidence that central obesity may accentuate the effect of the TG-increasing allele of the APOA5 signal, emphasizing that CVD risk could be reduced by central obesity control
Recommended from our members
Expression of SMARCD1 interacts with age in association with asthma control on inhaled corticosteroid therapy.
BackgroundGlobal gene expression levels are known to be highly dependent upon gross demographic features including age, yet identification of age-related genomic indicators has yet to be comprehensively undertaken in a disease and treatment-specific context.MethodsWe used gene expression data from CD4+ lymphocytes in the Asthma BioRepository for Integrative Genomic Exploration (Asthma BRIDGE), an open-access collection of subjects participating in genetic studies of asthma with available gene expression data. Replication population participants were Puerto Rico islanders recruited as part of the ongoing Genes environments & Admixture in Latino Americans (GALA II), who provided nasal brushings for transcript sequencing. The main outcome measure was chronic asthma control as derived by questionnaires. Genomic associations were performed using regression of chronic asthma control score on gene expression with age in years as a covariate, including a multiplicative interaction term for gene expression times age.ResultsThe SMARCD1 gene (SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily D member 1) interacted with age to influence chronic asthma control on inhaled corticosteroids, with a doubling of expression leading to an increase of 1.3 units of chronic asthma control per year (95% CI [0.86, 1.74], p = 6 × 10- 9), suggesting worsening asthma control with increasing age. This result replicated in GALA II (p = 3.8 × 10- 8). Cellular assays confirmed the role of SMARCD1 in glucocorticoid response in airway epithelial cells.ConclusionFocusing on age-dependent factors may help identify novel indicators of asthma medication response. Age appears to modulate the effect of SMARCD1 on asthma control with inhaled corticosteroids
Genome-wide association study for adiponectin levels in Filipino women identifies CDH13 and a novel uncommon haplotype at KNG1–ADIPOQ
Adiponectin is an adipocyte-secreted protein involved in a variety of metabolic processes, including glucose regulation and fatty acid catabolism. We conducted a genome-wide association study to investigate the genetic loci associated with plasma adiponectin in 1776 unrelated Filipino women from the Cebu Longitudinal Health and Nutrition Survey (CLHNS). Our strongest signal for adiponectin mapped to the gene CDH13 (rs3865188, P ≤ 7.2 × 10−16), which encodes a receptor for high-molecular-weight forms of adiponectin. Strong association was also detected near the ADIPOQ gene (rs864265, P = 3.8 × 10−9) and at a novel signal 100 kb upstream near KNG1 (rs11924390, P = 7.6 × 10−7). All three signals were also observed in 1774 young adult CLHNS offspring and in combined analysis including all 3550 mothers and offspring samples (all P ≤ 1.6 × 10−9). An uncommon haplotype of rs11924390 and rs864265 (haplotype frequency = 0.050) was strongly associated with lower adiponectin compared with the most common C–G haplotype in both CLHNS mothers (P = 1.8 × 10−25) and offspring (P = 8.7 × 10−32). Comprehensive imputation of 2653 SNPs in a 2 Mb region using as reference combined CHB, JPT and CEU haplotypes from the 1000 Genomes Project revealed no variants that perfectly tagged this haplotype. Our findings provide the first genome-wide significant evidence of association with plasma adiponectin at the CDH13 locus and identify a novel uncommon KNG1–ADIPOQ haplotype strongly associated with adiponectin levels in Filipinos
Across-cohort QC analyses of GWAS summary statistics from complex traits.
Genome-wide association studies (GWASs) have been successful in discovering SNP trait associations for many quantitative traits and common diseases. Typically, the effect sizes of SNP alleles are very small and this requires large genome-wide association meta-analyses (GWAMAs) to maximize statistical power. A trend towards ever-larger GWAMA is likely to continue, yet dealing with summary statistics from hundreds of cohorts increases logistical and quality control problems, including unknown sample overlap, and these can lead to both false positive and false negative findings. In this study, we propose four metrics and visualization tools for GWAMA, using summary statistics from cohort-level GWASs. We propose methods to examine the concordance between demographic information, and summary statistics and methods to investigate sample overlap. (I) We use the population genetics Fst statistic to verify the genetic origin of each cohort and their geographic location, and demonstrate using GWAMA data from the GIANT Consortium that geographic locations of cohorts can be recovered and outlier cohorts can be detected. (II) We conduct principal component analysis based on reported allele frequencies, and are able to recover the ancestral information for each cohort. (III) We propose a new statistic that uses the reported allelic effect sizes and their standard errors to identify significant sample overlap or heterogeneity between pairs of cohorts. (IV) To quantify unknown sample overlap across all pairs of cohorts, we propose a method that uses randomly generated genetic predictors that does not require the sharing of individual-level genotype data and does not breach individual privacy
Population-specific coding variant underlies genome-wide association with adiponectin level
Adiponectin is a protein hormone that can affect major metabolic processes including glucose regulation and fat metabolism. Our previous genome-wide association (GWA) study of circulating plasma adiponectin levels in Filipino women from the Cebu Longitudinal Health and Nutrition Survey (CLHNS) detected a 100 kb two-SNP haplotype at KNG1–ADIPOQ associated with reduced adiponectin (frequency = 0.050, P = 1.8 × 10−25). Subsequent genotyping of CLHNS young adult offspring detected an uncommon variant [minor allele frequency (MAF) = 0.025] located ∼800 kb from ADIPOQ that showed strong association with lower adiponectin levels (P = 2.7 × 10−15, n = 1695) and tagged a subset of KNG1–ADIPOQ haplotype carriers with even lower adiponectin levels. Sequencing of the ADIPOQ-coding region detected variant R221S (MAF = 0.015, P = 2.9 × 10−69), which explained 17.1% of the variance in adiponectin levels and largely accounted for the initial GWA signal in Filipinos. R221S was not present in 12 514 Europeans with previously sequenced exons. To explore the mechanism of this substitution, we re-measured adiponectin level in 20 R221S offspring carriers and 20 non-carriers using two alternative antibodies and determined that the presence of R221S resulted in artificially low quantification of adiponectin level using the original immunoassay. These data provide an example of an uncommon variant responsible for a GWA signal and demonstrate that genetic associations with phenotypes measured by antibody-based quantification methods can be affected by uncommon coding SNPs residing in the antibody target region
Recommended from our members
The metabolomics of asthma control: a promising link between genetics and disease
Short-acting β agonists (e.g., albuterol) are the most commonly used medications for asthma, a disease that affects over 300 million people in the world. Metabolomic profiling of asthmatics taking β agonists presents a new and promising resource for identifying the molecular determinants of asthma control. The objective is to identify novel genetic and biochemical predictors of asthma control using an integrative “omics” approach. We generated lipidomic data by liquid chromatography tandem mass spectrometry (LC-MS), using plasma samples from 20 individuals with asthma. The outcome of interest was a binary indicator of asthma control defined by the use of albuterol inhalers in the preceding week. We integrated metabolomic data with genome-wide genotype, gene expression, and methylation data of this cohort to identify genomic and molecular indicators of asthma control. A Conditional Gaussian Bayesian Network (CGBN) was generated using the strongest predictors from each of these analyses. Integrative and metabolic pathway over-representation analyses (ORA) identified enrichment of known biological pathways within the strongest molecular determinants. Of the 64 metabolites measured, 32 had known identities. The CGBN model based on four SNPs (rs9522789, rs7147228, rs2701423, rs759582) and two metabolites—monoHETE_0863 and sphingosine-1-phosphate (S1P) could predict asthma control with an AUC of 95%. Integrative ORA identified 17 significantly enriched pathways related to cellular immune response, interferon signaling, and cytokine-related signaling, for which arachidonic acid, PGE2 and S1P, in addition to six genes (CHN1, PRKCE, GNA12, OASL, OAS1, and IFIT3) appeared to drive the pathway results. Of these predictors, S1P, GNA12, and PRKCE were enriched in the results from integrative and metabolic ORAs. Through an integrative analysis of metabolomic, genomic, and methylation data from a small cohort of asthmatics, we implicate altered metabolic pathways, related to sphingolipid metabolism, in asthma control. These results provide insight into the pathophysiology of asthma control
- …