Search CORE

308 research outputs found

Principal-component-based population structure adjustment in the North American Rheumatoid Arthritis Consortium data: impact of single-nucleotide polymorphism set and analysis method

Author: Lunetta Kathryn L
Peloso Gina M
Timofeev Nadia
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Population structure occurs when a sample is composed of individuals with different ancestries and can result in excess type I error in genome-wide association studies. Genome-wide principal-component analysis (PCA) has become a popular method for identifying and adjusting for subtle population structure in association studies. Using the Genetic Analysis Workshop 16 (GAW16) NARAC data, we explore two unresolved issues concerning the use of genome-wide PCA to account for population structure in genetic associations studies: the choice of single-nucleotide polymorphism (SNP) subset and the choice of adjustment model. We computed PCs for subsets of genome-wide SNPs with varying levels of LD. The first two PCs were similar for all subsets and the first three PCs were associated with case status for all subsets. When the PCs associated with case status were included as covariates in an association model, the reduction in genomic inflation factor was similar for all SNP sets. Several models have been proposed to account for structure using PCs, but it is not yet clear whether the different methods will result in substantively different results for association studies with individuals of European descent. We compared genome-wide association p-values and results for two positive-control SNPs previously associated with rheumatoid arthritis using four PC adjustment methods as well as no adjustment and genomic control. We found that in this sample, adjusting for the continuous PCs or adjusting for discrete clusters identified using the PCs adequately accounts for the case-control population structure, but that a recently proposed randomization test performs poorly

Crossref

Boston University Institutional Repository (OpenBU)

Springer - Publisher Connector

PubMed Central

Deep-coverage whole genome sequences and blood lipids among 16,324 individuals

Author: Blangero John
Chaffin Mark
Curran Joanne E.
Ganna Andrea
Khera Amit V.
Mahaney Michael C.
Montasser May
Natarajan Pradeep
Peloso Gina M.
Zekavat Seyedeh M.
Publication venue: ScholarWorks @ UTRGV
Publication date: 01/08/2018
Field of study

Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth \u3e29X and analyze genotypes with four quantitative traits—plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia

Scholarworks@UTRGV Univ. of Texas RioGrande Valley

A three-stage approach for genome-wide association studies with family data for quantitative traits

Author: Atwood Larry D
Chen Ming-Huei
Fox Caroline S
Guo Chao-Yu
Hsu Yi-Hsiang
Larson Martin G
Peloso Gina M
Yang Qiong
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Background: Genome-wide association (GWA) studies that use population-based association approaches may identify spurious associations in the presence of population admixture. In this paper, we propose a novel three-stage approach that is computationally efficient and robust to population admixture and more powerful than the family-based association test (FBAT) for GWA studies with family data. We propose a three-stage approach for GWA studies with family data. The first stage is to perform linear regression ignoring phenotypic correlations among family members. SNPs with a first stage p-value below a liberal cut-off (e.g. 0.1) are then analyzed in the second stage that employs a linear mixed effects (LME) model that accounts for within family correlations. Next, SNPs that reach genome-wide significance (e.g.

10^{-6}

for 34,625 genotyped SNPs in this paper) are analyzed in the third stage using FBAT, with correction of multiple testing only for SNPs that enter the third stage. Simulations are performed to evaluate type I error and power of the proposed method compared to LME adjusting for 10 principal components (PC) of the genotype data. We also apply the three-stage approach to the GWA analyses of uric acid in Framingham Heart Study's SNP Health Association Resource (SHARe) project. Results: Our simulations show that whether or not population admixture is present, the three-stage approach has no inflated type I error. In terms of power, using LME adjusting PC is only slightly more powerful than the three-stage approach. When applied to the GWA analyses of uric acid in the SHARe project of FHS, the three-stage approach successfully identified and confirmed three SNPs previously reported as genome-wide significant signals. Conclusions: For GWA analyses of quantitative traits with family data, our three-stage approach provides another appealing solution to population admixture, in addition to LME adjusting for genetic PC

Crossref

Boston University Institutional Repository (OpenBU)

Harvard University - DASH

Springer - Publisher Connector

PubMed Central

Recommended from our members

Recurring exon deletions in the haptoglobin (HP) gene associate with lower blood cholesterol levels

Author: Boettger Linda M.
Handsaker Robert E.
Hirschhorn Joel
Kathiresan Sekar
McCarroll Steven A.
Peloso Gina
Salem Rany M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 11/10/2016
Field of study

Two exons of the human haptoglobin (HP) gene exhibit copy number variation that affects HP multimerization and underlies one of the first protein polymorphisms identified in humans. The evolutionary origins and medical significance of this polymorphism have been uncertain. Here we show that this variation has likely arisen from the recurring reversion of an ancient hominin-specific duplication of these exons. Though this polymorphism has been largely invisible to genome-wide genetic studies to date, we describe a way to analyze it by imputation from SNP haplotypes and find among 22,288 individuals that these HP exonic deletions associate with reduced LDL and total cholesterol levels. We show that these deletions, and a SNP that affects HP expression, are the likely drivers of the strong but complex association of cholesterol levels to SNPs near HP. Recurring exonic deletions in the haptoglobin gene likely enhance human health by lowering cholesterol levels in the blood

Harvard University - DASH

Implicating genes, pleiotropy, and sexual dimorphism at blood lipid loci through multi-ancestry meta-analysis

Author: Brumpton Ben M
et al
Graham Sarah E
Haworth Simon
Kanoni Stavroula
Mitchell Ruth E
Peloso Gina M
Rasheed Humaira
Surakka Ida
Timpson Nicholas J
Wang Yuxuan
Willer Cristen J
Wong Andrew
Zeggini Eleftheria
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 27/12/2022
Field of study

Explore Bristol Research

A genome-wide association study for blood lipid phenotypes in the Framingham Heart Study

Author: Arnett Donna K
Burtt Nöel P
Cupples L Adrienne
D'Agostino Ralph B
Demissie Serkalem
Gianniny Lauren
Guiducci Candace
Kathiresan Sekar
Manning Alisa K
Melander Olle
Ordovas Jose M
Orho-Melander Marju
Peloso Gina M
Surti Aarti
Publication venue: BioMed Central
Publication date: 01/01/2007
Field of study

Abstract Background Blood lipid levels including low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol (HDL-C), and triglycerides (TG) are highly heritable. Genome-wide association is a promising approach to map genetic loci related to these heritable phenotypes. Methods In 1087 Framingham Heart Study Offspring cohort participants (mean age 47 years, 52% women), we conducted genome-wide analyses (Affymetrix 100K GeneChip) for fasting blood lipid traits. Total cholesterol, HDL-C, and TG were measured by standard enzymatic methods and LDL-C was calculated using the Friedewald formula. The long-term averages of up to seven measurements of LDL-C, HDL-C, and TG over a ~30 year span were the primary phenotypes. We used generalized estimating equations (GEE), family-based association tests (FBAT) and variance components linkage to investigate the relationships between SNPs (on autosomes, with minor allele frequency ≥10%, genotypic call rate ≥80%, and Hardy-Weinberg equilibrium p ≥ 0.001) and multivariable-adjusted residuals. We pursued a three-stage replication strategy of the GEE association results with 287 SNPs (P < 0.001 in Stage I) tested in Stage II (n ~1450 individuals) and 40 SNPs (P < 0.001 in joint analysis of Stages I and II) tested in Stage III (n~6650 individuals). Results Long-term averages of LDL-C, HDL-C, and TG were highly heritable (h2 = 0.66, 0.69, 0.58, respectively; each P < 0.0001). Of 70,987 tests for each of the phenotypes, two SNPs had p < 10-5 in GEE results for LDL-C, four for HDL-C, and one for TG. For each multivariable-adjusted phenotype, the number of SNPs with association p < 10-4 ranged from 13 to 18 and with p < 10-3, from 94 to 149. Some results confirmed previously reported associations with candidate genes including variation in the lipoprotein lipase gene (<it>LPL</it>) and HDL-C and TG (rs7007797; P = 0.0005 for HDL-C and 0.002 for TG). The full set of GEE, FBAT and linkage results are posted at the database of Genotype and Phenotype (dbGaP). After three stages of replication, there was no convincing statistical evidence for association (i.e., combined P < 10-5 across all three stages) between any of the tested SNPs and lipid phenotypes. Conclusion Using a 100K genome-wide scan, we have generated a set of putative associations for common sequence variants and lipid phenotypes. Validation of selected hypotheses in additional samples did not identify any new loci underlying variability in blood lipids. Lack of replication may be due to inadequate statistical power to detect modest quantitative trait locus effects (i.e., <1% of trait variance explained) or reduced genomic coverage of the 100K array. GWAS in FHS using a denser genome-wide genotyping platform and a better-powered replication strategy may identify novel loci underlying blood lipids.</p

Lund University Publications

Crossref

Boston University Institutional Repository (OpenBU)

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Deep-coverage whole genome sequences and blood lipids among 16,324 individuals.

Author: Abecasis Goncalo
Alver Maris
Bloom Jonathan M
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Engreitz Jesse M
Ernst Jason
Esko Tonu
Ganna Andrea
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Khera Amit V
Lander Eric S
Manichaikul Ani
Mitchell Braxton
Montasser May
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
O'Connell Jeffrey R
Peloso Gina M
Perry James A
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni E
Salomaa Veikko
Seed Cotton
Surakka Ida L
Vasan Ramachandran S
Willer Cristen J
Wilson James G
Zekavat Seyedeh Maryam
Zhou Wei
Publication venue: eScholarship, University of California
Publication date: 01/08/2018
Field of study

Large-scale deep-coverage whole-genome sequencing (WGS) is now feasible and offers potential advantages for locus discovery. We perform WGS in 16,324 participants from four ancestries at mean depth >29X and analyze genotypes with four quantitative traits-plasma total cholesterol, low-density lipoprotein cholesterol (LDL-C), high-density lipoprotein cholesterol, and triglycerides. Common variant association yields known loci except for few variants previously poorly imputed. Rare coding variant association yields known Mendelian dyslipidemia genes but rare non-coding variant association detects no signals. A high 2M-SNP LDL-C polygenic score (top 5th percentile) confers similar effect size to a monogenic mutation (~30 mg/dl higher for each); however, among those with severe hypercholesterolemia, 23% have a high polygenic score and only 2% carry a monogenic mutation. At these sample sizes and for these phenotypes, the incremental value of WGS for discovery is limited but WGS permits simultaneous assessment of monogenic and polygenic models to severe hypercholesterolemia

DSpace@MIT

Directory of Open Access Journals

eScholarship - University of California

George Washington University: Health Sciences Research Commons (HSRC)

Recommended from our members

Deep coverage whole genome sequences and plasma lipoprotein(a) in individuals of European and African ancestries.

Author: Alver Maris
Bloom Jonathan
Budoff Matthew
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Daly Mark J
Engreitz Jesse
Ernst Jason
Esko Tonu
Fu Mao
Ganna Andrea
Handsaker Robert E
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Manichaikul Ani
McCarroll Steven
Metspalu Andres
Mitchell Braxton D
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
Peloso Gina M
Post Wendy
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni
Ryan Kathleen A
Salomaa Veikko
Seed Cotton
Surakka Ida
Tsai Michael
Vasan Ramachandran S
Wilson James G
Yang Chaojie
Zekavat Seyedeh M
Publication venue: eScholarship, University of California
Publication date: 01/07/2018
Field of study

Lipoprotein(a), Lp(a), is a modified low-density lipoprotein particle that contains apolipoprotein(a), encoded by LPA, and is a highly heritable, causal risk factor for cardiovascular diseases that varies in concentrations across ancestries. Here, we use deep-coverage whole genome sequencing in 8392 individuals of European and African ancestry to discover and interpret both single-nucleotide variants and copy number (CN) variation associated with Lp(a). We observe that genetic determinants between Europeans and Africans have several unique determinants. The common variant rs12740374 associated with Lp(a) cholesterol is an eQTL for SORT1 and independent of LDL cholesterol. Observed associations of aggregates of rare non-coding variants are largely explained by LPA structural variation, namely the LPA kringle IV 2 (KIV2)-CN. Finally, we find that LPA risk genotypes confer greater relative risk for incident atherosclerotic cardiovascular diseases compared to directly measured Lp(a), and are significantly associated with measures of subclinical atherosclerosis in African Americans

eScholarship - University of California

George Washington University: Health Sciences Research Commons (HSRC)

Key Variants via the Alzheimer\u27s Disease Sequencing Project Whole Genome Sequence Data

Author: Bis Joshua C
Blue Elizabeth E
Boerwinkle Eric
Choi Seung Hoan
De Jager Philip L
DeStefano Anita L
Dupuis Josée
Fornage Myriam
Heard-Costa Nancy L
Lin Honghuang
Peloso Gina M
Pitsillides Achilleas N
Sarnowski Chloé
Seshadri Sudha
Wang Dongyu
Wang Yanbing
Wijsman Ellen M
Publication venue: DigitalCommons@TMC
Publication date: 01/05/2024
Field of study

INTRODUCTION: Genome-wide association studies (GWAS) have identified loci associated with Alzheimer\u27s disease (AD) but did not identify specific causal genes or variants within those loci. Analysis of whole genome sequence (WGS) data, which interrogates the entire genome and captures rare variations, may identify causal variants within GWAS loci. METHODS: We performed single common variant association analysis and rare variant aggregate analyses in the pooled population (N cases = 2184, N controls = 2383) and targeted analyses in subpopulations using WGS data from the Alzheimer\u27s Disease Sequencing Project (ADSP). The analyses were restricted to variants within 100 kb of 83 previously identified GWAS lead variants. RESULTS: Seventeen variants were significantly associated with AD within five genomic regions implicating the genes OARD1/NFYA/TREML1, JAZF1, FERMT2, and SLC24A4. KAT8 was implicated by both single variant and rare variant aggregate analyses. DISCUSSION: This study demonstrates the utility of leveraging WGS to gain insights into AD loci identified via GWAS

DigitalCommons@The Texas Medical Center

Publisher Correction: Deep coverage whole genome sequences and plasma lipoprotein(a) in individuals of European and African ancestries.

Author: Alver Maris
Bloom Jonathan
Budoff Matthew
Chaffin Mark
Correa Adolfo
Cupples L Adrienne
Daly Mark J
Engreitz Jesse
Ernst Jason
Esko Tonu
Fu Mao
Ganna Andrea
Handsaker Robert E
Johnson W Craig
Kathiresan Sekar
Kellis Manolis
Manichaikul Ani
McCarroll Steven
Metspalu Andres
Mitchell Braxton D
Natarajan Pradeep
Neale Benjamin M
NHLBI TOPMed Lipids Working Group
Peloso Gina M
Post Wendy
Poterba Timothy
Rich Stephen S
Ripatti Samuli
Rotter Jerome I
Ruotsalainen Sanni
Ryan Kathleen A
Salomaa Veikko
Seed Cotton
Surakka Ida
Tsai Michael
Vasan Ramachandran S
Wilson James G
Yang Chaojie
Zekavat Seyedeh M
Publication venue: eScholarship, University of California
Publication date: 01/08/2018
Field of study

The original version of this article contained an error in the name of the author Ramachandran S. Vasan, which was incorrectly given as Vasan S. Ramachandran. This has now been corrected in both the PDF and HTML versions of the article

Crossref

eScholarship - University of California

George Washington University: Health Sciences Research Commons (HSRC)