2,001 research outputs found

Allelic expression mapping across cellular lineages to establish impact of non-coding SNPs

Author: Alicia Schiavi
Alison H Goodall
Ann‐Christine Syvänen
Bing Ge
Chuan Wang
Francois Cambien
Jonas Carlsson Almlöf
Lars Rönnblom
Maxime Caron
Nicholas Light
Panos Deloukas
Per Lundmark
Shu‐Huang Chen
Tomi Pastinen
Tony Kwan
Veronique Adoue
Wang Y
Willem H Ouwehand
Publication venue: 'EMBO'
Publication date: 01/01/2014

This is an open access article under the terms of the Creative Commons Attribution 4.0 License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited

Crossref

Publikationer från Uppsala Universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Queen Mary Research Online

Leicester Research Archive

A reference haplotype panel for genome-wide imputation of short tandem repeats.

Author: Fotsing Stephanie Feupe
Gymrek Melissa
Mitra Ileena
Mousavi Nima
Saini Shubham
Publication venue: eScholarship, University of California
Publication date: 01/10/2018

Short tandem repeats (STRs) are involved in dozens of Mendelian disorders and have been implicated in complex traits. However, genotyping arrays used in genome-wide association studies focus on single nucleotide polymorphisms (SNPs) and do not readily allow identification of STR associations. We leverage next-generation sequencing (NGS) from 479 families to create a SNP + STR reference haplotype panel. Our panel enables imputing STR genotypes into SNP array data when NGS is not available for directly genotyping STRs. Imputed genotypes achieve mean concordance of 97% with observed genotypes in an external dataset compared to 71% expected under a naive model. Performance varies widely across STRs, with near perfect concordance at bi-allelic STRs vs. 70% at highly polymorphic repeats. Imputation increases power over individual SNPs to detect STR associations with gene expression. Imputing STRs into existing SNP datasets will enable the first large-scale STR association studies across a range of complex traits

Directory of Open Access Journals

eScholarship - University of California

Whole-genome sequencing to understand the genetic architecture of common gene expression and biomarker phenotypes.

Author: Almeida Marcio
Bandinelli Stefania
Blangero John
Curran Joanne E
Duggirala Ravindranath
Ferrucci Luigi
Frayling Timothy M
Gaulton Kyle
Gibbs J Raphael
Göring Harald HH
Hernandez Dena
Johnson Matthew P
Jun Goo
Li Qibin
Lin Haoxiang
Mccarthy Mark I
Melzer David
Murray Anna
Nalls Mike
Pearson Richard
Perry John RB
Rivas Manny
Shen Juan
Singleton Andrew
Tanaka Toshiko
Tuke Marcus A
Weedon Michael N
Wood Andrew R
Xu Christopher S
Publication venue: eScholarship, University of California
Publication date: 06/11/2014

Initial results from sequencing studies suggest that there are relatively few low-frequency (<5%) variants associated with large effects on common phenotypes. We performed low-pass whole-genome sequencing in 680 individuals from the InCHIANTI study to test two primary hypotheses: (i) that sequencing would detect single low-frequency, large-effect variants that explained similar amounts of phenotypic variance as single common variants, and (ii) that some common variant associations could be explained by low-frequency variants. We tested two sets of disease-related common phenotypes for which we had statistical power to detect large numbers of common variant-common phenotype associations: 11 132 cis-gene expression traits in 450 individuals and 93 circulating biomarkers in all 680 individuals. From a total of 11 657 229 high-quality variants, of which 6 129 221 and 5 528 008 were common and low frequency (<5%), respectively, low-frequency, large-effect associations comprised 7% of detectable cis-gene expression traits [89 of 1314 cis-eQTLs at P < 1 × 10⁻⁶ (false discovery rate ∼5%)] and one of eight biomarker associations at P < 8 × 10⁻¹⁰. Very few (30 of 1232; 2%) common variant associations were fully explained by low-frequency variants. Our data show that whole-genome sequencing can identify low-frequency variants undetected by genotyping-based approaches when sample sizes are sufficiently large to detect substantial numbers of common variant associations, and that common variant associations are rarely explained by single low-frequency variants of large effect.

PubMed Central

eScholarship - University of California

Leveraging large scale beef cattle genomic data to identify the architecture of polygenic selection and local adaptation

Author: Rowan Troy
Publication venue: 'University of Missouri Libraries'

Includes vita. Since the invention of the first array-based genotyping assay for cattle in 2008, millions of animals have been genotyped worldwide. Leveraging these genotypes offers exciting opportunities to explore both basic and applied research questions. Commercial genotyping assays are of adequate variant density to perform well in prediction contexts but are not sufficient for mapping studies. Using reference panels made up of individuals genotyped at higher densities, we can statistically infer the missing variation of low-density assays through the process of imputation. Here, we explore the best practices for performing routine imputation in large commercially generated genomic datasets of U.S. beef cattle. We find that using a large multi-breed imputation reference maximizes accuracy, particularly for rare variants. Using three of these large, imputed datasets, we explore two major population genetics questions. First, we map polygenic selection in the bovine genome, using Generation Proxy Selection Mapping (GPSM). This identifies hundreds of regions of the genome actively under selection in cattle populations. Using a similar approach, we identify dozens of genomic variants associated with environments across the U.S., likely involved in local adaptation. Understanding the genomic basis of local adaptation in cattle will enable the selection and breeding of cattle better suited to a changing climate. Includes bibliographical references (pages 203-228).

University of Missouri: MOspace

Multiple breast cancer risk variants are associated with differential transcript isoform expression in tumors.

Author: Brenner Steven E
Camarda Roman
Caswell Jennifer L
Goga Andrei
Hu Donglei
Huntsman Scott
Zaitlen Noah
Zhou Alicia Y
Ziv Elad
Publication venue: eScholarship, University of California
Publication date: 15/10/2015

Genome-wide association studies have identified over 70 single-nucleotide polymorphisms (SNPs) associated with breast cancer. A subset of these SNPs are associated with quantitative expression of nearby genes, but the functional effects of the majority remain unknown. We hypothesized that some risk SNPs may regulate alternative splicing. Using RNA-sequencing data from breast tumors and germline genotypes from The Cancer Genome Atlas, we tested the association between each risk SNP genotype and exon-, exon-exon junction- or transcript-specific expression of nearby genes. Six SNPs were associated with differential transcript expression of seven nearby genes at FDR < 0.05 (BABAM1, DCLRE1B/PHTF1, PEX14, RAD51L1, SRGAP2D and STXBP4). We next developed a Bayesian approach to evaluate, for each SNP, the overlap between the signal of association with breast cancer and the signal of association with alternative splicing. At one locus (SRGAP2D), this method eliminated the possibility that the breast cancer risk and the alternative splicing event were due to the same causal SNP. Lastly, at two loci, we identified the likely causal SNP for the alternative splicing event, and at one, functionally validated the effect of that SNP on alternative splicing using a minigene reporter assay. Our results suggest that the regulation of differential transcript isoform expression is the functional mechanism of some breast cancer risk SNPs and that we can use these associations to identify causal SNPs, target genes and the specific transcripts that may mediate breast cancer risk.

PubMed Central

eScholarship - University of California

Haplotype-based quantitative trait mapping using a clustering algorithm

Author: AD Long
C Durrant
DB Allison
DJ Sheskin
GA Churchill
HT Toivonen
HW Deng
J Li
J Molitor
Jing Li
JM Comeron
JS Liu
JY Tzeng
K Song
K Zhang
L Kruglyak
M Ester
M Lynch
M Stephens
MJ Daly
MS McPeek
PIW de Bakker
R Fan
Robert C Elston
RR Hudson
S Zollner
SB Gabriel
T Niu
The International HapMap Consortium
Yingyao Zhou
Publication venue: BioMed Central
Publication date: 01/01/2006

BACKGROUND: With the availability of large-scale, high-density single-nucleotide polymorphism (SNP) markers, substantial effort has been made in identifying disease-causing genes using linkage disequilibrium (LD) mapping by haplotype analysis of unrelated individuals. In addition to complex diseases, many continuously distributed quantitative traits are of primary clinical and health significance. However, the development of association mapping methods using unrelated individuals for quantitative traits has received relatively less attention. RESULTS: We recently developed an association mapping method for complex diseases by mining the sharing of haplotype segments (i.e., phased genotype pairs) in affected individuals that are rarely present in normal individuals. In this paper, we extend our previous work to address the problem of quantitative trait mapping from unrelated individuals. The method is non-parametric in nature, and statistical significance can be obtained by a permutation test. It can also be incorporated into the one-way ANCOVA (analysis of covariance) framework so that other factors and covariates can be easily incorporated. The effectiveness of the approach is demonstrated by extensive experimental studies using both simulated and real data sets. The results show that our haplotype-based approach is more robust than two statistical methods based on single markers: a single SNP association test (SSA) and the Mann-Whitney U-test (MWU). The algorithm has been incorporated into our existing software package called HapMiner, which is available from our website. CONCLUSION: For QTL (quantitative trait loci) fine mapping, to identify QTNs (quantitative trait nucleotides) with realistic effects (the contribution of each QTN less than 10% of total variance of the trait), large sample sizes (≥ 500) are needed for all the methods. The overall performance of HapMiner is better than that of the other two methods. Its effectiveness further depends on other factors such as recombination rates and the density of typed SNPs. Haplotype-based methods might provide higher power than methods based on a single SNP when using tag SNPs selected from a small number of samples or some other sources (such as HapMap data). Rank-based statistics usually have much lower power, as shown in our study.
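The abstract above notes that significance can be obtained by a permutation test. The sketch below illustrates that general idea only; it is not HapMiner's statistic. The function name, the `in_cluster` membership flags, and the choice of an absolute difference in group means as the test statistic are all assumptions made for illustration.

```python
import random


def permutation_p_value(trait, in_cluster, n_perm=999, seed=0):
    """Permutation p-value for the difference in mean trait value between
    individuals inside and outside a (hypothetical) haplotype cluster."""

    def stat(values):
        # Absolute difference in group means (an assumed, illustrative statistic).
        inside = [v for v, c in zip(values, in_cluster) if c]
        outside = [v for v, c in zip(values, in_cluster) if not c]
        return abs(sum(inside) / len(inside) - sum(outside) / len(outside))

    rng = random.Random(seed)
    observed = stat(trait)
    shuffled = list(trait)
    hits = 0
    for _ in range(n_perm):
        # Shuffling trait values breaks any real trait-cluster association.
        rng.shuffle(shuffled)
        if stat(shuffled) >= observed:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return (hits + 1) / (n_perm + 1)
```

Both groups must be non-empty for the statistic to be defined. Covariate adjustment, which the abstract handles through an ANCOVA framework, is outside the scope of this sketch.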

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Genetic loci associated with coronary artery disease harbor evidence of selection and antagonistic pleiotropy

Author: Abraham Gad
Bakshi Andrew
Byars Sean G.
Gray Lesley-Ann
Huang Qin Qin
Inouye Michael
Ripatti Samuli
Stearns Stephen C.
Publication date: 01/06/2017

Traditional genome-wide scans for positive selection have mainly uncovered selective sweeps associated with monogenic traits. While selection on quantitative traits is much more common, very few signals have been detected because of their polygenic nature. We searched for positive selection signals underlying coronary artery disease (CAD) in worldwide populations, using novel approaches to quantify relationships between polygenic selection signals and CAD genetic risk. We identified new candidate adaptive loci that appear to have been directly modified by disease pressures given their significant associations with CAD genetic risk. These candidates were all uniquely and consistently associated with many different male and female reproductive traits, suggesting selection may have also targeted these because of their direct effects on fitness. We found that CAD loci are significantly enriched for lifetime reproductive success relative to the rest of the human genome, with evidence that the relationship between CAD and lifetime reproductive success is antagonistic. This supports the presence of antagonistic-pleiotropic tradeoffs on CAD loci and provides a novel explanation for the maintenance and high prevalence of CAD in modern humans. Lastly, we found that positive selection more often targeted CAD gene regulatory variants using HapMap3 lymphoblastoid cell lines, which further highlights the unique biological significance of candidate adaptive loci underlying CAD. Our study provides a novel approach for detecting selection on polygenic traits and evidence that modern human genomes have evolved in response to CAD-induced selection pressures and other early-life traits sharing pleiotropic links with CAD. Peer reviewed

Directory of Open Access Journals

Helsingin yliopiston digitaalinen arkisto

University of Melbourne Institutional Repository

FigShare

The Population Genetic Signature of Polygenic Local Adaptation

Author: Berg Jeremy J.
Coop Graham
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 06/02/2014

Adaptation in response to selection on polygenic phenotypes may occur via subtle allele frequency shifts at many loci. Current population genomic techniques are not well posed to identify such signals. In the past decade, detailed knowledge about the specific loci underlying polygenic traits has begun to emerge from genome-wide association studies (GWAS). Here we combine this knowledge from GWAS with robust population genetic modeling to identify traits that may have been influenced by local adaptation. We exploit the fact that GWAS provide an estimate of the additive effect size of many loci to estimate the mean additive genetic value for a given phenotype across many populations as simple weighted sums of allele frequencies. We first describe a general model of neutral genetic value drift for an arbitrary number of populations with an arbitrary relatedness structure. Based on this model we develop methods for detecting unusually strong correlations between genetic values and specific environmental variables, as well as a generalization of $Q_{ST}/F_{ST}$ comparisons to test for over-dispersion of genetic values among populations. Finally we lay out a framework to identify the individual populations or groups of populations that contribute to the signal of overdispersion. These tests have considerably greater power than their single locus equivalents due to the fact that they look for positive covariance between like effect alleles, and also significantly outperform methods that do not account for population structure. We apply our tests to the Human Genome Diversity Panel (HGDP) dataset using GWAS data for height, skin pigmentation, type 2 diabetes, body mass index, and two inflammatory bowel disease datasets. This analysis uncovers a number of putative signals of local adaptation, and we discuss the biological interpretation and caveats of these results. Comment: 42 pages including 8 figures and 3 tables; supplementary figures and tables not included on this upload, but are mostly unchanged from v
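The abstract above describes estimating a population's mean additive genetic value as a weighted sum of allele frequencies, with GWAS effect sizes as the weights. A minimal sketch of that arithmetic follows. The function name and input layout are hypothetical, and the factor of 2 (two allele copies per diploid individual) is an assumption not stated in the abstract.

```python
def mean_genetic_values(effect_sizes, allele_freqs_by_pop):
    """Return one mean additive genetic value per population.

    effect_sizes: per-locus additive effects (e.g. from a GWAS).
    allele_freqs_by_pop: one list of per-locus effect-allele frequencies
        per population, in the same locus order as effect_sizes.
    """
    values = []
    for freqs in allele_freqs_by_pop:
        if len(freqs) != len(effect_sizes):
            raise ValueError("each population needs one frequency per locus")
        # Weighted sum of allele frequencies; the factor 2 assumes diploidy.
        values.append(2.0 * sum(a * p for a, p in zip(effect_sizes, freqs)))
    return values
```

For example, `mean_genetic_values([0.5, -0.2], [[0.1, 0.3], [0.6, 0.5]])` yields one value per population. Tests for over-dispersion or environmental correlation would then operate on these values, which this sketch does not attempt.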

arXiv.org e-Print Archive

Directory of Open Access Journals

PubMed Central

FigShare

Patterns of Ancestry, Signatures of Natural Selection, and Genetic Association with Stature in Western African Pygmies

Author: A La Batide-Alanore
A Leonhardt
AB Migliano
AL Price
Alain Froment
AR Boyko
B Pasaniuc
Bart Ferwerda
BF Voight
BS Weir
C Ballard
C Batini
CA Winkler
CC Khor
CD Huff
Charla Lambert
D Lopez Herraez
D Philipson
D Redelman
DL Rimoin
E Patin
G Baumann
G Destro-Bisol
GA McVean
Gabriel Hoffman
GH Perry
H Eleftherohorinou
H Innan
H Lango Allen
H Tang
H Yasukawa
HJ Bandelt
HM Kang
J Chen
J Kamath
Jason Mezey
JD Storey
JE Pool
Jean-Marie Bodo
JK Pickrell
JK Pritchard
JM Akey
JM Kidd
Joseph P. Jarvis
Joshua M. Akey
JZ Li
K Bryc
K Tang
L Quintana-Murci
Larsson Omberg
Laura B. Scheinfeldt
LG Moore
LJ Young
M Bozzola
M Joron
M Pelican
M Stephens
MB Lanktree
MD Shriver
MG de Silva
N Davila
NS Becker
P Librado
P Moorjani
P Scheet
P Verdu
PA Fujita
PC Sabeti
PR Dormitzer
R Chakraborty
R Kimura
S Jain
S Ludwig
S Purcell
S Sankararaman
SA Miller
SA Tishkoff
Sameer Soi
Sarah A. Tishkoff
SH Williamson
SJ Kang
ST Sherry
TJ Merimee
TJ Pemberton
William Beggs
WS Alexander
Y Benjamini
Y Chen
Y Hattori
Publication venue: Public Library of Science
Publication date: 01/01/2012

African Pygmy groups show a distinctive pattern of phenotypic variation, including short stature, which is thought to reflect past adaptation to a tropical environment. Here, we analyze Illumina 1M SNP array data in three Western Pygmy populations from Cameroon and three neighboring Bantu-speaking agricultural populations with whom they have admixed. We infer genome-wide ancestry, scan for signals of positive selection, and perform targeted genetic association with measured height variation. We identify multiple regions throughout the genome that may have played a role in adaptive evolution, many of which contain loci with roles in growth hormone, insulin, and insulin-like growth factor signaling pathways, as well as immunity and neuroendocrine signaling involved in reproduction and metabolism. The most striking results are found on chromosome 3, which harbors a cluster of selection and association signals between approximately 45 and 60 Mb. This region also includes the positional candidate genes DOCK3, which is known to be associated with height variation in Europeans, and CISH, a negative regulator of cytokine signaling known to inhibit growth hormone-stimulated STAT5 signaling. Finally, pathway analysis for genes near the strongest signals of association with height indicates enrichment for loci involved in insulin and insulin-like growth factor signaling

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Horizon / Pleins textes

FigShare