205 research outputs found

The effect of minor allele frequency on the likelihood of obtaining false positives

Author: AC Lam
IP Gorlov
JC Florez
Jessica G Woo
KG Ardlie
LA Cupples
Lisa J Martin
Meredith E Tabangin
V Moskvina
Publication venue: BioMed Central
Publication date: 01/01/2009

Determining the most promising single-nucleotide polymorphisms (SNPs) presents a challenge in genome-wide association studies, where hundreds of thousands of association tests are conducted. The power to detect genetic effects depends on minor allele frequency (MAF), and the SNP arrays used in genome-wide association studies include SNPs with a wide distribution of MAFs. Therefore, it is critical to understand MAF's effect on the false positive rate

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Replication of genome-wide association studies (GWAS) loci for fasting plasma glucose in African-Americans

Author: A Adeyemo
A. Adeyemo
A. Doumatey
A. Herbert
C. Rotimi
CS Bretherton
D Shriner
D. Shriner
DM Evans
E. Ramos
G. Chen
GM Reaven
H Unoki
H. Huang
J Dupuis
J. Zhou
JC Chambers
KG Ardlie
M. F. Christman
MI McCarthy
N. P. Gerry
S O’Rahilly
SA Tishkoff
SC Schuster
Publication venue: Springer-Verlag
Publication date: 01/01/2010

Crossref

Springer - Publisher Connector

PubMed Central

A unified framework for multi-locus association analysis of both common and rare variants

Author: B Devlin
B Li
BE Madsen
C Kooperberg
Daniel Shriner
DT Redden
E Eisenberg
G Zheng
H-T Kao
HC Fung
I Ruczinski
IP Gorlov
J Simón-Sánchez
JA Longmate
JC Cohen
JC Mueller
JK Pritchard
KG Ardlie
Laura Kelly Vaughan
LC Kwee
MC Wu
N Pankratz
NS Fearnhead
R Development Core Team
S Purcell
S Won
T Suzuki
The Wellcome Trust Case Control Consortium
Publication venue: BioMed Central
Publication date: 01/01/2011

Background: Common, complex diseases are hypothesized to result from a combination of common and rare genetic variants. We developed a unified framework for the joint association testing of both types of variants. Within the framework, we developed a union-intersection test suitable for genome-wide analysis of single nucleotide polymorphisms (SNPs), candidate gene data, as well as medical sequencing data. The union-intersection test is a composite test of association of genotype frequencies and differential correlation among markers. Results: We demonstrated by computer simulation that the false positive error rate was controlled at the expected level. We also demonstrated scenarios in which the multi-locus test was more powerful than traditional single marker analysis. To illustrate use of the union-intersection test with real data, we analyzed a publicly available data set of 319,813 autosomal SNPs genotyped for 938 cases of Parkinson disease and 863 neurologically normal controls for which no genome-wide significant results were found by traditional single marker analysis. We also analyzed an independent follow-up sample of 183 cases and 248 controls for replication. Conclusions: We identified a single risk haplotype with a directionally consistent effect in both samples in the gene GAK, which is involved in clathrin-mediated membrane trafficking. We also found suggestive evidence that directionally inconsistent marginal effects from single marker analysis appeared to result from risk being driven by different haplotypes in the two samples for the genes SYN3 and NGLY1, which are involved in neurotransmitter release and proteasomal degradation, respectively. These results illustrate the utility of our unified framework for genome-wide association analysis of common, complex diseases.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Cubic exact solutions for the estimation of pairwise haplotype frequencies: implications for linkage disequilibrium analyses and a web tool 'CubeX'

Author: F Pettersson
GKS Wong
GR Abecasis
Ian NM Day
INM Day
JC Barrett
JC Mueller
KG Ardlie
KK Alharbi
KM Weiss
L Wang
LB Jorde
LJ Palmer
M Stephens
ME Weale
PS Foundation
RC Lewontin
RM Salem
RWD Nickalls
S Mano
Santiago Rodríguez
SH Orzack
TIHM Consortium
Tom R Gaunt
TR Gaunt
WG Hill
ZW Luo
Publication venue: BioMed Central
Publication date: 01/01/2007

Background: The frequency of a haplotype comprising one allele at each of two loci can be expressed as a cubic equation (the 'Hill equation'), the solution of which gives that frequency. Most haplotype and linkage disequilibrium analysis programs use iteration-based algorithms which substitute an estimate of haplotype frequency into the equation, producing a new estimate which is repeatedly fed back into the equation until the values converge to a maximum likelihood estimate (expectation-maximisation). Results: We present a program, "CubeX", which calculates the biologically possible exact solution(s) and provides estimated haplotype frequencies, D', r² and χ² values for each. CubeX provides a "complete" analysis of haplotype frequencies and linkage disequilibrium for a pair of biallelic markers under situations where sampling variation and genotyping errors distort sample Hardy-Weinberg equilibrium, potentially causing more than one biologically possible solution. We also present an analysis of simulations and real data using the algebraically exact solution, which indicates that under perfect sample Hardy-Weinberg equilibrium there is only one biologically possible solution, but that under other conditions there may be more. Conclusion: Our analyses demonstrate that lower allele frequencies, lower sample numbers, population stratification and a possible |D'| value of 1 are particularly susceptible to distortion of sample Hardy-Weinberg equilibrium, which has significant implications for calculation of linkage disequilibrium in small sample sizes (e.g. HapMap) and rarer alleles (e.g. paucimorphisms, q < 0.05) that may have particular disease relevance and require improved approaches for meaningful evaluation.
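The iterative approach that the abstract contrasts with CubeX can be sketched roughly as follows. This is a minimal illustration of expectation-maximisation for two biallelic loci, not the CubeX program and not its exact cubic solution. The function names, the 3x3 genotype-count layout, and the linkage-equilibrium starting point are assumptions made for the example.

```python
def em_haplotype_freqs(counts, tol=1e-12, max_iter=10_000):
    """Estimate haplotype frequencies (AB, Ab, aB, ab) by EM.

    counts[i][j] is the number of individuals carrying i copies of allele A
    at locus 1 and j copies of allele B at locus 2 (i, j in 0..2).
    This layout is an assumption of the sketch.
    """
    n = counts
    total = 2 * sum(sum(row) for row in n)  # number of chromosomes
    if total == 0:
        raise ValueError("no genotypes supplied")

    # Haplotype counts that are unambiguous: every genotype except the
    # double heterozygote (i = 1, j = 1) has a known phase.
    known = {
        "AB": 2 * n[2][2] + n[2][1] + n[1][2],
        "Ab": 2 * n[2][0] + n[2][1] + n[1][0],
        "aB": 2 * n[0][2] + n[1][2] + n[0][1],
        "ab": 2 * n[0][0] + n[1][0] + n[0][1],
    }
    double_het = n[1][1]

    # Start from linkage equilibrium (product of allele frequencies).
    p_a = (known["AB"] + known["Ab"] + double_het) / total
    p_b = (known["AB"] + known["aB"] + double_het) / total
    freqs = {
        "AB": p_a * p_b,
        "Ab": p_a * (1 - p_b),
        "aB": (1 - p_a) * p_b,
        "ab": (1 - p_a) * (1 - p_b),
    }

    for _ in range(max_iter):
        # E-step: probability that a double heterozygote is AB/ab
        # rather than Ab/aB, given the current frequencies.
        cis = freqs["AB"] * freqs["ab"]
        trans = freqs["Ab"] * freqs["aB"]
        w = cis / (cis + trans) if cis + trans > 0 else 0.5

        # M-step: re-estimate frequencies from expected haplotype counts.
        new = {
            "AB": (known["AB"] + double_het * w) / total,
            "ab": (known["ab"] + double_het * w) / total,
            "Ab": (known["Ab"] + double_het * (1 - w)) / total,
            "aB": (known["aB"] + double_het * (1 - w)) / total,
        }
        converged = max(abs(new[h] - freqs[h]) for h in new) < tol
        freqs = new
        if converged:
            break
    return freqs


def ld_stats(freqs):
    """Return (D, r squared) computed from haplotype frequencies."""
    p_a = freqs["AB"] + freqs["Ab"]
    p_b = freqs["AB"] + freqs["aB"]
    d = freqs["AB"] - p_a * p_b
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    r2 = d * d / denom if denom > 0 else float("nan")
    return d, r2
```

A run from a single starting point returns a single estimate, so a sketch like this would not by itself reveal the multiple biologically possible solutions that the abstract describes.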

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Explore Bristol Research

Accuracy of Predicting the Genetic Risk of Disease Using a Genome-Wide Approach

Author: A Robertson
ACJW Janssens
AF Mcrae
AJ Chamberlain
Beatriz Villanueva
BJ Hayes
D Habier
D Shriner
DE Reich
DR Cox
DS Falconer
Hans D. Daetwyler
HHH Goring
JC Barrett
JC Dekkers
JC Venter
JK Pritchard
JN Hirschhorn
John A. Woolliams
KG Ardlie
M Lynch
M Sargolzaei
Michael Nicholas Weedon
MN Weedon
N Risch
NJ Yi
NR Wray
P Bijma
PDP Pharoah
SZ Xu
TH Meuwissen
TR Solberg
W Valdar
Publication venue: Public Library of Science
Publication date: 01/01/2008

Background - The prediction of the genetic disease risk of an individual is a powerful public health tool. While predicting risk has been successful in diseases which follow simple Mendelian inheritance, it has proven challenging in complex diseases for which a large number of loci contribute to the genetic variance. The large numbers of single nucleotide polymorphisms now available provide new opportunities for predicting genetic risk of complex diseases with high accuracy. Methodology/Principal Findings - We have derived simple deterministic formulae to predict the accuracy of predicted genetic risk from population or case control studies using a genome-wide approach and assuming a dichotomous disease phenotype with an underlying continuous liability. We show that the prediction equations are special cases of the more general problem of predicting the accuracy of estimates of genetic values of a continuous phenotype. Our predictive equations are responsive to all parameters that affect accuracy and they are independent of allele frequency and effect distributions. Deterministic prediction errors when tested by simulation were generally small. The common link among the expressions for accuracy is that they are best summarized as the product of the ratio of number of phenotypic records per number of risk loci and the observed heritability. Conclusions/Significance - This study advances the understanding of the relative power of case control and population studies of disease. The predictions represent an upper bound of accuracy which may be achievable with improved effect estimation methods. The formulae derived will help researchers determine an appropriate sample size to attain a certain accuracy when predicting genetic risk.
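The abstract's summary quantity, the ratio of phenotypic records to risk loci multiplied by heritability, can be illustrated with a small sketch. The functional form used below, accuracy = sqrt(x / (x + 1)) with x = (records / loci) × h², is an assumption chosen for illustration and is not quoted from the abstract. The function and parameter names are likewise hypothetical.

```python
import math


def predicted_accuracy(n_records, n_loci, h2):
    """Illustrative accuracy of predicted genetic values.

    Assumed form (not taken from the abstract):
        x = (n_records / n_loci) * h2
        accuracy = sqrt(x / (x + 1))
    """
    if n_records < 0 or n_loci <= 0 or not 0.0 <= h2 <= 1.0:
        raise ValueError("invalid parameters")
    x = (n_records / n_loci) * h2
    return math.sqrt(x / (x + 1.0))


def records_needed(target_accuracy, n_loci, h2):
    """Invert the assumed form to get the number of records for a target accuracy."""
    if not 0.0 <= target_accuracy < 1.0 or n_loci <= 0 or not 0.0 < h2 <= 1.0:
        raise ValueError("invalid parameters")
    r2 = target_accuracy ** 2
    return n_loci * r2 / (h2 * (1.0 - r2))
```

Under this assumed form, accuracy rises with more records or higher heritability and falls as the number of risk loci grows, which matches the qualitative description in the abstract.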

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Edinburgh Research Explorer

Wageningen University & Research Publications

SRUC - Scotland's Rural College

Detection of regulator genes and eQTLs in gene networks

Author: A Butte
A Chatr-Aryamontri
A Clauset
A Joshi
A Kundaje
AA Shabalin
AJ Enright
AJ Walhout
AS Dimas
B Schwanhausser
B Zhang
C Cenik
CO Daub
D Koller
DA Cusanovich
DM Greenawalt
E Bonnet
E Ravasz
E Segal
EC Neto
EE Schadt
EJ Foss
F Grubert
F Yue
FA Cubillos
FW Albert
G Hemani
G Nicholson
GD Smith
GH Golub
H Foroughi Asl
H Talukdar
HN Kadarmideen
J Millstein
J Qi
J Zhu
JE Aten
JF Ayroles
JJ Faith
JL Björkegren
JS Liu
K Basso
K Qu
KG Ardlie
L Wu
LA Hindorff
LH Hartwell
LS Chen
M Ashburner
M Civelek
M Georges
M Gerstein
M Medvedovic
M Schmidt
M Scutari
MA Schaub
MB Eisen
MD Ritchie
ME Goddard
MEJ Newman
MV Rockman
N Friedman
N Laird
O Stegle
P Langfelder
P Lu
R Sharan
RB Brem
RW Williams
S Lee
S Roy
S Tavazoie
SI Lee
SM Waszak
SS Rao
T Lappalainen
T Michoel
TA Manolio
TF Mackay
The ENCODE
TS Furey
VG Cheung
W Cookson
W Zhang
Y Chen
Y Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2016

Genetic differences between individuals associated with quantitative phenotypic traits, including disease states, are usually found in non-coding genomic regions. These genetic variants are often also associated with differences in expression levels of nearby genes (they are "expression quantitative trait loci" or eQTLs for short) and presumably play a gene regulatory role, affecting the status of molecular networks of interacting genes, proteins and metabolites. Computational systems biology approaches to reconstruct causal gene networks from large-scale omics data have therefore become essential to understand the structure of networks controlled by eQTLs together with other regulatory genes, and to generate detailed hypotheses about the molecular mechanisms that lead from genotype to phenotype. Here we review the main analytical methods and software to identify eQTLs and their associated genes, to reconstruct co-expression networks and modules, to reconstruct causal Bayesian gene and module networks, and to validate predicted networks in silico. Comment: minor revision with typos corrected; review article; 24 pages, 2 figures

arXiv.org e-Print Archive

Crossref

Polymorphisms on SSC15q21-q26 Containing QTL for reproduction in Swine and its association with litter size

Author: Andersson-Eklund L
Ardlie KG
Barrett JC
Chen H
Cui JX
de Koning DJ
Du HL
Fujisawa H
Geesaman BJ
Gluzman-Poltorak Z
Hongli Du
Jeon JT
Jianxun Cui
Jing Chen
Kitsukawa T
Knott SA
Lohela M
Neufeld G
Nezer C
Rathje TA
Rieder MJ
Rioux JD
Rohrer GA
Sachidanandam R
Shen J
Stephens M
Stoll M
Van Eerdewegh P
Van Laere AS
Wang DG
Xiaoning Wang
Xiquan Zhang
Publication venue: Sociedade Brasileira de Genética
Publication date: 01/01/2009

Several quantitative trait loci (QTL) for important reproductive traits (ovulation rate) have been identified on the porcine chromosome 15 (SSC15). To assist in the selection of positional candidate swine genes for these QTL on SSC15, twenty-one genes had already been assigned to SSC15 in a previous study in our lab, by using the radiation hybrid panel IMpRH. Further polymorphism studies were carried out on these positional candidate genes with four breeds of pigs (Duroc, Erhualian, Dahuabai and Landrace) harboring significant differences in reproduction traits. A total of nineteen polymorphisms were found in 21 genes. Among these, seven in six genes were used for association studies, whereby NRP2 polymorphism was found to be significantly (p < 0.05) associated with litter-size traits. NRP2 might be a candidate gene for pig-litter size based on its chromosome location (Du et al., 2006), significant association with litter-size traits and relationships with Sema and the VEGF super families

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

PubMed Central

Polymorphisms of XRCC4 are involved in reduced colorectal cancer risk in Chinese schizophrenia patients

Author: A Grinshpoon
Baocheng Liu
CT Yan
DE Barnes
F Dudbridge
F Faul
Fengping Yang
Guang He
Guoyin Feng
Jinfen Wang
JK Park
Jue Ji
KG Ardlie
Lei Wang
Lin He
M Cohen
MS Junop
PB Mortensen
PC Sham
Peng Chen
PF Sullivan
PJ Hayden
Qingzhu Zhao
Quan Wang
S Oksbjerg Dalton
SG Schwab
SO Dalton
T Sugai
Tao Li
Ti Wang
VS Catts
Xingwang Li
Y Barak
Y Benjamini
Y Gao
Yang Wang
Yanzeng Xiao
Yifeng Xu
YJ Lien
YY Shi
Zhihai Peng
Publication venue: BioMed Central
Publication date: 01/01/2010

Background: Genetic factors related to the regulation of apoptosis in schizophrenia patients may be involved in a reduced vulnerability to cancer. XRCC4 is one of the potential candidate genes associated with schizophrenia which might induce colorectal cancer resistance. Methods: To examine the genetic association between colorectal cancer and schizophrenia, we analyzed five SNPs (rs6452526, rs2662238, rs963248, rs35268, rs2386275) covering ~205.7 kb in the region of XRCC4. Results: We observed that two of the five genetic polymorphisms showed statistically significant differences between 312 colorectal cancer subjects without schizophrenia and 270 schizophrenia subjects (rs6452536, p = 0.004, OR 0.61, 95% CI 0.44-0.86; rs35268, p = 0.028, OR 1.54, 95% CI 1.05-2.26). Moreover, the haplotype which combined all five markers was the most significant, giving a global p = 0.0005. Conclusions: Our data indicate for the first time that XRCC4 may be a potential protective gene towards schizophrenia, conferring reduced susceptibility to colorectal cancer in the Han Chinese population.

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Comparison of linkage disequilibrium levels in Iranian indigenous cattle using whole genome SNPs data

Author: A Tenesa
AM Pérez O’Brien
BJ Hayes
C Flury
D Purfield
DF Conrad
EM Sellner
F Farnir
F Mokry
FX Du
JF Taylor
JJS Gouveia
JK Pritchard
JR Meadows
KE Kemper
KG Ardlie
L Kruglyak
LR Porto-Neto
M Sargolzaei
M Zhu
MS Khatkar
N Harmegnies
R Espigolan
R Salomon-Torres
R Villa-Angulo
RC Lewontin
S Purcell
S Qanbari
SD McKay
TH Meuwissen
TY Kiselyova
W Stephan
WG Hill
Publication venue: 'Springer Science and Business Media LLC'
Publication date

Crossref

Scanning and filling : ultra-dense SNP genotyping combining genotyping-by-sequencing, SNP array and whole-genome resequencing data

Author: AE Lipka
B Howie
BN Howie
D Ellinghaus
D Jarquín
Davoud Torkamaneh
Francois Belzile
H Li
H Sonah
HD Daetwyler
J Crossa
J Poland
J Schmutz
J Zheng
JE Rutkoski
K Hao
KG Ardlie
LR Porto-Neto
M Wang
MA Gore
MD Donato
MH Santana
Nicholas A. Tinker
NT Ha
O Delaneau
P Scheet
Q Song
Q Zhu
RJ Elshire
S Browning
S He
S Kim
S Purcell
S Shifman
X Huang
X Xu
Y Li
YB Fu
YF Pei
Z Yang
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 10/07/2015

Genotyping-by-sequencing (GBS) represents a highly cost-effective high-throughput genotyping approach. By nature, however, GBS is subject to generating sizeable amounts of missing data and these will need to be imputed for many downstream analyses. The extent to which such missing data can be tolerated in calling SNPs has not been explored widely. In this work, we first explore the use of imputation to fill in missing genotypes in GBS datasets. Importantly, we use whole genome resequencing data to assess the accuracy of the imputed data. Using a panel of 301 soybean accessions, we show that over 62,000 SNPs could be called when tolerating up to 80% missing data, a five-fold increase over the number called when tolerating up to 20% missing data. At all levels of missing data examined (between 20% and 80%), the resulting SNP datasets were of uniformly high accuracy (96–98%). We then used imputation to combine complementary SNP datasets derived from GBS and a SNP array (SoySNP50K). We thus produced an enhanced dataset of >100,000 SNPs and the genotypes at the previously untyped loci were again imputed with a high level of accuracy (95%). Of the >4,000,000 SNPs identified through resequencing 23 accessions (among the 301 used in the GBS analysis), 1.4 million tag SNPs were used as a reference to impute this large set of SNPs on the entire panel of 301 accessions. These previously untyped loci could be imputed with around 90% accuracy. Finally, we used the 100K SNP dataset (GBS + SoySNP50K) to perform a GWAS on seed oil content within this collection of soybean accessions. Both the number of significant marker-trait associations and the peak significance levels were improved considerably using this enhanced catalog of SNPs relative to a smaller catalog resulting from GBS alone at 20% missing data. Our results demonstrate that imputation can be used to fill in both missing genotypes and untyped loci with very high accuracy and that this leads to more powerful genetic analyses

Crossref

Directory of Open Access Journals

PubMed Central

CorpusUL