Search CORE

54 research outputs found

A robust clustering algorithm for identifying problematic samples in genome-wide association studies

Author: Amy Strange
Chris C.A. Spencer
Colin Freeman
Céline Bellenguez
Genetic Analysis of Psoriasis Consortium & the WTCCC2
Hadi
Peter Donnelly
The International Multiple Sclerosis Genetics Consortium & the WTCCC2
The UK IBD Genetics Consortium & the WTCCC2
Publication venue: Oxford University Press
Publication date: 01/01/2012
Field of study

Summary: High-throughput genotyping arrays provide an efficient way to survey single nucleotide polymorphisms (SNPs) across the genome in large numbers of individuals. Downstream analysis of the data, for example in genome-wide association studies (GWAS), often involves statistical models of genotype frequencies across individuals. The complexities of the sample collection process and the potential for errors in the experimental assay can lead to biases and artefacts in an individual's inferred genotypes. Rather than attempting to model these complications, it has become a standard practice to remove individuals whose genome-wide data differ from the sample at large. Here we describe a simple, but robust, statistical algorithm to identify samples with atypical summaries of genome-wide variation. Its use as a semi-automated quality control tool is demonstrated using several summary statistics, selected to identify different potential problems, and it is applied to two different genotyping platforms and sample collections

Crossref

PubMed Central

Oxford University Research Archive

University of Queensland eSpace

Author Correction: Cross-ancestry genome-wide association analysis of corneal thickness strengthens link between complex and Mendelian eye diseases.

Author: Aung Tin
Bailey Jessica N Cooke
Beutel Manfred E
Blue Mountains Eye Study - GWAS group
Bonnemaijer Pieter
Boutin Thibaud
Burdon Kathryn P
Bykhovskaya Yelena
Cheng Ching-Yu
Craig Jamie E
Cuellar-Partida Gabriel
Foster Paul J
Gharahkhani Puya
Haines Jonathan L
Hammond Christopher J
Hayward Caroline
Hewitt Alex W
Hysi Pirro G
Höhn René
Iglesias Adriana I
Jonas Jost B
Kang Jae H
Kearns Lisa S
Khawaja Anthony P
Khor Chiea Chuen
Klaver Caroline CW
Li Xiaohui
Lucas Sionne EM
MacGregor Stuart
Mackey David A
Martin Nicholas G
Mills Richard A
Mishra Aniket
Mitchell Paul
Montgomery Grant W
Nag Abhishek
NEIGHBORHOOD consortium
Pasquale Louis R
Pfeiffer Norbert
Polašek Ozren
Rabinowitz Yaron S
Rotter Jerome I
Schmidtmann Irene
Shi Yuan
Siscovick David
Souzeau Emmanuelle
Springelkamp Henriët
Staffieri Sandra E
Taylor Kent D
Tham Yih Chung
Uitterlinden André G
van Duijn Cornelia M
van Leeuwen Elisabeth M
Vitart Veronique
Vithana Eranga N
Wellcome Trust Case Control Consortium 2 (WTCCC2)
Wiggs Janey L
Willoughby Colin E
Wilson James F
Wong Tien Yin
Yazar Seyhan
Zeller Tanja
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

Emmanuelle Souzeau, who contributed to analysis of data, was inadvertently omitted from the author list in the originally published version of this Article. This has now been corrected in both the PDF and HTML versions of the Article

Directory of Open Access Journals

eScholarship - University of California

University of Miami: Scholarship Miami

Polymorphism in a lincRNA Associates with a Doubled Risk of Pneumococcal Bacteremia in Kenyan Children.

Author: Band Gavin
Bellenguez Céline
Berkley James A
Blackwell Jenefer M
Bramon Elvira
Brown Matthew A
Bumpstead Suzannah J
Casas Juan P
Chapman Stephen J
Corvin Aiden
Deloukas Panos
Donnelly Peter
Dronov Serge
Duncanson Audrey
Edkins Sarah
Freeman Colin
Giannoulatou Eleni
Gilchrist James J
Gray Emma
Hill Adrian VS
Hunt Sarah E
Jankowski Janusz
Kenyan Bacteraemia Study Group
Khandwalla Iqbal
Kilifi Bacteraemia Surveillance Group
Kitsao Barnes S
Langford Cordelia
Lowe Brett S
Macharia Alex W
Markus Hugh S
Mathew Christopher G
Mills Tara C
Mohammed Shebe
Morpeth Susan C
Mturi Neema
Mwangi Isaiah
Mwarumba Salim
Naranbhai Vivek
Ndila Carolyne
Ndungu Anne W
Njuguna Patricia
Palmer Colin NA
Pearson Richard D
Peltonen Leena
Pirinen Matti
Plomin Robert
Rautanen Anna
Rockett Kirk A
Sawcer Stephen J
Scott J Anthony G
Spencer Chris CA
Strange Amy
Su Zhan
Trembath Richard C
Uyoga Sophie
Viswanathan Ananth C
Vukcevic Damjan
Wellcome Trust Case Control Consortium 2 (WTCCC2)
Williams Thomas N
Wood Nicholas W
Publication venue: Am J Hum Genet
Publication date: 28/03/2016
Field of study

Bacteremia (bacterial bloodstream infection) is a major cause of illness and death in sub-Saharan Africa but little is known about the role of human genetics in susceptibility. We conducted a genome-wide association study of bacteremia susceptibility in more than 5,000 Kenyan children as part of the Wellcome Trust Case Control Consortium 2 (WTCCC2). Both the blood-culture-proven bacteremia case subjects and healthy infants as controls were recruited from Kilifi, on the east coast of Kenya. Streptococcus pneumoniae is the most common cause of bacteremia in Kilifi and was thus the focus of this study. We identified an association between polymorphisms in a long intergenic non-coding RNA (lincRNA) gene (AC011288.2) and pneumococcal bacteremia and replicated the results in the same population (p combined = 1.69 × 10(-9); OR = 2.47, 95% CI = 1.84-3.31). The susceptibility allele is African specific, derived rather than ancestral, and occurs at low frequency (2.7% in control subjects and 6.4% in case subjects). Our further studies showed AC011288.2 expression only in neutrophils, a cell type that is known to play a major role in pneumococcal clearance. Identification of this novel association will further focus research on the role of lincRNAs in human infectious disease.Wellcome Trust (Grant ID: 084716/Z/08/Z)This is the final version of the article. It first appeared from Cell Press/Elsevier via http://dx.doi.org/10.1016/j.ajhg.2016.03.02

Elsevier - Publisher Connector

Crossref

LSHTM Research Online

PubMed Central

Spiral - Imperial College Digital Repository

Apollo (Cambridge)

University of Dundee Online Publications

King's Research Portal

An inherited duplication at the gene p21 protein-activated Kinase 7 (PAK7) is a risk factor for psychosis

Author: Bellini Stefania
Blackwood Douglas
Buizer Jacobine
Coe Bradley
Cormican Paul
Corvin Aiden
Craddock Nick
Dinan Timothy G.
Donohoe Gary
Eichler Evan E.
Elves Rachel L.
Ennis Sean
Fahey Ciara
Freeman Colin
Giannoulatou Eleni
Gill Michael
Grozeva Detelina
Gurling Hugh
Hultman Christina
Johnstone Mandy
Kelleher Eric
Kendler Kenneth S.
Kenny Elaine M.
Kirov George
Maher Brion S.
McDonald Colm
Mcquillin Andrew
Molinos Ines
Morris Derek W.
Murphy Kieran C.
O'Callaghan Eadbhard
O'Donovan Michael
O'Dushlaine Colm T.
O'Neill Francis A.
Ophoff Roel
Pearson Richard D.
Perreault Louis Philippe Lemieux
Pirinen Matti
Purcell Shaun
Rees Elliott
Regan Regina
Riley Brien P.
Scolnick Ed
SGENE+ Consortium
Sklar Pamela
Spencer Chris C. A.
St Clair David
Stone Jennifer
Strange Amy
Sullivan Patrick
Szatkiewicz Jin
The International Schizophrenia Consortium (ISC)
The Wellcome Trust Case Control Consortium 2 (WTCCC2)
Thiselton Dawn L.
Tropea Daniela
Waddington John L.
Walsh Dermot
Walters James
Wormley Brandon
Publication venue: 'Oxford University Press (OUP)'
Publication date: 28/01/2014
Field of study

FUNDING Funding for this study was provided by the Wellcome Trust Case Control Consortium 2 project (085475/B/08/Z and 085475/Z/08/Z), the Wellcome Trust (072894/Z/03/Z, 090532/Z/09/Z and 075491/Z/04/B), NIMH grants (MH 41953 and MH083094) and Science Foundation Ireland (08/IN.1/B1916). We acknowledge use of the Trinity Biobank sample from the Irish Blood Transfusion Service; the Trinity Centre for High Performance Computing; British 1958 Birth Cohort DNA collection funded by the Medical Research Council (G0000934) and the Wellcome Trust (068545/Z/02) and of the UK National Blood Service controls funded by the Wellcome Trust. Chris Spencer is supported by a Wellcome Trust Career Development Fellowship (097364/Z/11/Z). Funding to pay the Open Access publication charges for this article was provided by the Wellcome Trust. ACKNOWLEDGEMENTS The authors sincerely thank all patients who contributed to this study and all staff who facilitated their involvement. We thank W. Bodmer and B. Winney for use of the People of the British Isles DNA collection, which was funded by the Wellcome Trust. We thank Akira Sawa and Koko Ishzuki for advice on the PAK7–DISC1 interaction experiment and Jan Korbel for discussions on mechanism of structural variation.Peer reviewedPublisher PD

Aberdeen University Research

Crossref

Online Research @ Cardiff

PubMed Central

Oxford University Research Archive

Genome-wide association studies in oesophageal adenocarcinoma and Barrett's oesophagus: a large-scale meta-analysis.

Author: Anders Mario
Anderson Lesley A
Attwood Stephen
Barr Hugh
Barrett's and Esophageal Adenocarcinoma Consortium (BEACON)
Becker Jessica
Bernstein Leslie
Bird Nigel C
Buas Matthew F
Böhmer Anne C
Caldas Carlos
Chegwidden Laura
Chow Wong-Ho
Corley Douglas A
de Caestecker John
Ell Christian
Esophageal Adenocarcinoma GenEtics Consortium (EAGLE)
Fitzgerald Rebecca C
Gammon Marilie D
Gerges Christian
Gharahkhani Puya
Gockel Ines
Hackelsberger Andreas
Hardie Laura J
Harrison Rebecca
Hess Timo
Hölscher Arnulf H
Iyer Prasad G
Izbicki Jakob R
Jankowski Janusz
Knapp Michael
Kreuser Nicole
Lagergren Jesper
Lang Hauke
Liu Geoffrey
Lorenz Dietmar
Love Sharon B
MacDonald David
MacGregor Stuart
Manner Hendrik
May Andrea
Mayershofer Rupert
Moayyedi Paul
Moebus Susanne
Neuhaus Horst
Noder Tania
Nöthen Markus M
Ott Katja
Palles Claire
Pech Oliver
Peters Wilbert HM
Pharoah Paul
Prenen Hans
Risch Harvey A
Rösch Thomas
Schmidt Claudia
Schmidt Thomas
Schumacher Brigitte
Schumacher Johannes
Shaheen Nicholas J
Tomlinson Ian
Vashist Yogesh
Vaughan Thomas L
Veits Lothar
Venerito Marino
Vieth Michael
Watson RG Peter
Weismüller Josef
Wellcome Trust Case Control Consortium 2 (WTCCC2)
Whiteman David C
Wu Anna H
Ye Weimin
Publication venue: Lancet Oncol
Publication date: 01/01/2016
Field of study

BACKGROUND: Oesophageal adenocarcinoma represents one of the fastest rising cancers in high-income countries. Barrett's oesophagus is the premalignant precursor of oesophageal adenocarcinoma. However, only a few patients with Barrett's oesophagus develop adenocarcinoma, which complicates clinical management in the absence of valid predictors. Within an international consortium investigating the genetics of Barrett's oesophagus and oesophageal adenocarcinoma, we aimed to identify novel genetic risk variants for the development of Barrett's oesophagus and oesophageal adenocarcinoma. METHODS: We did a meta-analysis of all genome-wide association studies of Barrett's oesophagus and oesophageal adenocarcinoma available in PubMed up to Feb 29, 2016; all patients were of European ancestry and disease was confirmed histopathologically. All participants were from four separate studies within Europe, North America, and Australia and were genotyped on high-density single nucleotide polymorphism (SNP) arrays. Meta-analysis was done with a fixed-effects inverse variance-weighting approach and with a standard genome-wide significance threshold (p<5 × 10-8). We also did an association analysis after reweighting of loci with an approach that investigates annotation enrichment among genome-wide significant loci. Furthermore, the entire dataset was analysed with bioinformatics approaches-including functional annotation databases and gene-based and pathway-based methods-to identify pathophysiologically relevant cellular mechanisms. FINDINGS: Our sample comprised 6167 patients with Barrett's oesophagus and 4112 individuals with oesophageal adenocarcinoma, in addition to 17 159 representative controls from four genome-wide association studies in Europe, North America, and Australia. We identified eight new risk loci associated with either Barrett's oesophagus or oesophageal adenocarcinoma, within or near the genes CFTR (rs17451754; p=4·8 × 10-10), MSRA (rs17749155; p=5·2 × 10-10), LINC00208 and BLK (rs10108511; p=2·1 × 10-9), KHDRBS2 (rs62423175; p=3·0 × 10-9), TPPP and CEP72 (rs9918259; p=3·2 × 10-9), TMOD1 (rs7852462; p=1·5 × 10-8), SATB2 (rs139606545; p=2·0 × 10-8), and HTR3C and ABCC5 (rs9823696; p=1·6 × 10-8). The locus identified near HTR3C and ABCC5 (rs9823696) was associated specifically with oesophageal adenocarcinoma (p=1·6 × 10-8) and was independent of Barrett's oesophagus development (p=0·45). A ninth novel risk locus was identified within the gene LPA (rs12207195; posterior probability 0·925) after reweighting with significantly enriched annotations. The strongest disease pathways identified (p<10-6) belonged to muscle cell differentiation and to mesenchyme development and differentiation. INTERPRETATION: Our meta-analysis of genome-wide association studies doubled the number of known risk loci for Barrett's oesophagus and oesophageal adenocarcinoma and revealed new insights into causes of these diseases. Furthermore, the specific association between oesophageal adenocarcinoma and the locus near HTR3C and ABCC5 might constitute a novel genetic marker for prediction of the transition from Barrett's oesophagus to oesophageal adenocarcinoma. Fine-mapping and functional studies of new risk loci could lead to identification of key molecules in the development of Barrett's oesophagus and oesophageal adenocarcinoma, which might encourage development of advanced prevention and intervention strategies. FUNDING: US National Cancer Institute, US National Institutes of Health, National Health and Medical Research Council of Australia, Swedish Cancer Society, Medical Research Council UK, Cambridge NIHR Biomedical Research Centre, Cambridge Experimental Cancer Medicine Centre, Else Kröner Fresenius Stiftung, Wellcome Trust, Cancer Research UK, AstraZeneca UK, University Hospitals of Leicester, University of Oxford, Australian Research Council

Aberdeen University Research

Queen's University Belfast Research Portal

University of Birmingham Research Portal

Carolina Digital Repository

White Rose Research Online

CLoK

Elsevier - Publisher Connector

University of Regensburg Publication Server

Crossref

Kölner UniversitätsPublikationsServer

PubMed Central

UCL Discovery

Oxford University Research Archive

Institutional Repository Universiteit Antwerpen

Apollo (Cambridge)

A Two-Stage Meta-Analysis Identifies Several New Loci for Parkinson's Disease

Author: Amouyel P
Arepalli S
Band G
Barker RA
Bellinguez C
Ben-Shlomo Y
Berendse HW
Berg D
Bhatia K
Biffi A
Bloem B
Bochdanovits Z
Bonin M
Bras JM
Brice A
Brockmann K
Brooks J
Burn DJ
Charlesworth G
Chen HL
Chinnery PF
Chong S
Clarke CE
Cookson MR
Cooper JM
Corvol JC
Counsell C
Damier P
Dartigues JF
de Bie RMA
de Silva R
Deloukas P
Deuschl G
Dexter DT
Dillman A
Donnelly P
Durif F
Durr A
Edkins S
Evans JR
Foltynie T
Freeman C
Gao JJ
Gardner M
Gasser T
Gibbs JR
Goate A
Gray E
Guerreiro R
Gustafsson O
Hardy J
Harris C
Hellenthal G
Hernandez DG
Heutink P
Hofman A
Hollenbeck A
Holton J
Hu M
Huang XM
Huber H
Hudson G
Hunt SE
Huttenlocher J
Illig T
Jonsson PV
Langford C
Lees A
Lesage S
Lichtner P
Limousin P
Lopez G
Lorenz D
Martinez M
McNeill A
Moorby C
Moore M
Morris H
Morrison KE
Mudanohwo E
Nalls MA
O'Sullivan SS
Pearson J
Pearson R
Perlmutter JS
Petursson H
Pirinen M
Plagnol V
Pollak P
Post B
Potter S
Ravina B
Revesz T
Riess O
Rivadeneira F
Rizzu P
Ryten M
Saad M
Sawcer S
Schapira A
Scheffer H
Schulte C
Sharma M
Shaw K
Sheerin UM
Shoulson I
Sidransky E
Simon-Sanchez J
Singleton AB
Smith C
Spencer CCA
Stefansson H
Stefansson K
Steinberg S
Stockton JD
Strange A
Su Z
Sveinbjornsdottir S
Talbot K
Tanner CM
Tashakkori-Ghanbaria A
Tison F
Trabzuni D
Traynor BJ
Uitterlinden AG
van de Warrenburg B
van Dijk KD
van Hilten JJ
Vandrovcova J
Velseboer D
Vidailhet M
Vukcevic D
Walker R
Weale ME
Wickremaratchi M
Williams N
Williams-Gray CH
Winder-Rhodes S
Wood NW
WTCCC2
Publication venue: PUBLIC LIBRARY SCIENCE
Publication date: 01/01/2011
Field of study

A previous genome-wide association (GWA) meta-analysis of 12,386 PD cases and 21,026 controls conducted by the International Parkinson's Disease Genomics Consortium (IPDGC) discovered or confirmed 11 Parkinson's disease (PD) loci. This first analysis of the two-stage IPDGC study focused on the set of loci that passed genome-wide significance in the first stage GWA scan. However, the second stage genotyping array, the ImmunoChip, included a larger set of 1,920 SNPs selected on the basis of the GWA analysis. Here, we analyzed this set of 1,920 SNPs, and we identified five additional PD risk loci (combined p<5x10(-10), PARK16/1q32, STX1B/16p11, FGF20/8p22, STBD1/4q21, and GPNMB/7p15). Two of these five loci have been suggested by previous association studies (PARK16/1q32, FGF20/8p22), and this study provides further support for these findings. Using a dataset of post-mortem brain samples assayed for gene expression (n = 399) and methylation (n = 292), we identified methylation and expression changes associated with PD risk variants in PARK16/1q32, GPNMB/7p15, and STX1B/16p11 loci, hence suggesting potential molecular mechanisms and candidate genes at these risk loci

UCL Discovery

Oxford University Research Archive

Leiden University Scholary Publications

Radboud Repository

The Irish DNA Atlas: Revealing Fine-Scale Population Structure and History within Ireland

Author: AL Price
B McEvoy
B McEvoy
B McEvoy
B McEvoy
B Weir
B Winney
C Capelli
C Dolan
C McGuigan
CT O’Dushlaine
D Petkova
EW Hill
G Fiorito
G Hellenthal
GW Dawson
J Yang
JF Wilson
JH Relethford
KC Desch
KP Coss
LM Cassidy
LT Moore
M Karakachoff
M Murphy
M Mylotte
O Delaneau
O Lao
PM Farrell
S Cronin
S Leslie
S Purcell
WE Hackett
WTCCC2 IMSGC
Y Itan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

The extent of population structure within Ireland is largely unknown, as is the impact of historical migrations. Here we illustrate fine-scale genetic structure across Ireland that follows geographic boundaries and present evidence of admixture events into Ireland. Utilising the ‘Irish DNA Atlas’, a cohort (n = 194) of Irish individuals with four generations of ancestry linked to specific regions in Ireland, in combination with 2,039 individuals from the Peoples of the British Isles dataset, we show that the Irish population can be divided in 10 distinct geographically stratified genetic clusters; seven of ‘Gaelic’ Irish ancestry, and three of shared Irish-British ancestry. In addition we observe a major genetic barrier to the north of Ireland in Ulster. Using a reference of 6,760 European individuals and two ancient Irish genomes, we demonstrate high levels of North-West French-like and West Norwegian-like ancestry within Ireland. We show that that our ‘Gaelic’ Irish clusters present homogenous levels of ancient Irish ancestries. We additionally detect admixture events that provide evidence of Norse-Viking gene flow into Ireland, and reflect the Ulster Plantations. Our work informs both on Irish history, as well as the study of Mendelian and complex disease genetics involving populations of Irish ancestry

Crossref

Directory of Open Access Journals

Edinburgh Research Explorer

Oxford University Research Archive

RCSI Repository

Explore Bristol Research

Resolving the polymorphism-in-probe problem is critical for correct interpretation of expression QTL studies.

Polymorphisms in the target mRNA sequence can greatly affect the binding affinity of microarray probe sequences, leading to false-positive and false-negative expression quantitative trait locus (QTL) signals with any other polymorphisms in linkage disequilibrium. We provide the most complete solution to this problem, by using the latest genome and exome sequence reference data to identify almost all common polymorphisms (frequency >1% in Europeans) in probe sequences for two commonly used microarray panels (the gene-based Illumina Human HT12 array, which uses 50-mer probes, and exon-based Affymetrix Human Exon 1.0 ST array, which uses 25-mer probes). We demonstrate the impact of this problem using cerebellum and frontal cortex tissues from 438 neuropathologically normal individuals. We find that although only a small proportion of the probes contain polymorphisms, they account for a large proportion of apparent expression QTL signals, and therefore result in many false signals being declared as real. We find that the polymorphism-in-probe problem is insufficiently controlled by previous protocols, and illustrate this using some notable false-positive and false-negative examples in MAPT and PRICKLE1 that can be found in many eQTL databases. We recommend that both new and existing eQTL data sets should be carefully checked in order to adequately address this issue

Crossref

UCL Discovery

PubMed Central

Edinburgh Research Explorer

Carolina Digital Repository

King's Research Portal