Search CORE

257 research outputs found

A framework for interpreting genome-wide association studies of psychiatric disorders

Author: D Altshuler
D Altshuler
DY Lin
H Stefansson
I Pe’er
II Gottesman
International Schizophrenia Consortium
J Huxley
J Marchini
JP Ioannidis
JS Witte
KA Frazer
LA Weiss
MI McCarthy
N Craddock
N Craddock
N Craddock
SJ Chanock
T Konneker
T Rankinen
TA Manolio
TA Manolio
TA Pearson
TM Frayling
Publication venue
Publication date: 01/01/2008
Field of study

Genome-wide association studies (GWAS) have yielded a plethora of new findings in the past 3 years. By early 2009, GWAS on 47 samples of subjects with attention-deficit hyperactivity disorder, autism, bipolar disorder, major depressive disorder and schizophrenia will be completed. Taken together, these GWAS constitute the largest biological experiment ever conducted in psychiatry (59 000 independent cases and controls, 7700 family trios and >40 billion genotypes). We know that GWAS can work, and the question now is whether it will work for psychiatric disorders. In this review, we describe these studies, the Psychiatric GWAS Consortium for meta-analyses of these data, and provide a logical framework for interpretation of some of the conceivable outcomes

Crossref

VU Research Portal

Online Research @ Cardiff

Radboud Repository

Linked read technology for assembling large complex and polyploid genomes

Author: A Akintayo
A Akintayo
A Balu
A Salman-Minkov
Alina Ott
B Nystedt
C Del Fabbro
C Feuillet
C Liu
C Rao
Chao Liu
Cheng-Ting Yeh
Clifton L. Dalgard
CS Chin
DM Altshuler
DR Bentley
E Lieberman-Aiden
E Lyons
E Lyons
GXY Zheng
H Li
H Tang
HB Tang
Heng-Cheng Hu
HV Hunt
James C. Schnable
JL Bennetzen
JR MacDonald
JS Seo
L Coombe
Linjiang Wu
LJ Briggs
M Freeling
M Kubesova
MA Hamoud
ME Rasekh
MW Crepeau
MW Libbrecht
N Rodic
N Spies
NI Weisenfeld
P SanMiguel
Patrick S. Schnable
PS Schnable
RK Saxena
RS Baucom
RS Li
S Goodwin
S Renny-Byfield
S Sarkar
S Sarkar
SJ Emrich
SJ Emrich
SM Utturkar
Soumik Sarkar
TJ Treangen
Y Fu
Y Mostovoy
YN Jiao
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2018
Field of study

Background: Short read DNA sequencing technologies have revolutionized genome assembly by providing high accuracy and throughput data at low cost. But it remains challenging to assemble short read data, particularly for large, complex and polyploid genomes. The linked read strategy has the potential to enhance the value of short reads for genome assembly because all reads originating from a single long molecule of DNA share a common barcode. However, the majority of studies to date that have employed linked reads were focused on human haplotype phasing and genome assembly. Results: Here we describe a de novo maize B73 genome assembly generated via linked read technology which contains ~ 172,000 scaffolds with an N50 of 89 kb that cover 50% of the genome. Based on comparisons to the B73 reference genome, 91% of linked read contigs are accurately assembled. Because it was possible to identify errors with \u3e 76% accuracy using machine learning, it may be possible to identify and potentially correct systematic errors. Complex polyploids represent one of the last grand challenges in genome assembly. Linked read technology was able to successfully resolve the two subgenomes of the recent allopolyploid, proso millet (Panicum miliaceum). Our assembly covers ~ 83% of the 1 Gb genome and consists of 30,819 scaffolds with an N50 of 912 kb. Conclusions: Our analysis provides a framework for future de novo genome assemblies using linked reads, and we suggest computational strategies that if implemented have the potential to further improve linked read assemblies, particularly for repetitive genomes

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

FigShare

Development of the Healthy Pathways Child-Report Scales

Crossref

Springer - Publisher Connector

PubMed Central

Genomics meets HIV-1

Author: A Ciuffi
A Reymond
A Telenti
A Telenti
A Telenti
A Telenti
AA Bashirova
AC Syvanen
AG Clark
AM Sheehy
Amalio Telenti
B Mangeat
B Mangeat
B Schrofelbauer
BF Voight
BH Hahn
BR Cullen
C Marzolini
CB Moore
CD Pilcher
D Altshuler
David B. Goldstein
DB Goldstein
DC Montefiori
DM Evans
DW Haas
E Trachtenberg
EC Walsh
FA Plummer
G Bleiber
G Silvestri
GJ Towers
HJ Tsai
HM Colhoun
I Pe'er
J Chang
J Lieberman
JA Todd
JC Barrett
JC Roach
JD Lifson
JG Sambrook
JL Foster
JP Ioannidis
JW Mellors
KR Ahmadi
L Chen
LB Barreiro
M Carrington
M Schindler
M Stremlau
M Stremlau
M Stremlau
ME Quinones-Mateu
MP Davenport
MS Cohen
MW Yap
N Risch
NB Freimer
O Lambotte
OT van Opijnen
PB Gilbert
PC Sabeti
PM Sharp
R Draenert
RA Kaslow
S Mazzoli
S Nisole
S Ohkura
SJ O'Brien
SL Sawyer
SL Sawyer
VM Hirsch
W Yang
X-F Yu
Y Li
Z Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2006
Field of study

Genomics is now a core element in the effort to develop a vaccine against HIV-1. Thanks to unprecedented progress in high-throughput genotyping and sequencing, in knowledge about genetic variation in humans, and in evolutionary genomics, it is finally possible to systematically search the genome for common genetic variants that influence the human response to HIV-1. The identification of such variants would help to determine which aspects of the response to the virus are the most promising targets for intervention. However, a key obstacle to progress remains the scarcity of appropriate human cohorts available for genomic research

Crossref

Serveur académique lausannois

PubMed Central

Gemcitabine and Arabinosylcytosin Pharmacogenomics: Genome-Wide Association and Drug Response Biomarkers

Author: AE Baum
AL Price
AM Bergman
Anthony Batzler
AS Dimas
BK Shin
BM Bolstad
Brooke L. Fridley
C Lagenaur
C Schoch
D Altshuler
DF Wu
DJ Schaid
E Choy
E Half
E Sugiyama
EE Schadt
Eric J. Bernhard
F Lokiec
Gregory Jenkins
HL Kindler
I Hornstein
J Alfonso
J Li
JC Barrett
JD Storey
JD Storey
JE Wigginton
K Smid
K Yonemori
KJ Bussey
Krishna Kalari
L Bystrykh
L Li
Liang Li
Liewei Wang
LW Hertel
M Morley
M Turner
MA Hauser
MF Moffatt
MH Tomasson
P Seve
PC Gwee
PC Sabeti
PR Burton
R Sachidanandam
Richard M. Weinshilboum
RS Huang
RS Huang
S Duan
S Mukobata
S Weidinger
SB Gabriel
SJ Shukla
SR Gullans
SR Kim
SR Kim
SW Guo
T Trenkle
TB Campbell
V Heinemann
V Heinemann
W Kern
WH Feng
WH Feng
WK Bleibel
WS Kwon
X Wu
Y Li
Z Wu
Publication venue: Public Library of Science
Publication date: 01/11/2009
Field of study

Cancer patients show large individual variation in their response to chemotherapeutic agents. Gemcitabine (dFdC) and AraC, two cytidine analogues, have shown significant activity against a variety of tumors. We previously used expression data from a lymphoblastoid cell line-based model system to identify genes that might be important for the two drug cytotoxicity. In the present study, we used that same model system to perform a genome-wide association (GWA) study to test the hypothesis that common genetic variation might influence both gene expression and response to the two drugs. Specifically, genome-wide single nucleotide polymorphisms (SNPs) and mRNA expression data were obtained using the Illumina 550K® HumanHap550 SNP Chip and Affymetrix U133 Plus 2.0 GeneChip, respectively, for 174 ethnically-defined “Human Variation Panel” lymphoblastoid cell lines. Gemcitabine and AraC cytotoxicity assays were performed to obtain IC50 values for the cell lines. We then performed GWA studies with SNPs, gene expression and IC50 of these two drugs. This approach identified SNPs that were associated with gemcitabine or AraC IC50 values and with the expression regulation for 29 genes or 30 genes, respectively. One SNP in IQGAP2 (rs3797418) was significantly associated with variation in both the expression of multiple genes and gemcitabine and AraC IC50. A second SNP in TGM3 (rs6082527) was also significantly associated with multiple gene expression and gemcitabine IC50. To confirm the association results, we performed siRNA knock down of selected genes with expression that was associated with rs3797418 and rs6082527 in tumor cell and the knock down altered gemcitabine or AraC sensitivity, confirming our association study results. These results suggest that the application of GWA approaches using cell-based model systems, when combined with complementary functional validation, can provide insights into mechanisms responsible for variation in cytidine analogue response

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Genome-Wide Polymorphism and Comparative Analyses in the White-Tailed Deer (Odocoileus virginianus): A Model for Conservation Genomics

The white-tailed deer (Odocoileus virginianus) represents one of the most successful and widely distributed large mammal species within North America, yet very little nucleotide sequence information is available. We utilized massively parallel pyrosequencing of a reduced representation library (RRL) and a random shotgun library (RSL) to generate a complete mitochondrial genome sequence and identify a large number of putative single nucleotide polymorphisms (SNPs) distributed throughout the white-tailed deer nuclear and mitochondrial genomes. A SNP validation study designed to test specific classes of putative SNPs provides evidence for as many as 10,476 genome-wide SNPs in the current dataset. Based on cytogenetic evidence for homology between cow (Bos taurus) and white-tailed deer chromosomes, we demonstrate that a divergent genome may be used for estimating the relative distribution and density of de novo sequence contigs as well as putative SNPs for species without draft genome assemblies. Our approach demonstrates that bioinformatic tools developed for model or agriculturally important species may be leveraged to support next-generation research programs for species of biological, ecological and evolutionary importance. We also provide a functional annotation analysis for the de novo sequence contigs assembled from white-tailed deer pyrosequencing reads, a mitochondrial phylogeny involving 13,722 nucleotide positions for 10 unique species of Cervidae, and a median joining haplotype network as a putative representation of mitochondrial evolution in O. virginianus. The results of this study are expected to provide a detailed template enabling genome-wide sequence-based studies of threatened, endangered or conservationally important non-model organisms

Crossref

Directory of Open Access Journals

PubMed Central

Texas A&M Repository

Analysis of protein-coding genetic variation in 60,706 humans

Author: Altshuler DM
Ardissino D
Banks E
Berghout J
Birnbaum DP
Boehnke M
Cooper DN
Cummings BB
Daly MJ
Danesh J
Deflaux N
DePristo M
Do R
Donnelly S
Duncan LE
Elosua R
Estrada K
Exome Aggregation Consortium
Fennell T
Flannick J
Florez JC
Fromer M
Gabriel SB
Gauthier L
Getz G
Glatt SJ
Goldstein J
Gupta N
Hill AJ
Howrigan D
Hultman CM
Karczewski KJ
Kathiresan S
Kiezun A
Kosmicki JA
Kurki MI
Laakso M
Lek M
MacArthur DG
McCarroll S
McCarthy MI
McGovern D
McPherson R
Minikel EV
Moonshine AL
Natarajan P
Neale BM
O'Donnell-Luria AH
Orozco L
Palotie A
Peloso GM
Pierce-Hoffman E
Poplin R
Purcell SM
Rivas MA
Rose SA
Ruano-Rubio V
Ruderfer DM
Saleheen D
Samocha KE
Scharf JM
Shakir K
Sklar P
Stenson PD
Stevens C
Sullivan PF
Thomas BP
Tiao G
Tsuang MT
Tukiainen T
Tuomilehto J
Tusie-Luna MT
Ware JS
Watkins HC
Weisburd B
Wilson JG
Won HH
Yu D
Zhao F
Zou J
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 24/06/2016
Field of study

Large-scale reference data sets of human genetic variation are critical for the medical and functional interpretation of DNA sequence changes. Here we describe the aggregation and analysis of high-quality exome (protein-coding region) DNA sequence data for 60,706 individuals of diverse ancestries generated as part of the Exome Aggregation Consortium (ExAC). This catalogue of human genetic diversity contains an average of one variant every eight bases of the exome, and provides direct evidence for the presence of widespread mutational recurrence. We have used this catalogue to calculate objective metrics of pathogenicity for sequence variants, and to identify genes subject to strong selection against various classes of mutation; identifying 3,230 genes with near-complete depletion of predicted protein-truncating variants, with 72% of these genes having no currently established human disease phenotype. Finally, we demonstrate that these data can be used for the efficient filtering of candidate disease-causing variants, and for the discovery of human 'knockout' variants in protein-coding genes

Oxford University Research Archive

Spiral - Imperial College Digital Repository

Enhanced Statistical Tests for GWAS in Admixed Populations: Assessment using African Americans from CARe and a Breast Cancer Consortium

Author: A Adeyemo
AL Price
AL Price
AL Price
Alkes L. Price
Arti Tandon
B Devlin
B Newman
B Pasaniuc
B Pasaniuc
B Pasaniuc
BL Browning
BN Howie
Bogdan Pasaniuc
Brian E. Henderson
Cameron D. Palmer
CB Ambrosone
Christine B. Ambrosone
Christopher A. Haiman
CY Cheng
D Altshuler
D Reich
D Reich
David Reich
David S. Siscovick
DB Hancock
DJ Hunter
DM Altshuler
DW Jones
E Zeggini
EL Harris
Elisa V. Bandera
EM John
EM John
EM John
Emma Larkin
Ermeg L. Akylbekova
Esther M. John
G Lettre
Gary K. Chen
George J. Papanicolaou
Guillaume Lettre
Ingo Ruczinski
J Marchini
J Marchini
James G. Wilson
Jasmin Divers
Jennifer J. Hu
JK Pritchard
Joe Mychaleckyj
Joel N. Hirschhorn
Jorge L. Rodriguez-Gil
JR Palmer
K Wang
KA Frazer
KH Kjerulff
L Huang
L. Adrienne Cupples
Leslie A. Lange
Leslie Bernstein
LN Kolonel
Lynette Ekunwe
M Molokhia
M Slatkin
M Stephens
MA Nalls
MG Hayes
MI McCarthy
Michael F. Press
Mingyao Li
ML Freedman
MS Udler
MW Smith
MW Smith
Myriam Fornage
N Patterson
N Patterson
N Risch
N Zaitlen
N Zaitlen
NA Rosenberg
Nicholas J. Schork
Nick Patterson
Noah Zaitlen
PA Marchbanks
PC Prorok
Qiong Yang
R Cooper
RC Deo
Regina G. Ziegler
RF Gillum
Robert C. Millikan
Sandra L. Deming
Sarah Buxbaum
Sarah J. Nyante
Simon Myers
SJ Freedland
Solomon K. Musani
Stephen J. Chanock
Sue A. Ingles
TM Teslovich
TR Rebbeck
TR Smith
W Zheng
W. H. Linda Kao
Wei Zheng
WH Kao
X Zhu
Xiaofeng Zhu
Y Guan
Y Li
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

While genome-wide association studies (GWAS) have primarily examined populations of European ancestry, more recent studies often involve additional populations, including admixed populations such as African Americans and Latinos. In admixed populations, linkage disequilibrium (LD) exists both at a fine scale in ancestral populations and at a coarse scale (admixture-LD) due to chromosomal segments of distinct ancestry. Disease association statistics in admixed populations have previously considered SNP association (LD mapping) or admixture association (mapping by admixture-LD), but not both. Here, we introduce a new statistical framework for combining SNP and admixture association in case-control studies, as well as methods for local ancestry-aware imputation. We illustrate the gain in statistical power achieved by these methods by analyzing data of 6,209 unrelated African Americans from the CARe project genotyped on the Affymetrix 6.0 chip, in conjunction with both simulated and real phenotypes, as well as by analyzing the FGFR2 locus using breast cancer GWAS data from 5,761 African-American women. We show that, at typed SNPs, our method yields an 8% increase in statistical power for finding disease risk loci compared to the power achieved by standard methods in case-control studies. At imputed SNPs, we observe an 11% increase in statistical power for mapping disease loci when our local ancestry-aware imputation framework and the new scoring statistic are jointly employed. Finally, we show that our method increases statistical power in regions harboring the causal SNP in the case when the causal SNP is untyped and cannot be imputed. Our methods and our publicly available software are broadly applicable to GWAS in admixed populations

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

University of Miami: Scholarship Miami

Oxford University Research Archive

Genetic linkage analysis in the age of whole-genome sequencing

Author: A McKenna
AA Schaffer
AA Schaffer
AE Doyle
AS Whittemore
B Li
B Li
C Dubay
C Genomes Project
C Suo
CAB Smith
CI Amos
CT Falk
CY Cheung
CY Cheung
D Altshuler
D Bentley
D Gordon
DB Allison
DC Thomas
DG MacArthur
E Lander
E Sobel
EM Gertz
EM Smigielski
ES Lander
ES Lander
G De
GJ Mendel
GM Lathrop
GM Lathrop
GR Abecasis
GT Wang
H Louis-Dit-Picard
I Adzhubei
J Dietter
J McClellan
J Ott
J Ott
J Ott
J Ott
J Ott
J Ott
J Ott
J Ott
J Ott
J Yan
JA Tennessen
JBS Haldane
JD Terwilliger
JD Terwilliger
JE Bailey-Wilson
JE Powell
JF Gusella
JH Renwick
JH Renwick
Jing Wang
JM Lee
JR O'Connell
Jurg Ott
K Hoffmann
K Lange
K Lange
K Lange
KM Weiss
KR Smith
KS Pollard
L Kruglyak
LM Brzustowicz
LS Penrose
M Fishelson
M Knapp
M Knapp
M Su
MC Neale
NJ Schork
NM Laird
NM Laird
P Danecek
PC Sham
PD Sasieni
Q Shi
R Goldschmidt
RB Goldschmidt
RC Elston
RC Elston
RC Elston
RE Tanzi
RH Houwen
RL Santos-Cortez
RL Santos-Cortez
RS Spielman
RW Cottingham Jr.
S Basu
S Lee
S Purcell
SC Heath
SC Heath
SE Hodge
SJ Hasstedt
SM Pulst
Suzanne M. Leal
T Kamphans
VS Kostic
Z He
Publication venue
Publication date: 01/05/2015
Field of study

For many years, linkage analysis was the primary tool used for the genetic mapping of Mendelian and complex traits with familial aggregation. Linkage analysis was largely supplanted by the wide adoption of genome-wide association studies (GWASs). However, with the recent increased use of whole-genome sequencing (WGS), linkage analysis is again emerging as an important and powerful analysis method for the identification of genes involved in disease aetiology, often in conjunction with WGS filtering approaches. Here, we review the principles of linkage analysis and provide practical guidelines for carrying out linkage studies using WGS data

Crossref

Institute of Psychology,Chinese Academy Of Sciences

PubMed Central

Institutional Repository of Institute of Psychology, Chinese Academy of Sciences

Informed Conditioning on Clinical Covariates Increases Power in Case-Control Association Studies

Author: Aage Haugen
AL Price
AL Price
Albert Rosenberger
Alkes L. Price
Angela Risch
Ann W. Morgan
Anne Barton
Anthony G. Wilson
Barry I. Freedman
Benjamin Voight
BF Voight
Bogdan Pasaniuc
Brian E. Henderson
C Wallace
Carl D. Langefeld
Christopher Haiman
CI Amos
CL Kuo
D Campa
D Clayton
D Cox
D Thomas
DA Schaumberg
Daniel I. Chasman
David Altshuler
David C. Christiani
David J. Friedman
David J. Hunter
David Scherf
Debra A. Schaumberg
DJ Hunter
Donald W. Bowden
DS Falconer
Eric Tchetgen Tchetgen
ESBD Lander
G Genovese
G Jin
G Maskarinec
Giulio Genovese
GM Monsees
GV Kryukov
H Holm
HC So
Heike Bickeböller
J Dong
J Marchini
Jane Worthington
JK Field
JM Neuhaus
Joachim Heinrich
John K. Field
JR Perry
JRB Perry
Kevin M. Waters
KL Ellis
KM Waters
Laurence N. Kolonel
LD Robinson
Leif Groop
Loic Le Marchand
LT Guey
Lynne J. Hocking
M Imielinski
M Pirinen
Maria Teresa Landi
Marilyn Cornelis
Martin Walshaw
Michael Meister
ML Freedman
N Chatterjee
N Risch
N Zaitlen
N Zaitlen
Nick Patterson
NJ Risch
NJ Wald
Noah Zaitlen
NR Wray
Olaide Y. Raji
P Armitage
P Kraft
P Sulem
Pamela J. Hicks
Paul Wordsworth
Peter Kraft
Peter M. Visscher
PM Ridker
Robert M. Plenge
S Kathiresan
S Lindstrom
S Raychaudhuri
S Rose
S Van Gestel
S Zienolddiny
Samuela Pollack
Sara Lindström
SH Lee
Shanbeh Zienolddiny
SJ Chanock
Sophia Steer
Steve Eyre
T Lumley
TH Hamza
TJ Vanderweele
TM Frayling
W Thomson
WG Hill
WW Piegorsch
Z Kote-Jarai
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2012
Field of study

Genetic case-control association studies often include data on clinical covariates, such as body mass index (BMI), smoking status, or age, that may modify the underlying genetic risk of case or control samples. For example, in type 2 diabetes, odds ratios for established variants estimated from low–BMI cases are larger than those estimated from high–BMI cases. An unanswered question is how to use this information to maximize statistical power in case-control studies that ascertain individuals on the basis of phenotype (case-control ascertainment) or phenotype and clinical covariates (case-control-covariate ascertainment). While current approaches improve power in studies with random ascertainment, they often lose power under case-control ascertainment and fail to capture available power increases under case-control-covariate ascertainment. We show that an informed conditioning approach, based on the liability threshold model with parameters informed by external epidemiological information, fully accounts for disease prevalence and non-random ascertainment of phenotype as well as covariates and provides a substantial increase in power while maintaining a properly controlled false-positive rate. Our method outperforms standard case-control association tests with or without covariates, tests of gene x covariate interaction, and previously proposed tests for dealing with covariates in ascertained data, with especially large improvements in the case of case-control-covariate ascertainment. We investigate empirical case-control studies of type 2 diabetes, prostate cancer, lung cancer, breast cancer, rheumatoid arthritis, age-related macular degeneration, and end-stage kidney disease over a total of 89,726 samples. In these datasets, informed conditioning outperforms logistic regression for 115 of the 157 known associated variants investigated (P-value = 1×10−9). The improvement varied across diseases with a 16% median increase in χ2 test statistics and a commensurate increase in power. This suggests that applying our method to existing and future association studies of these diseases may identify novel disease loci

Directory of Open Access Journals

The University of Manchester - Institutional Repository

PuSH

White Rose Research Online

FigShare

University of Queensland eSpace

Lund University Publications

Crossref

Harvard University - DASH

PubMed Central

Oxford University Research Archive

King's Research Portal