Search CORE

70 research outputs found

A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data

Author: Buchanan Carrie C
Bush William S
Ritchie Marylyn D
Torstenson Eric S
Publication venue: BMJ Group
Publication date
Field of study

Crossref

PubMed Central

LD-Spline: Mapping SNPs on genotyping platforms to genomic regions using patterns of linkage disequilibrium

Author: Bush William S
Chen Guanhua
Ritchie Marylyn D
Torstenson Eric S
Publication venue: BioMed Central Ltd
Publication date: 03/12/2009
Field of study

Background Gene-centric analysis tools for genome-wide association study data are being developed both to annotate single locus statistics and to prioritize or group single nucleotide polymorphisms (SNPs) prior to analysis. These approaches require knowledge about the relationships between SNPs on a genotyping platform and genes in the human genome. SNPs in the genome can represent broader genomic regions via linkage disequilibrium (LD), and population-specific patterns of LD can be exploited to generate a data-driven map of SNPs to genes. Methods In this study, we implemented LD-Spline, a database routine that defines the genomic boundaries a particular SNP represents using linkage disequilibrium statistics from the International HapMap Project. We compared the LD-Spline haplotype block partitioning approach to that of the four gamete rule and the Gabriel et al. approach using simulated data; in addition, we processed two commonly used genome-wide association study platforms. Results We illustrate that LD-Spline performs comparably to the four-gamete rule and the Gabriel et al. approach; however as a SNP-centric approach LD-Spline has the added benefit of systematically identifying a genomic boundary for each SNP, where the global block partitioning approaches may falter due to sampling variation in LD statistics. Conclusion LD-Spline is an integrated database routine that quickly and effectively defines the genomic region marked by a SNP using linkage disequilibrium, with a SNP-centric block definition algorithm

Carolina Digital Repository

The effects of linkage disequilibrium in large scale SNP datasets for MDR

Author: Benjamin J Grady
Eric S Torstenson
FJ Richards
JK Yang
LR Cardon
M Slatkin
Marylyn D Ritchie
MD Ritchie
MD Ritchie
P Gasso
S Dudek
SD Turner
SH Nordgard
SM Naushad
TL Edwards
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background In the analysis of large-scale genomic datasets, an important consideration is the power of analytical methods to identify accurate predictive models of disease. When trying to assess sensitivity from such analytical methods, a confounding factor up to this point has been the presence of linkage disequilibrium (LD). In this study, we examined the effect of LD on the sensitivity of the Multifactor Dimensionality Reduction (MDR) software package. Results Four relative amounts of LD were simulated in multiple one- and two-locus scenarios for which the position of the functional SNP(s) within LD blocks varied. Simulated data was analyzed with MDR to determine the sensitivity of the method in different contexts, where the sensitivity of the method was gauged as the number of times out of 100 that the method identifies the correct one- or two-locus model as the best overall model. As the amount of LD increases, the sensitivity of MDR to detect the correct functional SNP drops but the sensitivity to detect the disease signal and find an indirect association increases. Conclusions Higher levels of LD begin to confound the MDR algorithm and lead to a drop in sensitivity with respect to the identification of a direct association; it does not, however, affect the ability to detect indirect association. Careful examination of the solution models generated by MDR reveals that MDR can identify loci in the correct LD block; though it is not always the functional SNP. As such, the results of MDR analysis in datasets with LD should be carefully examined to consider the underlying LD structure of the dataset.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Recommended from our members

Electronic medical records and genomics (eMERGE) network exploration in cataract: Several new potential susceptibility loci

Author: Berg Richard L.
Carlson Christopher S.
Carrell Dave S.
Chen Lin
Crosslin David R.
Denny Joshua C.
Goodloe Robert J.
Hall Molly A.
Jarvik Gail
Li Rongling
Linneman James G.
McCarty Catherine A.
Pathak Jyoti
Peissig Peggy
Ramirez Andrea H.
Rasmussen Luke V.
Ritchie Marylyn D.
Torstenson Eric S.
Turner Stephen D.
Verma Shefali S.
Wang Xiaoming
Wilke Russell A.
Wolf Wendy A.
Publication venue: Molecular Vision
Publication date: 03/11/2014
Field of study

Purpose Cataract is the leading cause of blindness in the world, and in the United States accounts for approximately 60% of Medicare costs related to vision. The purpose of this study was to identify genetic markers for age-related cataract through a genome-wide association study (GWAS). Methods: In the electronic medical records and genomics (eMERGE) network, we ran an electronic phenotyping algorithm on individuals in each of five sites with electronic medical records linked to DNA biobanks. We performed a GWAS using 530,101 SNPs from the Illumina 660W-Quad in a total of 7,397 individuals (5,503 cases and 1,894 controls). We also performed an age-at-diagnosis case-only analysis. Results: We identified several statistically significant associations with age-related cataract (45 SNPs) as well as age at diagnosis (44 SNPs). The 45 SNPs associated with cataract at p<1×10−5 are in several interesting genes, including ALDOB, MAP3K1, and MEF2C. All have potential biologic relationships with cataracts. Conclusions: This is the first genome-wide association study of age-related cataract, and several regions of interest have been identified. The eMERGE network has pioneered the exploration of genomic associations in biobanks linked to electronic health records, and this study is another example of the utility of such resources. Explorations of age-related cataract including validation and replication of the association results identified herein are needed in future studies

Harvard University - DASH

LD-Spline: Mapping SNPs on genotyping platforms to genomic regions using patterns of linkage disequilibrium

Author: A Torkamani
B Devlin
BS Weir
C Li
C Schmegner
CS Carlson
D Fallin
Eric S Torstenson
Guanhua Chen
IB Borecki
International HapMap Consortium
J Aubert
J Cohen
JC Barrett
JP Lewinger
K Hao
M Slatkin
MA Eberle
MA Eberle
MA Province
Marylyn D Ritchie
N Wang
NE Morton
PC Sabeti
RC Lewontin
S Purcell
SB Gabriel
TA Manolio
TD Wickens
TJ Hubbard
TL Edwards
W Zhang
William S Bush
Publication venue: BioMed Central
Publication date: 01/01/2009
Field of study

Abstract Background Gene-centric analysis tools for genome-wide association study data are being developed both to annotate single locus statistics and to prioritize or group single nucleotide polymorphisms (SNPs) prior to analysis. These approaches require knowledge about the relationships between SNPs on a genotyping platform and genes in the human genome. SNPs in the genome can represent broader genomic regions via linkage disequilibrium (LD), and population-specific patterns of LD can be exploited to generate a data-driven map of SNPs to genes. Methods In this study, we implemented LD-Spline, a database routine that defines the genomic boundaries a particular SNP represents using linkage disequilibrium statistics from the International HapMap Project. We compared the LD-Spline haplotype block partitioning approach to that of the four gamete rule and the Gabriel et al. approach using simulated data; in addition, we processed two commonly used genome-wide association study platforms. Results We illustrate that LD-Spline performs comparably to the four-gamete rule and the Gabriel et al. approach; however as a SNP-centric approach LD-Spline has the added benefit of systematically identifying a genomic boundary for each SNP, where the global block partitioning approaches may falter due to sampling variation in LD statistics. Conclusion LD-Spline is an integrated database routine that quickly and effectively defines the genomic region marked by a SNP using linkage disequilibrium, with a SNP-centric block definition algorithm.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Carolina Digital Repository

A General Framework for Formal Tests of Interaction after Exhaustive Search Methods with Applications to MDR and MDR-PDT

Author: A Templeton
AA Motsinger
AA Motsinger
CS Coffey
DB Hancock
DM Evans
DR Velez
DW Hosmer
Eden R. Martin
ER Martin
ER Martin
Eric S. Torstenson
J Marchini
J Millstein
JD Owens
JH Moore
JH Moore
JH Moore
JH Moore
KA Pattin
KD Siegmund
KY Liang
LW Hahn
M Schmidt
Marylyn D. Ritchie
MD Ritchie
MD Ritchie
MF Baksh
MP Bass
N Risch
R Culverhouse
RE Bellman
RL Milne
RS Michalsky
Schlicting
Scott M. Dudek
SL Zeger
SM Dudek
Stephen D. Turner
T Hastie
Thorkild I. A. Sorensen
TL Edwards
TL Edwards
TL Edwards
TL Edwards
Todd L. Edwards
WS Bush
WS Bush
Z Feng
Publication venue: Public Library of Science
Publication date: 01/01/2010
Field of study

The initial presentation of multifactor dimensionality reduction (MDR) featured cross-validation to mitigate over-fitting, computationally efficient searches of the epistatic model space, and variable construction with constructive induction to alleviate the curse of dimensionality. However, the method was unable to differentiate association signals arising from true interactions from those due to independent main effects at individual loci. This issue leads to problems in inference and interpretability for the results from MDR and the family-based compliment the MDR-pedigree disequilibrium test (PDT). A suggestion from previous work was to fit regression models post hoc to specifically evaluate the null hypothesis of no interaction for MDR or MDR-PDT models. We demonstrate with simulation that fitting a regression model on the same data as that analyzed by MDR or MDR-PDT is not a valid test of interaction. This is likely to be true for any other procedure that searches for models, and then performs an uncorrected test for interaction. We also show with simulation that when strong main effects are present and the null hypothesis of no interaction is true, that MDR and MDR-PDT reject at far greater than the nominal rate. We also provide a valid regression-based permutation test procedure that specifically tests the null hypothesis of no interaction, and does not reject the null when only main effects are present. The regression-based permutation test implemented here conducts a valid test of interaction after a search for multilocus models, and can be applied to any method that conducts a search to find a multilocus model representing an interaction

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

University of Miami: Scholarship Miami

Erratum to: A multi-stage genome-wide association study of uterine fibroids in African Americans

Author: Allen Alexander
Denny Joshua C.
Dickinson Scott
Edwards Todd L.
Gallagher C. Scott
Hartmann Katherine E.
Hellwege Jacklyn N.
Hinds David A.
Im Hae Kyung
Jeff Janina M.
Jones Sarah F.
Kenny Eimear E.
Mancuso Nicholas
Morton Cynthia C.
Palmer Julie R.
Pasaniuc Bogdan
Reich David
Roden Dan M.
Rohland Nadin
Rosenberg Lynn
Ruiz-Narváez Edward A.
Stewart Elizabeth A.
Tandon Arti
Torstenson Eric S.
Velez Edwards Digna R.
Wellons Melissa
Wise Lauren A.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/11/2017
Field of study

The article "A multi-stage genome-wide association study of uterine fibroids in African Americans", written by Jacklyn N. Hellwege, was originally published Online First without open access. After publication in volume 136, issue 10, page 1363-1373 the author decided to opt for Open Choice and to make the article an open access publication. Therefore, the copyright of the article has been changed t

Crossref

Harvard University - DASH

eScholarship - University of California

The University of Manchester - Institutional Repository

Platelet-Related Variants Identified by Exomechip Meta-analysis in 157,293 Individuals

Author: Abecasis Goncalo R.
Auer Paul L.
Bartz Traci M.
Becker Diane M.
Becker Lewis C.
Boerwinkle Eric
Bottinger Erwin P.
Brody Jennifer A.
Burt Amber
Chami Nathalie
Chen Ming-Huei
Crosslin David R.
Cushman Mary
Deary Ian J.
Dehghan Abbas
Deloukas Panos
de Denus Simon
Dube Marie-Pierre
Edwards Todd L.
Eicher John D.
Elliot Paul
Engström Gunnar
Erdmann Jeanette
Esko Tõnu
Evangelou Evangelos
Evans Michele K.
Faraday Nauder
Floyd James S.
Fornage Myriam
Franco Oscar H.
Ganesh Santhi K.
Gao He
Giri Ayush
Greinacher Andreas
Gudnason Vilmundur
Harris Tamara B.
Hayward Caroline
Hernesniemi Jussi
Highland Heather M.
Hill W. David
Hirschhorn Joel
Johnson Andrew D.
Kacprowski Tim
Kathiresan Sekar
Kähönen Mika
Lange Ethan
Lange Leslie A.
Launer Lenore J.
Lehtimäki Terho
Lettre Guillaume
Li Jin
Liewald David C.M.
Liu Dajiang J.
Liu Yongmei
Loos Ruth J.F.
Lu Yingchang
Lyytikäinen Leo-Pekka
Manichaikul Ani
Mathias Rasika A.
Melander Olle
Metspalu Andres
Mihailov Evelin
Mononen Nina
Mägi Reedik
Nalls Mike A.
Nickerson Deborah A.
Nikus Kjell
Nomura Akihiro
Orho-Melander Marju
O’Donoghue Michelle L.
Pankratz Nathan
Pazoki Raha
Peloso Gina M.
Polfus Linda
Psaty Bruce M.
Quarells Rakale
Raffield Laura M.
Raitakari Olli T.
Raitoharju Emma
Reiner Alex P.
Rice Kenneth M.
Rich Stephen S.
Richard Melissa
Rioux John D.
Rotter Jerome I.
Samani Nilesh J.
Schick Ursula M.
Schunkert Heribert
Schurmann Claudia
Slater Andrew J.
Smith Albert Vernon
Starr J.M.
Tajuddin Salman M.
Tardif Jean-Claude
Thiele Thomas
Torstenson Eric S.
Tracy Russell P.
Tzoulaki Ioanna
Uitterlinden André G.
Vacchi-Suzzi Caterina
van Rooij Frank J.A.
Velez Edwards Digna R.
Vergnaud Anne-Claire
Völker Uwe
Völzke Henry
Wallentin Lars
Waterworth Dawn M.
White Harvey D.
Willer Cristen J.
Wilson James G.
Yanek Lisa R.
Zakai Neil A.
Zonderman Alan B.
Publication venue
Publication date: 01/01/2016
Field of study

Platelet production, maintenance, and clearance are tightly controlled processes indicative of platelets important roles in hemostasis and thrombosis. Platelets are common targets for primary and secondary prevention of several conditions. They are monitored clinically by complete blood counts, specifically with measurements of platelet count (PLT) and mean platelet volume (MPV). Identifying genetic effects on PLT and MPV can provide mechanistic insights into platelet biology and their role in disease. Therefore, we formed the Blood Cell Consortium (BCX) to perform a large-scale meta-analysis of Exomechip association results for PLT and MPV in 157,293 and 57,617 individuals, respectively. Using the low-frequency/rare coding variant-enriched Exomechip genotyping array, we sought to identify genetic variants associated with PLT and MPV. In addition to confirming 47 known PLT and 20 known MPV associations, we identified 32 PLT and 18 MPV associations not previously observed in the literature across the allele frequency spectrum, including rare large effect (FCER1A), low-frequency (IQGAP2, MAP1A, LY75), and common (ZMIZ2, SMG6, PEAR1, ARFGAP3/PACSIN2) variants. Several variants associated with PLT/MPV (PEAR1, MRVI1, PTGES3) were also associated with platelet reactivity. In concurrent BCX analyses, there was overlap of platelet-associated variants with red (MAP1A, TMPRSS6, ZMIZ2) and white (PEAR1, ZMIZ2, LY75) blood cell traits, suggesting common regulatory pathways with shared genetic architecture among these hematopoietic lineages. Our large-scale Exomechip analyses identified previously undocumented associations with platelet traits and further indicate that several complex quantitative hematological, lipid, and cardiovascular traits share genetic factors

PubMed Central

Carolina Digital Repository

eScholarship - University of California

A multi-stage genome-wide association study of uterine fibroids in African Americans

Author: A Barbeira
AB Moore
AL Price
Alexander Allen
Arti Tandon
B Aissani
B Aissani
B Vollenhoven
BJ Borah
Bogdan Pasaniuc
C Nagata
C Templeman
C. Scott Gallagher
CJ Willer
Cynthia C. Morton
Dan M. Roden
David A. Hinds
David Reich
DD Baird
DD Baird
DD Baird
Digna R. Velez Edwards
E Faerstein
Edward A. Ruiz-Narváez
Eimear E. Kenny
Elizabeth A. Stewart
ER Gamazon
Eric S. Torstenson
G Lettre
G Orozco
GD Friedman
Genotype-Tissue Expression (GTEx)
Hae Kyung Im
J Cobb
J Marchini
J Ott
J Pulley
Jacklyn N. Hellwege
Janina M. Jeff
Joshua C. Denny
Julie R. Palmer
K Zhang
Katherine E. Hartmann
KL Terry
L Feingold-Link
L Luo
LA Wise
LA Wise
LA Wise
LA Wise
Lauren A. Wise
LJ Carithers
LM Marshall
Lynn Rosenberg
M Klemke
M Mele
M Rezazadeh
Melissa Wellons
MF Wellons
MH Kim
MZ Braganza
N Spinos
Nadin Rohland
Nicholas Mancuso
O Delaneau
PC Cha
Q Duan
R Luoto
S Purcell
Sarah F. Jones
Scott Dickinson
SL Edwards
SL Eggert
SL Myers
TL Edwards
Todd L. Edwards
V Kanamarlapudi
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space

Author: Abel Haley J
Afgan Enis
Baker Dannon
Banasiewicz M Katie
Banks Eric
Baumann Alexander
Baumann Michael
Bernard Clare
Blauvelt Lon
Cabansay Louise
Caetano-Anollés Derek
Canas Justin
Carey Vincent J
Carroll Robert J
Chaluvadi Sushma
Chilton John
Clements Dave
Cox Katherine EL
Culotti Alessandro
Di Francesco Valentina
Disman William
Ellrott Kyle
Geistlinger Ludwig
Ghanaim Elena M
Goecks Jeremy
Golitsynskiy Sergey
Grossman Robert L
Gupta Namrata
Hajian Allie
Hall Ira M
Hannafious Brian
Hansen Kasper D
Harris Tim
Hastie Mim
Herman Kate
Hutter Carolyn
Jalili Vahid
Kammers Kai
Kiernan Elizabeth
Kovalsy Anton
Kucher Nataliya
Lawson Jonathan
Leek Jeffrey T
Lucas Julian
Luria Anne O’Donnell
Mahmoud Alexandru
McDade Frances
Morgan Martin
Mosher Stephen
Munshi Ruchi
Nekrutenko Anton
Oh Sehyun
Osborn Kevin
Ostrovsky Alexander
Overbeck Charles
O’Connor Brian D
O’Farrell Ash
Paten Benedict
Patterson Candace
Philippakis Anthony A
Ramos Marcel
Reddy Radhika
Reeves Valerie
Reid Charles
Rogers Dave
Rula Andrew
s Yuen Deni
Sargent Luke
Schatz Michael C
Sen Shurjo K
Sheets Elizabeth A
Shepherd Lori
Simeon Marianie
Steinberg David Charles
Stevens Ana
Stubbs BJ
Suderman Keith
Tan Frederick J
Taylor Casey Overby
Taylor M Morgan
Thomas Salin
Title Robert
Torstenson Eric
Turaga Nitesh
Van der Auwera Geraldine A
Vessio Jennifer
Vizzier Benton A
Vosburg Trish
Waldron Levi
Walker Jason
Walsh Brian
Wang Qi
Wang Ting
Warren Noah
Wellington Christopher
Wheelan Sarah J
Wiley Ken L
Wuichet Kristin
Yuksel Kaan
Zarate Samantha
Publication venue: 'Elsevier BV'
Publication date: 12/01/2022
Field of study

The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, describe its ecosystem and interoperability with other platforms, and highlight how this platform and associated initiatives contribute to improved genomic data sharing efforts. The AnVIL is a federated cloud platform designed to manage and store genomics and related data, enable population-scale analysis, and facilitate collaboration through the sharing of data, code, and analysis results. By inverting the traditional model of data sharing, the AnVIL eliminates the need for data movement while also adding security measures for active threat detection and monitoring and provides scalable, shared computing resources for any researcher. We describe the core data management and analysis components of the AnVIL, which currently consists of Terra, Gen3, Galaxy, RStudio/Bioconductor, Dockstore, and Jupyter, and describe several flagship genomics datasets available within the AnVIL. We continue to extend and innovate the AnVIL ecosystem by implementing new capabilities, including mechanisms for interoperability and responsible data sharing, while streamlining access management. The AnVIL opens many new opportunities for analysis, collaboration, and data sharing that are needed to drive research and to make discoveries through the joint analysis of hundreds of thousands to millions of genomes along with associated clinical and molecular data types

Cold Spring Harbor Laboratory Institutional Repository