Search CORE

33 research outputs found

Genome-wide association studies of metabolites in Finnish men identify disease-relevant loci

Author: Abel Haley
Das Indraniel
et al
Ganel Liron
Locke Adam E
Stitziel Nathan O
Yin Xianyong
Young Erica
Publication venue: Digital Commons@Becker
Publication date: 28/03/2022
Field of study

Few studies have explored the impact of rare variants (minor allele frequency \u3c 1%) on highly heritable plasma metabolites identified in metabolomic screens. The Finnish population provides an ideal opportunity for such explorations, given the multiple bottlenecks and expansions that have shaped its history, and the enrichment for many otherwise rare alleles that has resulted. Here, we report genetic associations for 1391 plasma metabolites in 6136 men from the late-settlement region of Finland. We identify 303 novel association signals, more than one third at variants rare or enriched in Finns. Many of these signals identify genes not previously implicated in metabolite genome-wide association studies and suggest mechanisms for diseases and disease-related traits

Digital Commons@Becker

Genome modeling system: A knowledge management platform for genomics

Author: Abbott Benjamin S
Abbott Travis E
Ainscough Benjamin J
Belter Edward A
Brummett Anthony M
Burnett Mark M
Callaway Matthew B
Carmichael Lynn K
Chen Ken
Clark Eric
Coffman Adam C
Das Indraniel
Dees Nathan D
Derickson Brian R
Ding Li
Dooling David J
Du Feiyu
Dukes Adam
Eldred James M
Fan Xian
Ferguson Ian T
Griffith Malachi
Griffith Obi L
Harris Christopher C
Hawkins Amy E
Helper Todd G
Hundal Jasreet
Kandoth Cyriac
Kim Kyung H
Kiwala Michael J
Koboldt Daniel C
Larson David E
Leonard Shawn M
Lolofie Justin T
Long Robert L
Lu Charles
Magrini Vincent J
Maher Christopher A
Maher Nicole
Mardis Elaine R
McLellan Michael D
McMichael Joshua F
Miller Christopher A
Mooney Thomas P
Morton David L
Nutter Nathaniel G
Oberkfell Ben J
Peck Joshua B
Pohl Craig S
Ramu Avinash
Regier Allison A
Sanderson Gabriel E
Schierding William S
Schroeder William E
Shi Xiaoqi
Skidmore Zachary L
Smith Scott M
Stiehr Gary
Walker Jason R
Weible James V
Weil Matthew R
Wilson Richard K
Wohlstadter Richard W
Wylie Todd N
Publication venue: Digital Commons@Becker
Publication date: 01/01/2015
Field of study

In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms

Crossref

Directory of Open Access Journals

Digital Commons@Becker

PubMed Central

The Francis Crick Institute

Mitochondrial genome copy number measured by DNA sequencing in human blood is strongly associated with metabolic traits via cell-type composition differences

Author: Abel Haley
Chen Lei
Christ Ryan
Das Indraniel
et al
Ganel Liron
Hall Ira M
Kanchi Krishna
Kang Chul Joo
Larson David
Locke Adam
Regier Allison
Scott Alexandra
Stitziel Nathan O
Young Erica
Publication venue: Digital Commons@Becker
Publication date: 07/06/2021
Field of study

BACKGROUND: Mitochondrial genome copy number (MT-CN) varies among humans and across tissues and is highly heritable, but its causes and consequences are not well understood. When measured by bulk DNA sequencing in blood, MT-CN may reflect a combination of the number of mitochondria per cell and cell-type composition. Here, we studied MT-CN variation in blood-derived DNA from 19184 Finnish individuals using a combination of genome (N = 4163) and exome sequencing (N = 19034) data as well as imputed genotypes (N = 17718). RESULTS: We identified two loci significantly associated with MT-CN variation: a common variant at the MYB-HBS1L locus (P = 1.6 × 10 CONCLUSION: These results suggest that measurements of MT-CN in blood-derived DNA partially reflect differences in cell-type composition and that these differences are causally linked to insulin and related traits

Digital Commons@Becker

Mitochondrial genome copy number measured by DNA sequencing in human blood is strongly associated with metabolic traits via cell-type composition differences

Author: Abel Haley
Boehnke Michael
Chen Lei
Chiang Charleston W. K.
Christ Ryan
Das Indraniel
Freimer Nelson
Ganel Liron
Hall Ira M.
Havulinna Aki
Kanchi Krishna
Kang Chul Joo
Kuusisto Johanna
Laakso Markku
Larson David
Locke Adam
Palotie Aarno
Regier Allison
Ripatti Samuli
Scott Alexandra
Service Susan
Stitziel Nathan O.
Vangipurapu Jagadish
Young Erica
Publication venue
Publication date: 01/06/2021
Field of study

Background Mitochondrial genome copy number (MT-CN) varies among humans and across tissues and is highly heritable, but its causes and consequences are not well understood. When measured by bulk DNA sequencing in blood, MT-CN may reflect a combination of the number of mitochondria per cell and cell-type composition. Here, we studied MT-CN variation in blood-derived DNA from 19184 Finnish individuals using a combination of genome (N = 4163) and exome sequencing (N = 19034) data as well as imputed genotypes (N = 17718). Results We identified two loci significantly associated with MT-CN variation: a common variant at the MYB-HBS1L locus (P = 1.6 x 10(-8)), which has previously been associated with numerous hematological parameters; and a burden of rare variants in the TMBIM1 gene (P = 3.0 x 10(-8)), which has been reported to protect against non-alcoholic fatty liver disease. We also found that MT-CN is strongly associated with insulin levels (P = 2.0 x 10(-21)) and other metabolic syndrome (metS)-related traits. Using a Mendelian randomization framework, we show evidence that MT-CN measured in blood is causally related to insulin levels. We then applied an MT-CN polygenic risk score (PRS) derived from Finnish data to the UK Biobank, where the association between the PRS and metS traits was replicated. Adjusting for cell counts largely eliminated these signals, suggesting that MT-CN affects metS via cell-type composition. Conclusion These results suggest that measurements of MT-CN in blood-derived DNA partially reflect differences in cell-type composition and that these differences are causally linked to insulin and related traits.Peer reviewe

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

Helsingin yliopiston digitaalinen arkisto

Deep Blue Documents

Mapping and characterization of structural variation in 17,795 human genomes.

Author: Abel Haley J
Buyske Steven
Chiang Colby
Das Indraniel
Dutcher Susan K
Hall Ira M
Kanchi Krishna L
Lander Eric S
Larson David E
Layer Ryan M
Matise Tara C
Muzny Donna M
Neale Benjamin M
NHGRI Centers for Common Disease Genomics
Reeves Catherine
Regier Allison A
Salerno William J
Stitziel Nathan O
Zody Michael C
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2020
Field of study

A key goal of whole-genome sequencing for studies of human genetics is to interrogate all forms of variation, including single-nucleotide variants, small insertion or deletion (indel) variants and structural variants. However, tools and resources for the study of structural variants have lagged behind those for smaller variants. Here we used a scalable pipeline1 to map and characterize structural variants in 17,795 deeply sequenced human genomes. We publicly release site-frequency data to create the largest, to our knowledge, whole-genome-sequencing-based structural variant resource so far. On average, individuals carry 2.9 rare structural variants that alter coding regions; these variants affect the dosage or structure of 4.2 genes and account for 4.0-11.2% of rare high-impact coding alleles. Using a computational model, we estimate that structural variants account for 17.2% of rare alleles genome-wide, with predicted deleterious effects that are equivalent to loss-of-function coding alleles; approximately 90% of such structural variants are noncoding deletions (mean 19.1 per genome). We report 158,991 ultra-rare structural variants and show that 2% of individuals carry ultra-rare megabase-scale structural variants, nearly half of which are balanced or complex rearrangements. Finally, we infer the dosage sensitivity of genes and noncoding elements, and reveal trends that relate to element class and conservation. This work will help to guide the analysis and interpretation of structural variants in the era of whole-genome sequencing

Crossref

Cold Spring Harbor Laboratory Institutional Repository

eScholarship - University of California

Genome-wide association studies of metabolites in Finnish men identify disease-relevant loci

Few studies have explored the impact of rare variants (minor allele frequency < 1%) on highly heritable plasma metabolites identified in metabolomic screens. The Finnish population provides an ideal opportunity for such explorations, given the multiple bottlenecks and expansions that have shaped its history, and the enrichment for many otherwise rare alleles that has resulted. Here, we report genetic associations for 1391 plasma metabolites in 6136 men from the late-settlement region of Finland. We identify 303 novel association signals, more than one third at variants rare or enriched in Finns. Many of these signals identify genes not previously implicated in metabolite genome-wide association studies and suggest mechanisms for diseases and disease-related traits

Carolina Digital Repository

svtools: population-scale analysis of structural variation

Author: Abhijit Badve
Colby Chiang
David E Larson
Haley J Abel
Indraniel Das
Ira M Hall
James M Eldred
Ryan M Layer
Publication venue: Oxford University Press (OUP)
Publication date: 20/06/2019
Field of study

Abstract Summary Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, and there is an urgent need to develop more efficient tools that scale to the size of human populations. Here, we present a fast and highly scalable software toolkit (svtools) and cloud-based pipeline for assembling high quality SV maps—including deletions, duplications, mobile element insertions, inversions and other rearrangements—in many thousands of human genomes. We show that this pipeline achieves similar variant detection performance to established per-sample methods (e.g. LUMPY), while providing fast and affordable joint analysis at the scale of ≥100 000 genomes. These tools will help enable the next generation of human genetics studies. Availability and implementation svtools is implemented in Python and freely available (MIT) from https://github.com/hall-lab/svtools. Supplementary information Supplementary data are available at Bioinformatics online. </jats:sec

Crossref

svtools: Population-scale analysis of structural variation

Author: Abel Haley J
Badve Abhijit
Chiang Colby
Das Indraniel
Eldred James M
Hall Ira M
Larson David E
Layer Ryan M
Publication venue: Digital Commons@Becker
Publication date: 01/11/2019
Field of study

SUMMARY: Large-scale human genetics studies are now employing whole genome sequencing with the goal of conducting comprehensive trait mapping analyses of all forms of genome variation. However, methods for structural variation (SV) analysis have lagged far behind those for smaller scale variants, and there is an urgent need to develop more efficient tools that scale to the size of human populations. Here, we present a fast and highly scalable software toolkit (svtools) and cloud-based pipeline for assembling high quality SV maps-including deletions, duplications, mobile element insertions, inversions and other rearrangements-in many thousands of human genomes. We show that this pipeline achieves similar variant detection performance to established per-sample methods (e.g. LUMPY), while providing fast and affordable joint analysis at the scale of ≥100 000 genomes. These tools will help enable the next generation of human genetics studies. AVAILABILITY AND IMPLEMENTATION: svtools is implemented in Python and freely available (MIT) from https://github.com/hall-lab/svtools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online

Digital Commons@Becker

Genealogy-based trait association with LOCATER boosts power at loci with allelic heterogeneity

Author: Aslett Louis J.M.
Belter Edward A.
Christ Ryan
Das Indraniel
Hall Ira M.
Kang Chul Joo
Laakso Markku
Steinsaltz David
Stitziel Nathan O.
Wang Xinxin
Young Erica
Publication venue: Cold Spring Harbor Laboratory Press
Publication date: 16/04/2026
Field of study

A key methodological challenge for genome-wide association studies is how to leverage haplotype diversity and allelic heterogeneity to improve trait association power, especially in noncoding regions where it is difficult to predict variant impacts and define functional units for variant aggregation. Genealogy-based association methods have the potential to bridge this gap by testing combinations of common and rare haplotypes based purely on their ancestral relationships. In parallel work, we have developed an efficient local ancestry inference engine and a novel statistical method (LOCATER) for combining signals present on different branches of a locus-specific haplotype tree. Here, we develop a genome-wide LOCATER analysis pipeline and apply it to a genome sequencing study of 6795 Finnish individuals with 101 cardiometabolic traits and 18.9 million autosomal variants. We identify 351 significant trait associations at 47 distinct genomic loci and find that LOCATER boosts the single marker test (SMT) association signal at five loci by combining independent signals from distinct alleles. LOCATER successfully recovers known quantitative trait loci not found by SMT, including LIPG, recovers known allelic heterogeneity at the APOE/C1/C4/C2 gene cluster, and suggests one novel association. We find that confounders have a more pronounced effect on genealogy-based methods than SMT, and we propose a new randomization approach and a general method for genomic control to eliminate their effects. This study demonstrates that genealogy-based methods such as LOCATER excel when multiple causal variants are present and suggests that their application to larger and more diverse cohorts will be fruitful

Durham Research Online

Germline genetic variation impacts clonal hematopoiesis landscape and progression to malignancy

Author: Bolton Kelly L
Cao Yin
Chan Irenaeus C C
Das Indraniel
Ding Li
et al.
Link Daniel C
Liu Jie
Tran Duc
Walter Matthew J
Wiley Brian J
Publication venue: Digital Commons@Becker
Publication date: 01/08/2025
Field of study

With age, clonal expansions occur pervasively across normal tissues yet only in rare instances lead to cancer, despite being driven by well-established cancer drivers. Characterization of the factors that influence clonal progression is needed to inform interventional approaches. Germline genetic variation influences cancer risk and shapes tumor mutational profile, but its influence on the mutational landscape of normal tissues is not well known. Here we studied the impact of germline genetic variation on clonal hematopoiesis (CH) in 731,835 individuals. We identified 22 new CH-predisposition genes, most of which predispose to CH driven by specific mutational events. CH-predisposition genes contribute to unique somatic landscapes, reflecting the influence of germline genetic backdrop on gene-specific CH fitness. Correspondingly, somatic-germline interactions influence the risk of CH progression to hematologic malignancies. These results demonstrate that germline genetic variation influences somatic evolution in the blood, findings that likely extend to other tissues

Digital Commons@Becker