17 research outputs found

A draft human pangenome reference

Author: Abel Haley J
Antonacci-Fulton Lucinda L
Cody Sarah
Fulton Robert S
Liao Wen-Wei
Regier Allison A
Tomlinson Chad
Wang Ting
et al.
Publication venue: Digital Commons@Becker
Publication date: 01/05/2023

Chromosome Xq23 Is Associated with Lower Atherogenic Lipid Concentrations and Favorable Cardiometabolic Indices

Author: Antonacci-Fulton Lucinda
Aragam Krishna
Arden Moscati
Arnett Donna K.
Aslibekyan Stella
Assimes Themistocles L.
Ballantyne Christie M.
Bielak Lawrence F.
Bis Joshua C.
Brody Jennifer A.
Broome Jai G.
Cade Brian E.
de Vries Paul S.
Graham Sarah E.
Honigberg Michael C.
Natarajan Pradeep
Pampana Akhil
Perry James A.
Pirruccello James P.
Ruotsalainen Sanni E.
Wolford Brooke
Publication venue: UKnowledge
Publication date: 12/04/2021

Autosomal genetic analyses of blood lipids have yielded key insights for coronary heart disease (CHD). However, X chromosome genetic variation is understudied for blood lipids in large sample sizes. We now analyze genetic and blood lipid data in a high-coverage whole X chromosome sequencing study of 65,322 multi-ancestry participants and perform replication among 456,893 European participants. Common alleles on chromosome Xq23 are strongly associated with reduced total cholesterol, LDL cholesterol, and triglycerides (min P = 8.5 × 10⁻⁷²), with similar effects for males and females. Chromosome Xq23 lipid-lowering alleles are associated with reduced odds for CHD among 42,545 cases and 591,247 controls (P = 1.7 × 10⁻⁴), and reduced odds for diabetes mellitus type 2 among 54,095 cases and 573,885 controls (P = 1.4 × 10⁻⁵). Although we observe an association with increased BMI, waist-to-hip ratio adjusted for BMI is reduced, bioimpedance analyses indicate increased gluteofemoral fat, and abdominal MRI analyses indicate reduced visceral adiposity. Co-localization analyses strongly correlate increased CHRDL1 gene expression, particularly in adipose tissue, with reduced concentrations of blood lipids.

University of Kentucky

Modernizing Reference Genome Assemblies

Crossref

Directory of Open Access Journals

PubMed Central

King's Research Portal

The Human Pangenome Project: a global resource to map genomic diversity

Author: Antonacci-Fulton Lucinda
Asri Mobin
Carson Caryn
Chaisson Mark JP
Chang Xian
Cook-Deegan Robert
Eichler Evan E
Felsenfeld Adam L
Flicek Paul
Fulton Robert S
Garrison Erik P
Garrison Nanibaa’ A
Graves-Lindsay Tina A
Hall Ira M
Haussler David
Howe Kerstin
Jarvis Erich D
Ji Hanlee
Kenny Eimear E
Koenig Barbara A
Lawson Heather A
Li Daofeng
Li Heng
Lucas Julian K
Marschall Tobias
McMichael Joshua F
Miga Karen H
Novak Adam M
Paten Benedict
Phillippy Adam M
Popejoy Alice B
Purushotham Deepak
Schneider Valerie A
Schultz Baergen I
Smith Michael W
Sofia Heidi J
Wang Ting
Weissman Tsachy
Publication venue: eScholarship, University of California
Publication date: 20/04/2022

The human reference genome is the most widely used resource in human genetics and is due for a major update. Its current structure is a linear composite of merged haplotypes from more than 20 people, with a single individual comprising most of the sequence. It contains biases and errors within a framework that does not represent global human genomic variation. A high-quality reference with global representation of common variants, including single-nucleotide variants, structural variants and functional elements, is needed. The Human Pangenome Reference Consortium aims to create a more sophisticated and complete human reference genome with a graph-based, telomere-to-telomere representation of global genomic diversity. Here we leverage innovations in technology, study design and global partnerships with the goal of constructing the highest-possible quality human pangenome reference. Our goal is to improve data representation and streamline analyses to enable routine assembly of complete diploid genomes. With attention to ethical frameworks, the human pangenome reference will contain a more accurate and diverse representation of global genomic variation, improve gene-disease association studies across populations, expand the scope of genomics research to the most repetitive and polymorphic regions of the genome, and serve as the ultimate genetic resource for future biomedical research and precision medicine

PubMed Central

eScholarship - University of California

A draft human pangenome reference

Author: Abel Haley J.
Abou Tayoun Ahmad
Antonacci-Fulton Lucinda L.
Asri Mobin
Baid Gunjan
Baker Carl A.
Belyaeva Anastasiya
Billis Konstantinos
Bourque Guillaume
Buonaiuto Silvia
Carroll Andrew
Chaisson Mark
Chang Pi-Chuan
Chang Xian H.
Cheng Haoyu
Chu Justin
Cody Sarah
Colonna Vincenza
Cook Daniel E.
Cook-Deegan Robert M.
Cornejo Omar E.
Diekhans Mark
Doerr Daniel
Ebert Peter
Ebler Jana
Eichler Evan E.
Eizenga Jordan
Fairley Susan
Fedrigo Olivier
Felsenfeld Adam L.
Feng Xiaowen
Fischer Christian
Flicek Paul
Formenti Giulio
Frankish Adam
Fulton Robert S.
Gao Yan
Garg Shilpa
Garrison Erik
Garrison Nanibaa' A.
Giron Carlos Garcia
Green Richard E.
Groza Cristian
Guarracino Andrea
Haggerty Leanne
Hall Ira M.
Harvey William T.
Haukness Marina
Haussler David
Heumos Simon
Hickey Glenn
Hoekzema Kendra
Hourlier Thibaut
Howe Kerstin
Jain Miten
Jarvis Erich
Ji Hanlee P.
Kenny Eimear E.
Koenig Barbara A.
Kolesnikov Alexey
Korbel Jan O.
Kordosky Jennifer
Koren Sergey
Lee HoJoon
Lewis Alexandra P.
Li Heng
Liao Wen-Wei
Lu Shuangjia
Lu Tsung-Yu
Lucas Julian K.
Magalhães Hugo
Marco-Sola Santiago
Marijon Pierre
Markello Charles
Marschall Tobias
Martin Fergal J.
McCartney Ann
McDaniel Jennifer
Miga Karen H.
Mitchell Matthew W.
Monlong Jean
Mountcastle Jacquelyn
Munson Katherine M.
Mwaniki Moses Njagi
Nattestad Maria
Novak Adam M.
Nurk Sergey
Olsen Hugh E.
Olson Nathan D.
Paten Benedict
Pesout Trevor
Phillippy Adam M.
Popejoy Alice B.
Porubsky David
Prins Pjotr
Puiu Daniela
Rautiainen Mikko
Regier Allison A.
Rhie Arang
Sacco Samuel
Sanders Ashley D.
Schneider Valerie A.
Schultz Baergen I.
Shafin Kishwar
Sibbesen Jonas A.
Sirén Jouni
Smith Michael W.
Sofia Heidi J.
Thibaud-Nissen Françoise
Tomlinson Chad
Tricomi Francesca Floriana
Villani Flavia
Vollger Mitchell R.
Wagner Justin
Walenz Brian
Wang Ting
Wood Jonathan M. D.
Zimin Aleksey V.
Zook Justin M.
Publication date: 01/01/2023

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.

Diposit Digital de Documents de la UAB

Human whole-exome genotype data for Alzheimer’s disease

Author: Adams Larry D.
Ahmad Shahzad
Amin Najaf
Antonacci-Fulton Lucinda
Appelbaum Elizabeth
Banks Eric
Barral Sandra
Beecham Gary
Beiser Alexa
Below Jennifer E.
Benchek Penelope
Bennett David A.
Bis Joshua C.
Blue Elizabeth
Booth Briana M.
Brkanac Zoran
Brown Lisa
Bush William S.
Butkiewicz Mariusz
Cantwell Laura
Chen Yuning
Choi Seung Hoan
Chou Yi Fan
Chung Jaeyoon
Clark Kaylyn
Cruchaga Carlos
Cuccaro Michael
Cupples L. Adrienne
Day Tyler
De Jager Phillip L.
Destefano Anita
Dinh Huyen
Doddapaneni Harsha
Dorschner Michael
Dugan-Perez Shannon
Dupuis Josee
English Adam
Faber Kelley
Farrell John
Farrer Lindsay
Feolo Michael
Foroud Tatiana
Fulton Robert S.
Gabriel Stacey
Gangadharan Prabhakaran
Gibbs Richard A.
Goate Alison
Gupta Namrata
Haines Jonathan
Hamilton-Nelson Kara
Han Yi
Haut Jacob
Horimoto Andrea R.
Hu Jianhong
Ikram M. Arfan
Iqbal Taha
Bressler Jan
Jayaseelan Joy
Jian Xueqiu
Jun Gyungah R.
Kalra Divya
Kapoor Manav
Khan Ziad
Koboldt Daniel C.
Korchina Viktoriya
Kunkle Brian
Kuzma Amanda B.
Larson David E.
Launer Lenore J.
Lee Sandra
Lee Wan Ping
Leung Yuk Yee
Lin Honghuang
Liu Ching Ti
Liu Xiuping
Liu Yue
Lunetta Kathy
Ma Yiyi
Malamon John
Marcora Edoardo
Martin Eden
Mayeux Richard P.
Mena Pedro
Mez Jesse
Mlynarski Elisabeth
Mosley Thomas H.
Muzny Donna
Nafikov Rafael
Naj Adam C.
Nasser Waleed
Nato Alejandro Q.
Navas Pat
Nguyen Hiep
Nicaretta Heather
Pericak-Vance Margaret
Psaty Bruce
Qu Liming
Rajabli Farid
Reitz Christiane
Renton Alan
Reyes-Dumeyer Dolly
Rice Kenneth
Saad Mohamad
Salerno William
Santibanez Jireh
Satizabal Claudia
Schellenberg Gerard D.
Schmidt Helena
Schmidt Michael
Schmidt Reinhold
Seshadri Sudha
Sha Jin
Skinner Evette
Smieszek Sandra
Sohi Harkirat
Song Yeunjoo
Stine Adam
Sun Fangui Jenny
Thornton Timothy
Tosto Giuseppe
Tsuang Debby
Valladares Otto
van der Lee Sven
van Duijn Cornelia
Vance Jeffrey M.
Vanderspek Ashley
Vardarajan Badri
Waligorski Jason
Wang Bowen
Wang Li San
Wheeler Nicholas
Wijsman Ellen
Wilson Richard K.
Witten Daniela
Worley Kim
Xia Li Charlie
Zhang Nancy
Zhang Xiaoling
Zhao Yi
Zhu Congcong
Zhu Yiming
Publication date: 23/01/2024

The heterogeneity of the whole-exome sequencing (WES) data generation methods presents a challenge to a joint analysis. Here we present a bioinformatics strategy for joint-calling 20,504 WES samples collected across nine studies and sequenced using ten capture kits in fourteen sequencing centers in the Alzheimer’s Disease Sequencing Project. The joint-genotype called variant call format (VCF) file contains only positions within the union of capture kits. The VCF was then processed specifically to account for the batch effects arising from the use of different capture kits from different studies. We identified 8.2 million autosomal variants. 96.82% of the variants are high-quality, and are located in 28,579 Ensembl transcripts. 41% of the variants are intronic and 1.8% of the variants have CADD > 30, indicating they are of high predicted pathogenicity. Here we show our new strategy can generate high-quality data from processing these diversely generated WES samples. The improved ability to combine data sequenced in different batches benefits the whole genomics research community.

EUR Research Repository

Gaps and complex structurally variant loci in phased genome assemblies

Author: Abel Haley J
Antonacci-Fulton Lucinda L
Asri Mobin
Baid Gunjan
Baker Carl A
Belyaeva Anastasiya
Billis Konstantinos
Bourque Guillaume
Buonaiuto Silvia
Carroll Andrew
Chaisson Mark JP
Chang Pi-Chuan
Chang Xian H
Cheng Haoyu
Chu Justin
Cody Sarah
Colonna Vincenza
Human Pangenome Reference Consortium
Cook Daniel E
Cook-Deegan Robert M
Cornejo Omar E
Diekhans Mark
Doerr Daniel
Ebert Peter
Ebler Jana
Eichler Evan E
Eizenga Jordan M
Fairley Susan
Fedrigo Olivier
Felsenfeld Adam L
Feng Xiaowen
Fischer Christian
Flicek Paul
Formenti Giulio
Frankish Adam
Fulton Robert S
Gao Yan
Garg Shilpa
Garrison Erik
Garrison Nanibaa’ A
Giron Carlos Garcia
Green Richard E
Groza Cristian
Guarracino Andrea
Haggerty Leanne
Hall Ira M
Harvey William T
Hasenfeld Patrick
Haukness Marina
Haussler David
Heumos Simon
Hickey Glenn
Hoekzema Kendra
Hourlier Thibaut
Howe Kerstin
Jain Miten
Jarvis Erich D
Ji Hanlee P
Kenny Eimear E
Koenig Barbara A
Kolesnikov Alexey
Korbel Jan O
Kordosky Jennifer
Koren Sergey
Lee HoJoon
Lewis Alexandra P
Li Heng
Liao Wen-Wei
Lu Shuangjia
Lu Tsung-Yu
Lucas Julian K
Magalhães Hugo
Marco-Sola Santiago
Marijon Pierre
Markello Charles
Marschall Tobias
Martin Fergal J
McCartney Ann
McDaniel Jennifer
Miga Karen H
Mitchell Matthew W
Monlong Jean
Mountcastle Jacquelyn
Munson Katherine M
Mwaniki Moses Njagi
Nattestad Maria
Novak Adam M
Nurk Sergey
Paten Benedict
Porubsky David
Rozanski Allison N
Sanders Ashley D
Stober Catherine
Vollger Mitchell R
Publication venue: eScholarship, University of California
Publication date: 01/04/2023

There has been tremendous progress in phased genome assembly production by combining long-read data with parental information or linked-read data. Nevertheless, a typical phased genome assembly generated by trio-hifiasm still generates more than 140 gaps. We perform a detailed analysis of gaps, assembly breaks, and misorientations from 182 haploid assemblies obtained from a diversity panel of 77 unique human samples. Although trio-based approaches using HiFi are the current gold standard, chromosome-wide phasing accuracy is comparable when using Strand-seq instead of parental data. Importantly, the majority of assembly gaps cluster near the largest and most identical repeats (including segmental duplications [35.4%], satellite DNA [22.3%], or regions enriched in GA/AT-rich DNA [27.4%]). Consequently, 1513 protein-coding genes overlap assembly gaps in at least one haplotype, and 231 are recurrently disrupted or missing from five or more haplotypes. Furthermore, we estimate that 6-7 Mbp of DNA are misorientated per haplotype irrespective of whether trio-free or trio-based approaches are used. Of these misorientations, 81% correspond to bona fide large inversion polymorphisms in the human species, most of which are flanked by large segmental duplications. We also identify large-scale alignment discontinuities consistent with 11.9 Mbp of deletions and 161.4 Mbp of insertions per haploid genome. Although 99% of this variation corresponds to satellite DNA, we identify 230 regions of euchromatic DNA with frequent expansions and contractions, nearly half of which overlap with 197 protein-coding genes. Such variable and incompletely assembled regions are important targets for future algorithmic development and pangenome representation

eScholarship - University of California

A draft human pangenome reference.

Author: Abel Haley J
Abou Tayoun Ahmad N
Antonacci-Fulton Lucinda L
Asri Mobin
Baid Gunjan
Baker Carl A
Belyaeva Anastasiya
Billis Konstantinos
Buonaiuto Silvia
Carroll Andrew
Chang Pi-Chuan
Chang Xian H
Cheng Haoyu
Chu Justin
Cody Sarah
Colonna Vincenza
Cook Daniel E
Cook-Deegan Robert M
Cornejo Omar E
Diekhans Mark
Doerr Daniel
Ebert Peter
Ebler Jana
Eizenga Jordan M
Fairley Susan
Fedrigo Olivier
Felsenfeld Adam L
Feng Xiaowen
Fischer Christian
Formenti Giulio
Frankish Adam
Fulton Robert S
Gao Yan
Garg Shilpa
Garrison Nanibaa' A
Giron Carlos Garcia
Green Richard E
Groza Cristian
Guarracino Andrea
Haggerty Leanne
Harvey William T
Haukness Marina
Heumos Simon
Hickey Glenn
Hoekzema Kendra
Hourlier Thibaut
Howe Kerstin
Jain Miten
Ji Hanlee P
Kenny Eimear E
Koenig Barbara A
Kolesnikov Alexey
Korbel Jan O
Kordosky Jennifer
Koren Sergey
Lee HoJoon
Lewis Alexandra P
Liao Wen-Wei
Lu Shuangjia
Lu Tsung-Yu
Lucas Julian K
Magalhães Hugo
Marco-Sola Santiago
Marijon Pierre
Markello Charles
Martin Fergal J
McCartney Ann
McDaniel Jennifer
Mitchell Matthew W
Monlong Jean
Mountcastle Jacquelyn
Munson Katherine M
Mwaniki Moses Njagi
Nattestad Maria
Novak Adam M
Nurk Sergey
Olsen Hugh E
Olson Nathan D
Pesout Trevor
Popejoy Alice B
Porubsky David
Prins Pjotr
Puiu Daniela
Rautiainen Mikko
Regier Allison A
Rhie Arang
Sacco Samuel
Sanders Ashley D
Schneider Valerie A
Schultz Baergen I
Shafin Kishwar
Sibbesen Jonas A
Sirén Jouni
Smith Michael W
Sofia Heidi J
Thibaud-Nissen Françoise
Tomlinson Chad
Tricomi Francesca Floriana
Villani Flavia
Vollger Mitchell R
Publication venue: eScholarship, University of California
Publication date: 01/05/2023

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.

eScholarship - University of California

A draft human pangenome reference

Author: Abel Haley J.
Abou Tayoun Ahmad N.
Antonacci-Fulton Lucinda L.
Asri Mobin
Baid Gunjan
Baker Carl A.
Belyaeva Anastasiya
Billis Konstantinos
Bourque Guillaume
Buonaiuto Silvia
Carroll Andrew
Chaisson Mark J.P.
Chang Pi Chuan
Chang Xian H.
Cheng Haoyu
Chu Justin
Cody Sarah
Colonna Vincenza
Cook Daniel E.
Cook-Deegan Robert M.
Cornejo Omar E.
Diekhans Mark
Doerr Daniel
Ebert Peter
Ebler Jana
Eichler Evan E.
Eizenga Jordan M.
Fairley Susan
Fedrigo Olivier
Felsenfeld Adam L.
Feng Xiaowen
Fischer Christian
Flicek Paul
Formenti Giulio
Frankish Adam
Fulton Robert S.
Gao Yan
Garg Shilpa
Garrison Erik
Garrison Nanibaa’ A.
Giron Carlos Garcia
Green Richard E.
Groza Cristian
Guarracino Andrea
Haggerty Leanne
Hall Ira M.
Harvey William T.
Haukness Marina
Haussler David
Heumos Simon
Hickey Glenn
Hoekzema Kendra
Hourlier Thibaut
Howe Kerstin
Jain Miten
Jarvis Erich D.
Ji Hanlee P.
Kenny Eimear E.
Koenig Barbara A.
Kolesnikov Alexey
Korbel Jan O.
Kordosky Jennifer
Koren Sergey
Lee Ho Joon
Lewis Alexandra P.
Li Heng
Liao Wen Wei
Lu Shuangjia
Lu Tsung Yu
Lucas Julian K.
Magalhães Hugo
Marco-Sola Santiago
Marijon Pierre
Markello Charles
Marschall Tobias
Martin Fergal J.
McCartney Ann
McDaniel Jennifer
Miga Karen H.
Mitchell Matthew W.
Monlong Jean
Mountcastle Jacquelyn
Munson Katherine M.
Mwaniki Moses Njagi
Nattestad Maria
Novak Adam M.
Nurk Sergey
Olsen Hugh E.
Olson Nathan D.
Paten Benedict
Pesout Trevor
Phillippy Adam M.
Popejoy Alice B.
Porubsky David
Prins Pjotr
Puiu Daniela
Rautiainen Mikko
Regier Allison A.
Rhie Arang
Sacco Samuel
Sanders Ashley D.
Schneider Valerie A.
Schultz Baergen I.
Shafin Kishwar
Sibbesen Jonas A.
Sirén Jouni
Smith Michael W.
Sofia Heidi J.
Thibaud-Nissen Françoise
Tomlinson Chad
Tricomi Francesca Floriana
Villani Flavia
Vollger Mitchell R.
Wagner Justin
Walenz Brian
Wang Ting
Wood Jonathan M.D.
Zimin Aleksey V.
Zook Justin M.
Publication date: 01/01/2023

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.

Online Research Database In Technology