Search CORE

2 research outputs found

precisionFDA Truth Challenge V2: Calling variants from short- and long-reads in difficult-to-map regions

Author: Ahsan Mian Umair
Arslan Elif
Baid Gunjan
Boja Emily
Bourgey Mathieu
Bourque Guillaume
Brown Richard
Brueffer Christian
Budak Gungor
Carroll Andrew
Catreux Severine
Chang Pi-Chuan
Chen Luoqi
Demirkaya-Budak Sinem
Dolgoborodov Alexey
DU YuanPing
Eveleigh Robert
Fang Li Tai
Feng Hanying
Flores Carlos
Goel Sidharth
Hung Calvin
Jain Amit
Jain Chirag
Jain Miten
Jain Varun
Johanson Elaine
Johnson Ivan J.
Jáspez David
Kabakci-Zorlu Duygu
Kalay Özem
Kolesnikov Alexey
Kyriakidis Konstantinos
Lajoie Bryan
Li Gen
Li Zhipan
Liu Qian
Lorenzo-Salazar José M.
MA ChouXian
Maier Ezekiel J.
Malousi Andigoni
McDaniel Jennifer
Mehio Rami
Mohiyuddin Marghoob
Morata Jordi
Muñoz-Barrera Adrián
Narcı Kübra
Nattestad Maria
Olson Nathan D.
Parra Genís
Paten Benedict
Pesout Trevor
Prasanna Anish G.
Roddey Cooper
Rubio-Rodríguez Luis A.
Ruehle Mike
Sahraeian Sayed Mohammad Ebrahim
Sedlazeck Fritz J.
Semenyuk Vladimir
Serang Omar
Shafin Kishwar
Stephens Sarah H.
Tang LinQi
Tetikol H. Serhat
Tonda Raúl
Trotta Jean-Rémi
Turgut Deniz
Wagner Justin
Wang Kai
Westreich Samuel T.
Yang Howard
Zhang ShaoWei
Zook Justin M
Publication venue: 'Cold Spring Harbor Laboratory'
Publication date: 15/11/2020
Field of study

The precisionFDA Truth Challenge V2 aimed to assess the state-of-the-art of variant calling in difficult-to-map regions and the Major Histocompatibility Complex (MHC). Starting with FASTQ files, 20 challenge participants applied their variant calling pipelines and submitted 64 variant callsets for one or more sequencing technologies (~35X Illumina, ~35X PacBio HiFi, and ~50X Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with the new GIAB benchmark sets and genome stratifications. Challenge submissions included a number of innovative methods for all three technologies, with graph-based and machine-learning methods scoring best for short-read and long-read datasets, respectively. New methods out-performed the 2016 Truth Challenge winners, and new machine-learning approaches combining multiple sequencing technologies performed particularly well. Recent developments in sequencing and variant calling have enabled benchmarking variants in challenging genomic regions, paving the way for the identification of previously unknown clinically relevant variants

Lund University Publications

PrecisionFDA Truth Challenge V2: Calling variants from short and long reads in difficult-to-map regions

Author: Ahsan Mian Umair
Arslan Elif
Baid Gunjan
Boja Emily
Bourgey Mathieu
Bourque Guillaume
Brown Richard
Brueffer Christian
Budak Gungor
Carroll Andrew
Catreux Severine
Chang Pi-Chuan
Chen Luoqi
Demirkaya-Budak Sinem
Dolgoborodov Alexey
DU YuanPing
Eveleigh Robert
Fang Li Tai
Feng Hanying
Flores Carlos
Goel Sidharth
Hung Calvin
Jain Amit
Jain Chirag
Jain Miten
Jain Varun
Johanson Elaine
Johnson Ivan J.
Jáspez David
Kabakci-Zorlu Duygu
Kalay Özem
Kolesnikov Alexey
Kyriakidis Konstantinos
Lajoie Bryan
Li Gen
Li Zhipan
Liu Qian
Lorenzo-Salazar José M.
MA ChouXian
Maier Ezekiel J.
Malousi Andigoni
McDaniel Jennifer
Mehio Rami
Mohiyuddin Marghoob
Morata Jordi
Muñoz-Barrera Adrián
Narcı Kübra
Nattestad Maria
Olson Nathan D.
Parra Genís
Paten Benedict
Pesout Trevor
Prasanna Anish G.
Roddey Cooper
Rubio-Rodríguez Luis A.
Ruehle Mike
Sahraeian Sayed Mohammad Ebrahim
Sedlazeck Fritz J.
Semenyuk Vladimir
Serang Omar
Shafin Kishwar
Stephens Sarah H.
Tang LinQi
Tetikol H. Serhat
Tonda Raúl
Trotta Jean-Rémi
Turgut Deniz
Wagner Justin
Wang Kai
Westreich Samuel T.
Yang Howard
Zhang ShaoWei
Zook Justin M
Publication venue: 'Elsevier BV'
Publication date: 27/04/2022
Field of study

The precisionFDA Truth Challenge V2 aimed to assess the state of the art of variant calling in challenging genomic regions. Starting with FASTQs, 20 challenge participants applied their variant-calling pipelines and submitted 64 variant call sets for one or more sequencing technologies (Illumina, PacBio HiFi, and Oxford Nanopore Technologies). Submissions were evaluated following best practices for benchmarking small variants with updated Genome in a Bottle benchmark sets and genome stratifications. Challenge submissions included numerous innovative methods, with graph-based and machine learning methods scoring best for short-read and long-read datasets, respectively. With machine learning approaches, combining multiple sequencing technologies performed particularly well. Recent developments in sequencing and variant calling have enabled benchmarking variants in challenging genomic regions, paving the way for the identification of previously unknown clinically relevant variants

Lund University Publications

PubMed Central