Search CORE

9 research outputs found

The Use of Non-Variant Sites to Improve the Clinical Assessment of Whole-Genome Sequence Data

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date: 01/01/2015
Field of study

<div>Genetic testing, which is now a routine part of clinical practice and disease management protocols, is often based on the assessment of small panels of variants or genes. On the other hand, continuous improvements in the speed and per-base costs of sequencing have now made whole exome sequencing (WES) and whole genome sequencing (WGS) viable strategies for targeted or complete genetic analysis, respectively. Standard WGS/WES data analytical workflows generally rely on calling of sequence variants respect to the reference genome sequence. However, the reference genome sequence contains a large number of sites represented by rare alleles, by known pathogenic alleles and by alleles strongly associated to disease by GWAS. It’s thus critical, for clinical applications of WGS and WES, to interpret whether non-variant sites are homozygous for the reference allele or if the corresponding genotype cannot be reliably called. Here we show that an alternative analytical approach based on the analysis of both variant and non-variant sites from WGS data allows to genotype more than 92% of sites corresponding to known SNPs compared to 6% genotyped by standard variant analysis. These include homozygous reference sites of clinical interest, thus leading to a broad and comprehensive characterization of variation necessary to an accurate evaluation of disease risk. Altogether, our findings indicate that characterization of both variant and non-variant clinically informative sites in the genome is necessary to allow an accurate clinical assessment of a personal genome. Finally, we propose a highly efficient extended VCF (eVCF) file format which allows to store genotype calls for sites of clinical interest while remaining compatible with current variant interpretation software.</div

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Catalogo dei prodotti della ricerca

The Francis Crick Institute

Comparison of the content and size of different standard file formats for the storage of genomic data.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Comparison of the content and size of different standard file formats for the storage of genomic data.</p

The Francis Crick Institute

Compatibility of the eVCF file format with different variation analysis suites.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Compatibility of the eVCF file format with different variation analysis suites.</p

The Francis Crick Institute

Genotyping of known SNPs from dbSNP 141 using the VCF and gVCF file formats and the number of homozygous reference sites and no-calls based on WGS data.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Genotyping of known SNPs from dbSNP 141 using the VCF and gVCF file formats and the number of homozygous reference sites and no-calls based on WGS data.</p

The Francis Crick Institute

Genotyping of known SNPs from ClinVar using the VCF and gVCF file formats and the number of homozygous reference sites and no-calls based on WGS data.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Genotyping of known SNPs from ClinVar using the VCF and gVCF file formats and the number of homozygous reference sites and no-calls based on WGS data.</p

The Francis Crick Institute

Concordance of genotypes represented in VCF and gVCF files with those detected by the MI RISK Plus kit.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Concordance of genotypes represented in VCF and gVCF files with those detected by the MI RISK Plus kit.</p

The Francis Crick Institute

Genotyping of GWAS catalog sites using the VCF and gVCF file formats and the number of homozygous reference sites and no-calls based on WGS data.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Genotyping of GWAS catalog sites using the VCF and gVCF file formats and the number of homozygous reference sites and no-calls based on WGS data.</p

The Francis Crick Institute

Exonic regions coverage.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Percentage of exonic regions covered at a read depth ≥ 5, an alignment score ≥ 10, a basecall quality ≥ 10 from WGS subsets of the original full set with different average X-fold coverage values.</p

The Francis Crick Institute

Comparison of the number of dbSNP, ClinVar and GWAScat sites represented using VCF, gVCF and eVCF files.

Author: Alberto Ferrarini (133939)
Benjamin A. Salisbury (765937)
Cesare Centomo (765934)
Chiara Cantaloni (765933)
Claudio Franceschi (68521)
Francesca Griggio (66872)
John Max Harvey (765938)
Julien Marquis (765936)
Luciano Xumerle (765931)
Marianna Garonzi (765932)
Massimo Delledonne (133941)
Paolo Garagnani (414846)
Patrick Descombes (16839)
Sebastiano Collino (385873)
Sergio Marin Vargas (765935)
Publication venue
Publication date
Field of study

Comparison of the number of dbSNP, ClinVar and GWAScat sites represented using VCF, gVCF and eVCF files.</p

The Francis Crick Institute