59,461 research outputs found
Clinical exome performance for reporting secondary genetic findings.
BACKGROUND
:
Reporting clinically actionable incidental
genetic findings in the course of clinical exome testing is
recommended by the American College of Medical Genet-
ics and Genomics (ACMG). However, the performance of
clinical exome methods for reporting small subsets of genes
has not been previously reported.
METHODS
:
In this study, 57 exome data sets performed as
clinical (n
!
12) or research (n
!
45) tests were retrospec-
tively analyzed. Exome sequencing data was examined for
adequacy in the detection of potentially pathogenic variant
locations in the 56 genes described in the ACMG incidental
findings recommendation. All exons of the 56 genes were
examined for adequacy of sequencing coverage. In addition,
nucleotide positions annotated in HGMD (Human Gene
Mutation Database) were examined.
RESULTS
:
The 56 ACMG genes have 18336 nucleotide
variants annotated in HGMD. None of the 57 exome
data sets possessed a HGMD variant. The clinical exome
test had inadequate coverage for
"
50% of HGMD vari-
ant locations in 7 genes. Six exons from 6 different genes
had consistent failure across all 3 test methods; these
exons had high GC content (76%â84%).
CONCLUSIONS
:
The use of clinical exome sequencing
for the interpretation and reporting of subsets of genes
requires recognition of the substantial possibility of
inadequate depth and breadth of sequencing coverage
at clinically relevant locations. Inadequate depth of
coverage may contribute to false-negative clinical ex-
ome results
Development and Validation of Clinical Whole-Exome and Whole-Genome Sequencing for Detection of Germline Variants in Inherited Disease
Context.-With the decrease in the cost of sequencing, the clinical testing paradigm has shifted from single gene to gene panel and now whole-exome and whole-genome sequencing. Clinical laboratories are rapidly implementing next-generation sequencing-based whole-exome and whole-genome sequencing. Because a large number of targets are covered by whole-exome and whole-genome sequencing, it is critical that a laboratory perform appropriate validation studies, develop a quality assurance and quality control program, and participate in proficiency testing. Objective.-To provide recommendations for wholeexome and whole-genome sequencing assay design, validation, and implementation for the detection of germline variants associated in inherited disorders. Data Sources.-An example of trio sequencing, filtration and annotation of variants, and phenotypic consideration to arrive at clinical diagnosis is discussed. Conclusions.-It is critical that clinical laboratories planning to implement whole-exome and whole-genome sequencing design and validate the assay to specifications and ensure adequate performance prior to implementation. Test design specifications, including variant filtering and annotation, phenotypic consideration, guidance on consenting options, and reporting of incidental findings, are provided. These are important steps a laboratory must take to validate and implement whole-exome and whole-genome sequencing in a clinical setting for germline variants in inherited disorders
Quantifying single nucleotide variant detection sensitivity in exome sequencing
BACKGROUND: The targeted capture and sequencing of genomic regions has rapidly demonstrated its utility in genetic studies. Inherent in this technology is considerable heterogeneity of target coverage and this is expected to systematically impact our sensitivity to detect genuine polymorphisms. To fully interpret the polymorphisms identified in a genetic study it is often essential to both detect polymorphisms and to understand where and with what probability real polymorphisms may have been missed. RESULTS: Using down-sampling of 30 deeply sequenced exomes and a set of gold-standard single nucleotide variant (SNV) genotype calls for each sample, we developed an empirical model relating the read depth at a polymorphic site to the probability of calling the correct genotype at that site. We find that measured sensitivity in SNV detection is substantially worse than that predicted from the naive expectation of sampling from a binomial. This calibrated model allows us to produce single nucleotide resolution SNV sensitivity estimates which can be merged to give summary sensitivity measures for any arbitrary partition of the target sequences (nucleotide, exon, gene, pathway, exome). These metrics are directly comparable between platforms and can be combined between samples to give âpower estimatesâ for an entire study. We estimate a local read depth of 13X is required to detect the alleles and genotype of a heterozygous SNV 95% of the time, but only 3X for a homozygous SNV. At a mean on-target read depth of 20X, commonly used for rare disease exome sequencing studies, we predict 5â15% of heterozygous and 1â4% of homozygous SNVs in the targeted regions will be missed. CONCLUSIONS: Non-reference alleles in the heterozygote state have a high chance of being missed when commonly applied read coverage thresholds are used despite the widely held assumption that there is good polymorphism detection at these coverage levels. Such alleles are likely to be of functional importance in population based studies of rare diseases, somatic mutations in cancer and explaining the âmissing heritabilityâ of quantitative traits
Mutation Clusters from Cancer Exome
We apply our statistically deterministic machine learning/clustering
algorithm *K-means (recently developed in https://ssrn.com/abstract=2908286) to
10,656 published exome samples for 32 cancer types. A majority of cancer types
exhibit mutation clustering structure. Our results are in-sample stable. They
are also out-of-sample stable when applied to 1,389 published genome samples
across 14 cancer types. In contrast, we find in- and out-of-sample
instabilities in cancer signatures extracted from exome samples via nonnegative
matrix factorization (NMF), a computationally costly and non-deterministic
method. Extracting stable mutation structures from exome data could have
important implications for speed and cost, which are critical for early-stage
cancer diagnostics such as novel blood-test methods currently in development.Comment: 84 page
Immune DNA signature of T-cell infiltration in breast tumor exomes.
Tumor infiltrating lymphocytes (TILs) have been associated with favorable prognosis in multiple tumor types. The Cancer Genome Atlas (TCGA) represents the largest collection of cancer molecular data, but lacks detailed information about the immune environment. Here, we show that exome reads mapping to the complementarity-determining-region 3 (CDR3) of mature T-cell receptor beta (TCRB) can be used as an immune DNA (iDNA) signature. Specifically, we propose a method to identify CDR3 reads in a breast tumor exome and validate it using deep TCRB sequencing. In 1,078 TCGA breast cancer exomes, the fraction of CDR3 reads was associated with TILs fraction, tumor purity, adaptive immunity gene expression signatures and improved survival in Her2+ patients. Only 2/839 TCRB clonotypes were shared between patients and none associated with a specific HLA allele or somatic driver mutations. The iDNA biomarker enriches the comprehensive dataset collected through TCGA, revealing associations with other molecular features and clinical outcomes
Missense-depleted regions in population exomes implicate ras superfamily nucleotide-binding protein alteration in patients with brain malformation.
Genomic sequence interpretation can miss clinically relevant missense variants for several reasons. Rare missense variants are numerous in the exome and difficult to prioritise. Affected genes may also not have existing disease association. To improve variant prioritisation, we leverage population exome data to identify intragenic missense-depleted regions (MDRs) genome-wide that may be important in disease. We then use missense depletion analyses to help prioritise undiagnosed disease exome variants. We demonstrate application of this strategy to identify a novel gene association for human brain malformation. We identified de novo missense variants that affect the GDP/GTP-binding site of ARF1 in three unrelated patients. Corresponding functional analysis suggests ARF1 GDP/GTP-activation is affected by the specific missense mutations associated with heterotopia. These findings expand the genetic pathway underpinning neurologic disease that classically includes FLNA. ARF1 along with ARFGEF2 add further evidence implicating ARF/GEFs in the brain. Using functional ontology, top MDR-containing genes were highly enriched for nucleotide-binding function, suggesting these may be candidates for human disease. Routine consideration of MDR in the interpretation of exome data for rare diseases may help identify strong genetic factors for many severe conditions, infertility/reduction in reproductive capability, and embryonic conditions contributing to preterm loss
Recommended from our members
Case report: targeted whole exome sequencing enables the first prenatal diagnosis of the lethal skeletal dysplasia Osteocraniostenosis.
BACKGROUND: Osteocraniostenosis (OCS) is a rare genetic disorder characterised by premature closure of cranial sutures, gracile bones and perinatal lethality. Previously, diagnosis has only been possible postnatally on clinical and radiological features. This study describes the first prenatal diagnosis of OCS. CASE PRESENTATION: In this case prenatal ultrasound images were suggestive of a serious but non-lethal skeletal dysplasia. Due to the uncertain prognosis the parents were offered Whole Exome Sequencing (WES), which identified a specific gene mutation in the FAMIIIa gene. This mutation had previously been detected in two cases and was lethal in both perinatally. This established the diagnosis, a clear prognosis and allowed informed parental choice regarding ongoing pregnancy management. CONCLUSIONS: This case report supports the use of targeted WES prenatally to confirm the underlying cause and prognosis of sonographically suspected abnormalities
Analysis of Archived Residual Newborn Screening Blood Spots After Whole Genome Amplification
Deidentified newborn screening bloodspot samples (NBS) represent a valuable potential resource for genomic research if impediments to whole exome sequencing of NBS deoxyribonucleic acid (DNA), including the small amount of genomic DNA in NBS material, can be overcome. For instance, genomic analysis of NBS could be used to define allele frequencies of disease-associated variants in local populations, or to conduct prospective or retrospective studies relating genomic variation to disease emergence in pediatric populations over time. In this study, we compared the recovery of variant calls from exome sequences of amplified NBS genomic DNA to variant calls from exome sequencing of non-amplified NBS DNA from the same individuals. Results: Using a standard alignment-based Genome Analysis Toolkit (GATK), we find 62,000-76,000 additional variants in amplified samples. After application of a unique kmer enumeration and variant detection method (RUFUS), only 38,000-47,000 additional variants are observed in amplified gDNA. This result suggests that roughly half of the amplification-introduced variants identified using GATK may be the result of mapping errors and read misalignment. Conclusions: Our results show that it is possible to obtain informative, high-quality data from exome analysis of whole genome amplified NBS with the important caveat that different data generation and analysis methods can affect variant detection accuracy, and the concordance of variant calls in whole-genome amplified and non-amplified exomes.National Institute of Health P01HD067244, NS076465, R01ES021006Nutritional Science
- âŠ