142 research outputs found
Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2
Copy Number Variants (CNVs) are structural rear- rangements contributing to phenotypic variation that have been proved to be associated with many dis- ease states. Over the last years, the identification of CNVs from whole-exome sequencing (WES) data has become a common practice for research and clinical purpose and, consequently, the demand for more and more efficient and accurate methods has increased. In this paper, we demonstrate that more than 30% of WES data map outside the targeted re- gions and that these reads, usually discarded, can be exploited to enhance the identification of CNVs from WES experiments. Here, we present EXCAVATOR2, the first read count based tool that exploits all the reads produced by WES experiments to detect CNVs with a genome-wide resolution. To evaluate the per- formance of our novel tool we use it for analysing two WES data sets, a population data set sequenced by the 1000 Genomes Project and a tumor data set made of bladder cancer samples. The results obtained from these analyses demonstrate that EXCAVATOR2 out- performs other four state-of-the-art methods and that our combined approach enlarge the spectrum of detectable CNVs from WES data with an unprece- dented resolution
A Novel null homozygous mutation confirms <i>CACNA2D2</i> as a gene mutated in epileptic encephalopathy
Contribution to epileptic encephalopathy (EE) of mutations in CACNA2D2, encoding α2δ-2 subunit of Voltage Dependent Calcium Channels, is unclear. To date only one CACNA2D2 mutation altering channel functionality has been identified in a single family. In the same family, a rare CELSR3 polymorphism also segregated with disease. Involvement of CACNA2D2 in EE
is therefore not confirmed, while that of CELSR3 is questionable. In a patient with epilepsy, dyskinesia, cerebellar atrophy, psychomotor delay and dysmorphic features, offspring to consanguineous parents, we performed whole exome sequencing (WES) for homozygosity mapping and mutation detection. WES identified extended autozygosity on
chromosome 3, containing two novel homozygous candidate mutations: c.1295delA (p.Asn432fs) in CACNA2D2 and
c.G6407A (p.Gly2136Asp) in CELSR3. Gene prioritization pointed to CACNA2D2 as the most prominent candidate gene. The WES finding in CACNA2D2 resulted to be statistically significant (p = 0.032), unlike that in CELSR3. CACNA2D2 homozygous c.1295delA essentially abolished α2δ-2 expression. In summary, we identified a novel null CACNA2D2 mutation associated to a clinical phenotype strikingly similar to the Cacna2d2 null mouse model. Molecular and statistical analyses together argued
in favor of a causal contribution of CACNA2D2 mutations to EE, while suggested that finding in CELSR3, although potentially damaging, is likely incidental
Is Focal Cortical Dysplasia/Epilepsy Caused by Somatic MTOR Mutations Always a Unilateral Disorder?
To alert about the wide margin of unpredictability that distribution of somatic MTOR mosaicism may have in the brain and the risk for independent epileptogenesis arising from the seemingly healthy contralateral hemisphere after complete removal of epileptogenic focal cortical dysplasia (FCD)
Characterization and identification of hidden rare variants in the human genome
BackgroundBy examining the genotype calls generated by the 1000 Genomes Project we discovered that the human reference genome GRCh37 contains almost 20,000 loci in which the reference allele has never been observed in healthy individuals and around 70,000 loci in which it has been observed only in the heterozygous state.ResultsWe show that a large fraction of this rare reference allele (RRA) loci belongs to coding, functional and regulatory elements of the genome and could be linked to rare Mendelian disorders as well as cancer. We also demonstrate that classical germline and somatic variant calling tools are not capable to recognize the rare allele when present in these loci. To overcome such limitations, we developed a novel tool, named RAREVATOR, that is able to identify and call the rare allele in these genomic positions. By using a small cancer dataset we compared our tool with two state-of-the-art callers and we found that RAREVATOR identified more than 1,500 germline and 22 somatic RRA variants missed by the two methods and which belong to significantly mutated pathways.ConclusionsThese results show that, to date, the investigation of around 100,000 loci of the human genome has been missed by re-sequencing experiments based on the GRCh37 assembly and that our tool can fill the gap left by other methods. Moreover, the investigation of the latest version of the human reference genome, GRCh38, showed that although the GRC corrected almost all insertions and a small part of SNVs and deletions, a large number of functionally relevant RRAs still remain unchanged. For this reason, also future resequencing experiments, based on GRCh38, will benefit from RAREVATOR analysis results. RAREVATOR is freely available at http://sourceforge.net/projects/rarevator
5′UTR point substitutions and N-terminal truncating mutations of ANKRD26 in acute myeloid leukemia
Thrombocytopenia 2 (THC2) is an inherited disorder caused by monoallelic single nucleotide substitutions in the 5'UTR of the ANKRD26 gene. Patients have thrombocytopenia and increased risk of myeloid malignancies, in particular, acute myeloid leukemia (AML). Given the association of variants in the ANKRD26 5'UTR with myeloid neoplasms, we investigated whether, and to what extent, mutations in this region contribute to apparently sporadic AML. To this end, we studied 250 consecutive, non-familial, adult AML patients and screened the first exon of ANKRD26 including the 5'UTR. We found variants in four patients. One patient had the c.-125T>G substitution in the 5'UTR, while three patients carried two different variants in the 5' end of the ANKRD26 coding region (c.3G>A or c.105C>G). Review of medical history showed that the patient carrying the c.-125T>G was actually affected by typical but unrecognized THC2, highlighting that some apparently sporadic AML cases represent the evolution of a well-characterized familial predisposition disorder. As regards the c.3G>A and the c.105C>G, we found that both variants result in the synthesis of N-terminal truncated ANKRD26 isoforms, which are stable and functional in cells, in particular, have a strong ability to activate the MAPK/ERK signaling pathway. Moreover, investigation of one patient with the c.3G>A showed that mutation was associated with strong ANKRD26 overexpression in vivo, which is the proposed mechanism for predisposition to AML in THC2 patients. These data provide evidence that N-terminal ANKRD26 truncating mutations play a potential pathogenetic role in AML. Recognition of AML patients with germline ANKRD26 pathogenetic variants is mandatory for selection of donors for bone marrow transplantation
MIXER: a Machine-learning method to detect genomic Imbalances exploiting X chromosome exome reads
Whole Exome Sequencing (WES) is rapidly becoming a first-tier test, thanks to declining costs and automatic clinical pipelines. However, while identification of small variants follows standardized workflows, there is no agreement on methods for detection of Copy Number Variants (CNVs). A plethora of WES-based CNV callers have been developed, each showing good performance towards only a limited range of CNV classes/sizes. As clinical CNVs extend from large rearrangements to single genes, more versatile approaches are needed to be of enhanced diagnostic use
Loss of function mutations in CCDC32 cause a congenital syndrome characterized by craniofacial, cardiac and neurodevelopmental anomalies
Despite the wide use of genomics to investigate the molecular basis of rare congenital malformations, a significant fraction of patients remains bereft of diagnosis. As part of our continuous effort to recruit and perform genomic and functional studies on such cohorts, we investigated the genetic and mechanistic cause of disease in two independent consanguineous families affected by overlapping craniofacial, cardiac, laterality, and neurodevelopmental anomalies. Using whole exome sequencing, we identified homozygous frameshift CCDC32 variants in three affected individuals. Functional analysis in a zebrafish model revealed that ccdc32 depletion recapitulates the human phenotypes. Because some of the patient phenotypes overlap defects common to ciliopathies, we asked if loss of CCDC32 might contribute to the dysfunction of this organelle. Consistent with this hypothesis, we show that ccdc32 is required for normal cilia formation in zebrafish embryos and mammalian cell culture, arguing that ciliary defects are at least partially involved in the pathomechanism of this disorder
Resources and tools for rare disease variant interpretation
: Collectively, rare genetic disorders affect a substantial portion of the world's population. In most cases, those affected face difficulties in receiving a clinical diagnosis and genetic characterization. The understanding of the molecular mechanisms of these diseases and the development of therapeutic treatments for patients are also challenging. However, the application of recent advancements in genome sequencing/analysis technologies and computer-aided tools for predicting phenotype-genotype associations can bring significant benefits to this field. In this review, we highlight the most relevant online resources and computational tools for genome interpretation that can enhance the diagnosis, clinical management, and development of treatments for rare disorders. Our focus is on resources for interpreting single nucleotide variants. Additionally, we present use cases for interpreting genetic variants in clinical settings and review the limitations of these results and prediction tools. Finally, we have compiled a curated set of core resources and tools for analyzing rare disease genomes. Such resources and tools can be utilized to develop standardized protocols that will enhance the accuracy and effectiveness of rare disease diagnosis
- …