70 research outputs found

    NGS based studies on primary immunodeficiencies (PIDs) : causative gene identification, tool development and application

    Get PDF
    Primary immunodeficiency diseases (PIDs) are composed by a group of highly heterogeneous immune system diseases, of which approximately 350 forms of PID have been described so far. The causative gene of around 60% of patients with PIDs has yet unknown. In recent years, Next Generation Sequencing (NGS) has been increasingly adopted for gene identification and molecular diagnosis of rare diseases, including PIDs. An overview of the genetic makeup that underlies PID using NGS has been suggested as a promising approach to elucidate the etiology of PIDs, which could yield diagnostic and, possibly, provide new treatment advances for PID. To approach this goal, we performed either whole exome sequencing (WES, 454 samples) or targeted region sequencing (TRS, 217 samples) on 602 samples of 500 PID pedigrees. We have summarized the practical suggestions for the interpretation of NGS data and the techniques that can be used to search disease-causative PID genes in Paper I. This work aims to improve data annotation, interpretation, and application of NGS data in PIDs, which also facilitates a wide range of application of NGS data analysis in other Mendelian disorders. The genetic approach together with immunological investigations have identified potential pathogenic variants in 86 primary antibody deficiency (PAD) patients (68.2%), and a correct diagnosis can guide/change treatment plan in around half of the patients with PAD (Paper II). We identified potentially disease-causing variants (including variants classified as VUS (variants of unknown clinical significance)) in around 34% of genetically unidentified PID samples, which had been subjected to TRS using a panel of 219 common PID genes. Notably, the genetic diagnosis of a specific atypical ITK deficiency case adds to the growing amount of evidence supporting the importance of genetic investigations initiated at an early stage of the patient´s disease (Paper III). Altogether, around 60% of PID patients have a possible diagnosis via WES/TRS. Copy number variation defects were identified in 16 patients (4 genes were involved, LRBA, ATM, DOCK8 and PMS2). Beyond the identification of the monogenic causal gene based on pedigree analysis, mutation frequency analysis has been used to identify genes with rare functional variants in the higher proportion of patients in specific patient group compared to control samples, which have discovered several potential novel PID genes (TNFRSF18, PIK3CG, LILRB1, EPHB2, TXNIP, CD5 and NLRP5). Other possible models beyond the monogenic scenario were also explored, and 16 severe combined immunodeficiency (SCID) or common variable immunodeficiency (CVID) patients might be due to an accumulation of rare amino acid substitution variants in genes related to the same function or pathway (RAG1 & RAG2, RAG1 & ATM, C3 & ITGB2, PRKDC & ATM, C5 & NIPBL, LRBA & CR2, CR2 & NFKB1, UNC93B1 & NIPBL, PLCG2 & NOD2 and IGLL1 & ATM). These findings indicate that NGS, together with a large sample size, is powerful in decoding the genetic characteristics of PID and provide insight into molecular mechanisms that cause the disease. Existing variants impact prediction software/algorithms still have a challenge to evaluate the pathological consequences of the prioritized variants or genes. We thus developed a Random Forest-based discriminator, Variant Impact Predictor for PIDs (VIPPID), to refine the prediction algorithms, which utilized the features of pathogenic variants and benign mutations, integrated with other 24 predictive softwares currently used. Evaluation of VIPPID showed that it had superior performance (AUC=0.95) over existing tools, we also showed the gene-specific model outperformed the non-gene-specific model and provided a possibility to explore the underlying molecular mechanism based on our gene-specific model in Paper IV. Specific mutations of PID causative genes may exert different effects on TCR repertoire diversity and composition, which ultimately lead to heterogeneous phenotypes. DNA damage response/methylation is an essential process during antigen receptor recombination. To investigate the effect of mutations in DNA repair genes on adaptive immunity, 19 patients with DNA repair/methylation defects were selected and subdivided into several groups based on their causative genes, we then performed deep immune repertoire sequencing and comparison with 14 age-matched healthy controls. Patients with different molecular diagnosis exhibited distinct repertoire diversity, clonality and V-J pairing patterns. Aberrant complementarity-determining region 3 (CDR3) length distribution was observed both in unproductive and productive TCRs in all patients, suggesting that it predominantly arose before thymic selection. Shorter CDR3 lengths in AT patients resulted from a decreased number of insertions, led to an increase in the number of shared clonotypes, whereas patients with DNMT3B and ZBTB24 mutations presented longer CDR3 lengths and reduced specificity for pathogen-associated CDR3 sequences (Paper V). This study revealed the role of DNA repair/methylation machinery in patients with ATM, DNMT3B and ZBTB24 deficiency, and shed light on the mechanistic etiology of their T cell dysfunction

    Calculating and comparing codon usage values in rare disease genes highlights codon clustering with disease-and tissue- specific hierarchy

    Get PDF
    We designed a novel strategy to define codon usage bias (CUB) in 6 specific small cohorts of human genes. We calculated codon usage (CU) values in 29 non-disease-causing (NDC) and 31 disease-causing (DC) human genes which are highly expressed in 3 distinct tissues, kidney, muscle, and skin. We applied our strategy to the same selected genes annotated in 15 mammalian species. We obtained CUB hierarchical clusters for each gene cohort which showed tissue-specific and disease-specific CUB fingerprints. We showed that DC genes (especially those expressed in muscle) display a low CUB, well recognizable in codon hierarchical clustering. We defined the extremely biased codons as “zero codons” and found that their number is significantly higher in all DC genes, all tissues, and that this trend is conserved across mammals. Based on this calculation in different gene cohorts, we identified 5 codons which are more differentially used across genes and mammals, underlining that some genes have favorite synonymous codons in use. Since of the muscle genes clear clusters, and, among these, dystrophin gene surprisingly does not show any “zero codon” we adopted a novel approach to study CUB, we called “mapping-on-codons”. We positioned 2828 dystrophin missense and nonsense pathogenic variations on their respective codon, highlighting that its frequency and occurrence is not dependent on the CU values. We conclude our strategy consents to identify a hierarchical clustering of CU values in a gene cohort-specific fingerprints, with recognizable trend across mammals. In DC muscle genes also a disease-related fingerprint can be observed, allowing discrimination between DC and NDC genes. We propose that using our strategy which studies CU in specific gene cohorts, as rare disease genes, and tissue specific genes, may provide novel information about the CUB role in human and medical genetics, with implications on synonymous variations interpretation and codon optimization algorithms

    Molecular genetic analysis using targeted NGS analysis of 677 individuals with retinal dystrophy

    Get PDF
    Abstract Inherited retinal diseases (IRDs) are a common cause of visual impairment. IRD covers a set of genetically highly heterogeneous disorders with more than 150 genes associated with one or more clinical forms of IRD. Molecular genetic diagnosis has become increasingly important especially due to expanding number of gene therapy strategies under development. Next generation sequencing (NGS) of gene panels has proven a valuable diagnostic tool in IRD. We present the molecular findings of 677 individuals, residing in Denmark, with IRD and report 806 variants of which 187 are novel. We found that deletions and duplications spanning one or more exons can explain 3% of the cases, and thus copy number variation (CNV) analysis is important in molecular genetic diagnostics of IRD. Seven percent of the individuals have variants classified as pathogenic or likely-pathogenic in more than one gene. Possible Danish founder variants in EYS and RP1 are reported. A significant number of variants were classified as variants with unknown significance; reporting of these will hopefully contribute to the elucidation of the actual clinical consequence making the classification less troublesome in the future. In conclusion, this study underlines the relevance of performing targeted sequencing of IRD including CNV analysis as well as the importance of interaction with clinical diagnoses

    Incremental value of non-invasive myocardial work for the evaluation and prediction of coronary microvascular dysfunction in angina with no obstructive coronary artery disease

    Get PDF
    BackgroundEvidence suggests that patients suffering from angina with no obstructive coronary artery disease (ANOCA) experience coronary microvascular dysfunction (CMD). We aimed to understand the diagnosis value of noninvasive myocardial work indices (MWIs) with left ventricular pressure-strain loop (LV PSL) by echocardiography in ANOCA patients with CMD.Methods97 patients with ANOCA were recruited. All subjects underwent standard echocardiography with traditional ultrasound parameters, two-dimensional speckle-tracking echocardiography with global longitudinal strain (GLS), LV PSL with MWIs include global work index (GWI), global constructive work (GCW), global waste work (GWW) and global work efficiency (GWE). In addition, all enrolled cases underwent high-dose adenosine stress echocardiography (SE) with coronary flow velocity reserve (CFVR). CMD was defined as CFVR <2.0.ResultsOf the 97 patients with ANOCA, 52 were placed in the CMD group and 45 in the control group. GWI and GCW were decreased significantly in the CMD group compared with the control group (P < 0.001 for both). GWI and GCW were moderately correlated with CFVR (r = 0.430, P < 0.001 and r = 0.538, P < 0.001, respectively). In the multiple logistic regression analyses, GCW was identified as the only independent echocardiography parameter associated with CMD after adjusting for age and baseline APV [OR (95%CI) 1.009 (1.005–1.013); P < 0.001]. Moreover, the best predictor of CMD in patients with ANOCA using receiver operating characteristic (ROC) curve was GWI and GCW, with an area under the curve (AUC) of 0.800 and 0.832, sensitivity of 67.3% and 78.8%, specificity of 80.0% and 75.6%, respectively.ConclusionMWIs with LV PSL is a new method to detect LV systolic function noninvasively in ANOCA patients with CMD

    Myopia disease mouse models: a missense point mutation (S673G) and a protein-truncating mutation of the Zfp644 mimic human disease phenotype.

    Get PDF
    Zinc finger 644 (Zfp644 in mouse, ZNF644 in human) gene is a transcription factor whose mutation S672G is considered a potential genetic factor of inherited high myopia. ZNF644 interacts with G9a/GLP complex, which functions as a H3K9 methyltransferase to silence transcription. In this study, we generated mouse models to unravel the mechanisms leading to symptoms associated with high myopia. Employing TALEN technology, two mice mutants were generated, either with the disease-carrying mutation (Zfp644 S673G ) or with a truncated form of Zfp644 (Zfp644 Δ8 ). Eye morphology and visual functions were analysed in both mutants, revealing a significant difference in a vitreous chamber depth and lens diameter, however the physiological function of retina was preserved as found under the high-myopia conditions. Our findings prove that ZNF644/Zfp644 is involved in the development of high-myopia, indicating that mutations such as, Zfp644 S673G and Zfp644 Δ8 are causative for changes connected with the disease. The developed models represent a valuable tool to investigate the molecular basis of myopia pathogenesis and its potential treatment

    CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans

    Full text link
    Human readers or radiologists routinely perform full-body multi-organ multi-disease detection and diagnosis in clinical practice, while most medical AI systems are built to focus on single organs with a narrow list of a few diseases. This might severely limit AI's clinical adoption. A certain number of AI models need to be assembled non-trivially to match the diagnostic process of a human reading a CT scan. In this paper, we construct a Unified Tumor Transformer (CancerUniT) model to jointly detect tumor existence & location and diagnose tumor characteristics for eight major cancers in CT scans. CancerUniT is a query-based Mask Transformer model with the output of multi-tumor prediction. We decouple the object queries into organ queries, tumor detection queries and tumor diagnosis queries, and further establish hierarchical relationships among the three groups. This clinically-inspired architecture effectively assists inter- and intra-organ representation learning of tumors and facilitates the resolution of these complex, anatomically related multi-organ cancer image reading tasks. CancerUniT is trained end-to-end using a curated large-scale CT images of 10,042 patients including eight major types of cancers and occurring non-cancer tumors (all are pathology-confirmed with 3D tumor masks annotated by radiologists). On the test set of 631 patients, CancerUniT has demonstrated strong performance under a set of clinically relevant evaluation metrics, substantially outperforming both multi-disease methods and an assembly of eight single-organ expert models in tumor detection, segmentation, and diagnosis. This moves one step closer towards a universal high performance cancer screening tool.Comment: ICCV 2023 Camera Ready Versio

    Targeting the tubulin C-terminal tail by charged small molecules

    Get PDF
    10 p.-6 fig.The disordered tubulin C-terminal tail (CTT), which possesses a higher degree of heterogeneity, is the target for the interaction of many proteins and cellular components. Compared to the seven well-described binding sites of microtubule-targeting agents (MTAs) that localize on the globular tubulin core, tubulin CTT is far less explored. Therefore, tubulin CTT can be regarded as a novel site for the development of MTAs with distinct biochemical and cell biological properties. Here, we designed and synthesized linear and cyclic peptides containing multiple arginines (RRR), which are complementary to multiple acidic residues in tubulin CTT. Some of them showed moderate induction and promotion of tubulin polymerization. The most potent macrocyclic compound 1f was found to bind to tubulin CTT and thus exert its bioactivity. Such RRR containing compounds represent a starting point for the discovery of tubulin CTT-targeting agents with therapeutic potential.This research was funded by the CAMS Innovation Fund for Medical Sciences (Grant No. 2016-I2M-1-010), State Key Lab Grant type C (Grant No. GTZC201709) (W.-S. F.), Ministerio de Ciencia e Innovación (Grant No. PID2021-123399OB-I00 /AEI/10.13039/501100011033) (M. Á. O.), Ministerio de Ciencia e Innovacion (Grant No. PID2019-104545RB-I00 /AEI/10.13039/501100011033), European Commission-NextGenerationsEU (Regulation EU 2020/2094, through CSIC's Global Health Platform (PTI Salud Global)), Proyecto de Investigación en Neurociencia Fundacion Tatiana Pérez de Guzmán el Bueno 2020 (J. F. D.), the Voelcker Fund (A. R.), the Natural Science Foundation of Guangdong Province of China (Grant No. 2022A1515011419), and the Key Project in Higher Education of Guangdong, China (Grant No. 2022ZDZX2029) (Z. Y.).Peer reviewe
    corecore