20 research outputs found

    Refined mapping of autoimmune disease associated genetic variants with gene expression suggests an important role for non-coding RNAs

    Get PDF
    Genome-wide association and fine-mapping studies in 14 autoimmune diseases (AID) have implicated more than 250 loci in one or more of these diseases. As more than 90% of AID-associated SNPs are intergenic or intronic, pinpointing the causal genes is challenging. We performed a systematic analysis to link 460 SNPs that are associated with 14 AID to causal genes using transcriptomic data from 629 blood samples. We were able to link 71 (39%) of the AID-SNPs to two or more nearby genes, providing evidence that for part of the AID loci multiple causal genes exist. While 54 of the AID loci are shared by one or more AID, 17% of them do not share candidate causal genes. In addition to finding novel genes such as ULK3, we also implicate novel disease mechanisms and pathways like autophagy in celiac disease pathogenesis. Furthermore, 42 of the AID SNPs specifically affected the expression of 53 non-coding RNA genes. To further understand how the non-coding genome contributes to AID, the SNPs were linked to functional regulatory elements, which suggest a model where AID genes are regulated by network of chromatin looping/non-coding RNAs interactions. The looping model also explains how a causal candidate gene is not necessarily the gene closest to the AID SNP, which was the case in nearly 50% of cases

    Implementation of early next-generation sequencing for inborn errors of immunity: a prospective observational cohort study of diagnostic yield and clinical implications in Dutch genome diagnostic centers

    Get PDF
    Objective Inborn errors of immunity (IEI) are a heterogeneous group of disorders, affecting different components of the immune system. Over 450 IEI related genes have been identified, with new genes continually being recognized. This makes the early application of next-generation sequencing (NGS) as a diagnostic method in the evaluation of IEI a promising development. We aimed to provide an overview of the diagnostic yield and time to diagnosis in a cohort of patients suspected of IEI and evaluated by an NGS based IEI panel early in the diagnostic trajectory in a multicenter setting in the Netherlands. Study DesignWe performed a prospective observational cohort study. We collected data of 165 patients with a clinical suspicion of IEI without prior NGS based panel evaluation that were referred for early NGS using a uniform IEI gene panel. The diagnostic yield was assessed in terms of definitive genetic diagnoses, inconclusive diagnoses and patients without abnormalities in the IEI gene panel. We also assessed time to diagnosis and clinical implications. ResultsFor children, the median time from first consultation to diagnosis was 119 days versus 124 days for adult patients (U=2323; p=0.644). The median turn-around time (TAT) of genetic testing was 56 days in pediatric patients and 60 days in adult patients (U=1892; p=0.191). A definitive molecular diagnosis was made in 25/65 (24.6%) of pediatric patients and 9/100 (9%) of adults. Most diagnosed disorders were identified in the categories of immune dysregulation (n=10/25; 40%), antibody deficiencies (n=5/25; 20%), and phagocyte diseases (n=5/25; 20%). Inconclusive outcomes were found in 76/165 (46.1%) patients. Within the patient group with a genetic diagnosis, a change in disease management occurred in 76% of patients. ConclusionIn this cohort, the highest yields of NGS based evaluation for IEI early in the diagnostic trajectory were found in pediatric patients, and in the disease categories immune dysregulation and phagocyte diseases. In cases where a definitive diagnosis was made, this led to important disease management implications in a large majority of patients. More research is needed to establish a uniform diagnostic pathway for cases with inconclusive diagnoses, including variants of unknown significance.Transplantation and immunomodulatio

    Discovery of widespread transcription initiation at microsatellites predictable by sequence-based deep neural network

    Get PDF
    Using the Cap Analysis of Gene Expression (CAGE) technology, the FANTOM5 consortium provided one of the most comprehensive maps of transcription start sites (TSSs) in several species. Strikingly, ~72% of them could not be assigned to a specific gene and initiate at unconventional regions, outside promoters or enhancers. Here, we probe these unassigned TSSs and show that, in all species studied, a significant fraction of CAGE peaks initiate at microsatellites, also called short tandem repeats (STRs). To confirm this transcription, we develop Cap Trap RNA-seq, a technology which combines cap trapping and long read MinION sequencing. We train sequence-based deep learning models able to predict CAGE signal at STRs with high accuracy. These models unveil the importance of STR surrounding sequences not only to distinguish STR classes, but also to predict the level of transcription initiation. Importantly, genetic variants linked to human diseases are preferentially found at STRs with high transcription initiation level, supporting the biological and clinical relevance of transcription initiation at STRs. Together, our results extend the repertoire of non-coding transcription associated with DNA tandem repeats and complexify STR polymorphism

    ACO-based Bayesian network ensembles for the hierarchical classification of ageing-related proteins

    No full text
    The task of predicting protein functions using computational techniques is a major research area in the field of bioinformatics. Casting the task into a classification problem makes it challenging, since the classes (functions) to be predicted are hierarchically related, and a protein can have more than one function. One approach is to produce a set of local classifiers; each is responsible for discriminating between a subset of the classes in a certain level of the hierarchy. In this paper we tackle the hierarchical classification problem in a local fashion, by learning an ensemble of Bayesian network classifiers for each class in the hierarchy and combining their outputs with four alternative methods: a) selecting the best classifier, b) majority voting, c) weighted voting, and d) constructing a meta-classifier. The ensemble is built using ABC-Miner, our recently introduced Ant-based Bayesian Classification algorithm. We use different types of protein representations to learn different classification models. We empirically evaluate our proposed methods on an ageing-related protein dataset created for this research
    corecore