88 research outputs found

    cnvHiTSeq: integrative models for high-resolution copy number variation detection and genotyping using population sequencing data

    Get PDF
    Recent advances in sequencing technologies provide the means for identifying copy number variation (CNV) at an unprecedented resolution. A single next-generation sequencing experiment offers several features that can be used to detect CNV, yet current methods do not incorporate all available signatures into a unified model. cnvHiTSeq is an integrative probabilistic method for CNV discovery and genotyping that jointly analyzes multiple features at the population level. By combining evidence from complementary sources, cnvHiTSeq achieves high genotyping accuracy and a substantial improvement in CNV detection sensitivity over existing methods, while maintaining a low false discovery rate. cnvHiTSeq is available at http://sourceforge.net/projects/cnvhitse

    A multi-platform approach to identify a blood-based host protein signature for distinguishing between bacterial and viral infections in febrile children (PERFORM): a multi-cohort machine learning study

    Get PDF
    BACKGROUND: Differentiating between self-resolving viral infections and bacterial infections in children who are febrile is a common challenge, causing difficulties in identifying which individuals require antibiotics. Studying the host response to infection can provide useful insights and can lead to the identification of biomarkers of infection with diagnostic potential. This study aimed to identify host protein biomarkers for future development into an accurate, rapid point-of-care test that can distinguish between bacterial and viral infections, by recruiting children presenting to health-care settings with fever or a history of fever in the previous 72 h. METHODS: In this multi-cohort machine learning study, patient data were taken from EUCLIDS, the Swiss Pediatric Sepsis study, the GENDRES study, and the PERFORM study, which were all based in Europe. We generated three high-dimensional proteomic datasets (SomaScan and two via liquid chromatography tandem mass spectrometry, referred to as MS-A and MS-B) using targeted and untargeted platforms (SomaScan and liquid chromatography mass spectrometry). Protein biomarkers were then shortlisted using differential abundance analysis, feature selection using forward selection-partial least squares (FS-PLS; 100 iterations), along with a literature search. Identified proteins were tested with Luminex and ELISA and iterative FS-PLS was done again (25 iterations) on the Luminex results alone, and the Luminex and ELISA results together. A sparse protein signature for distinguishing between bacterial and viral infections was identified from the selected proteins. The performance of this signature was finally tested using Luminex assays and by calculating disease risk scores. FINDINGS: 376 children provided serum or plasma samples for use in the discovery of protein biomarkers. 79 serum samples were collected for the generation of the SomaScan dataset, 147 plasma samples for the MS-A dataset, and 150 plasma samples for the MS-B dataset. Differential abundance analysis, and the first round of feature selection using FS-PLS identified 35 protein biomarker candidates, of which 13 had commercial ELISA or Luminex tests available. 16 proteins with ELISA or Luminex tests available were identified by literature review. Further evaluation via Luminex and ELISA and the second round of feature selection using FS-PLS revealed a six-protein signature: three of the included proteins are elevated in bacterial infections (SELE, NGAL, and IFN-γ), and three are elevated in viral infections (IL18, NCAM1, and LG3BP). Performance testing of the signature using Luminex assays revealed area under the receiver operating characteristic curve values between 89·4% and 93·6%. INTERPRETATION: This study has led to the identification of a protein signature that could be ultimately developed into a blood-based point-of-care diagnostic test for rapidly diagnosing bacterial and viral infections in febrile children. Such a test has the potential to greatly improve care of children who are febrile, ensuring that the correct individuals receive antibiotics. FUNDING: European Union's Horizon 2020 research and innovation programme, the European Union's Seventh Framework Programme (EUCLIDS), Imperial Biomedical Research Centre of the National Institute for Health Research, the Wellcome Trust and Medical Research Foundation, Instituto de Salud Carlos III, Consorcio Centro de Investigación Biomédica en Red de Enfermedades Respiratorias, Grupos de Refeencia Competitiva, Swiss State Secretariat for Education, Research and Innovation

    Diagnosis of childhood febrile illness using a multi-class blood RNA molecular signature

    Get PDF
    BACKGROUND: Appropriate treatment and management of children presenting with fever depend on accurate and timely diagnosis, but current diagnostic tests lack sensitivity and specificity and are frequently too slow to inform initial treatment. As an alternative to pathogen detection, host gene expression signatures in blood have shown promise in discriminating several infectious and inflammatory diseases in a dichotomous manner. However, differential diagnosis requires simultaneous consideration of multiple diseases. Here, we show that diverse infectious and inflammatory diseases can be discriminated by the expression levels of a single panel of genes in blood. METHODS: A multi-class supervised machine-learning approach, incorporating clinical consequence of misdiagnosis as a "cost" weighting, was applied to a whole-blood transcriptomic microarray dataset, incorporating 12 publicly available datasets, including 1,212 children with 18 infectious or inflammatory diseases. The transcriptional panel identified was further validated in a new RNA sequencing dataset comprising 411 febrile children. FINDINGS: We identified 161 transcripts that classified patients into 18 disease categories, reflecting individual causative pathogen and specific disease, as well as reliable prediction of broad classes comprising bacterial infection, viral infection, malaria, tuberculosis, or inflammatory disease. The transcriptional panel was validated in an independent cohort and benchmarked against existing dichotomous RNA signatures. CONCLUSIONS: Our data suggest that classification of febrile illness can be achieved with a single blood sample and opens the way for a new approach for clinical diagnosis. FUNDING: European Union's Seventh Framework no. 279185; Horizon2020 no. 668303 PERFORM; Wellcome Trust (206508/Z/17/Z); Medical Research Foundation (MRF-160-0008-ELP-KAFO-C0801); NIHR Imperial BRC

    Insights from computational modeling in inflammation and acute rejection in limb transplantation

    Get PDF
    Acute skin rejection in vascularized composite allotransplantation (VCA) is the major obstacle for wider adoption in clinical practice. This study utilized computational modeling to identify biomarkers for diagnosis and targets for treatment of skin rejection. Protein levels of 14 inflammatory mediators in skin and muscle biopsies from syngeneic grafts [n = 10], allogeneic transplants without immunosuppression [n = 10] and allografts treated with tacrolimus [n = 10] were assessed by multiplexed analysis technology. Hierarchical Clustering Analysis, Principal Component Analysis, Random Forest Classification and Multinomial Logistic Regression models were used to segregate experimental groups. Based on Random Forest Classification, Multinomial Logistic Regression and Hierarchical Clustering Analysis models, IL-4, TNF-α and IL-12p70 were the best predictors of skin rejection and identified rejection well in advance of histopathological alterations. TNF-α and IL-12p70 were the best predictors of muscle rejection and also preceded histopathological alterations. Principal Component Analysis identified IL-1α, IL-18, IL-1β, and IL-4 as principal drivers of transplant rejection. Thus, inflammatory patterns associated with rejection are specific for the individual tissue and may be superior for early detection and targeted treatment of rejection. © 2014 Wolfram et al

    A Latent Variable Partial Least Squares Path Modeling Approach to Regional Association and Polygenic Effect with Applications to a Human Obesity Study

    Get PDF
    Genetic association studies are now routinely used to identify single nucleotide polymorphisms (SNPs) linked with human diseases or traits through single SNP-single trait tests. Here we introduced partial least squares path modeling (PLSPM) for association between single or multiple SNPs and a latent trait that can involve single or multiple correlated measurement(s). Furthermore, the framework naturally provides estimators of polygenic effect by appropriately weighting trait-attributing alleles. We conducted computer simulations to assess the performance via multiple SNPs and human obesity-related traits as measured by body mass index (BMI), waist and hip circumferences. Our results showed that the associate statistics had type I error rates close to nominal level and were powerful for a range of effect and sample sizes. When applied to 12 candidate regions in data (N = 2,417) from the European Prospective Investigation of Cancer (EPIC)-Norfolk study, a region in FTO was found to have stronger association (rs7204609∼rs9939881 at the first intron P = 4.29×10−7) than single SNP analysis (all with P>10−4) and a latent quantitative phenotype was obtained using a subset sample of EPIC-Norfolk (N = 12,559). We believe our method is appropriate for assessment of regional association and polygenic effect on a single or multiple traits

    Diagnosis of multisystem inflammatory syndrome in children by a whole-blood transcriptional signature

    Get PDF
    BACKGROUND: To identify a diagnostic blood transcriptomic signature that distinguishes multisystem inflammatory syndrome in children (MIS-C) from Kawasaki disease (KD), bacterial infections, and viral infections. METHODS: Children presenting with MIS-C to participating hospitals in the United Kingdom and the European Union between April 2020 and April 2021 were prospectively recruited. Whole-blood RNA Sequencing was performed, contrasting the transcriptomes of children with MIS-C (n = 38) to those from children with KD (n = 136), definite bacterial (DB; n = 188) and viral infections (DV; n = 138). Genes significantly differentially expressed (SDE) between MIS-C and comparator groups were identified. Feature selection was used to identify genes that optimally distinguish MIS-C from other diseases, which were subsequently translated into RT-qPCR assays and evaluated in an independent validation set comprising MIS-C (n = 37), KD (n = 19), DB (n = 56), DV (n = 43), and COVID-19 (n = 39). RESULTS: In the discovery set, 5696 genes were SDE between MIS-C and combined comparator disease groups. Five genes were identified as potential MIS-C diagnostic biomarkers (HSPBAP1, VPS37C, TGFB1, MX2, and TRBV11-2), achieving an AUC of 96.8% (95% CI: 94.6%-98.9%) in the discovery set, and were translated into RT-qPCR assays. The RT-qPCR 5-gene signature achieved an AUC of 93.2% (95% CI: 88.3%-97.7%) in the independent validation set when distinguishing MIS-C from KD, DB, and DV. CONCLUSIONS: MIS-C can be distinguished from KD, DB, and DV groups using a 5-gene blood RNA expression signature. The small number of genes in the signature and good performance in both discovery and validation sets should enable the development of a diagnostic test for MIS-C

    Age-Related Neuronal Degeneration: Complementary Roles of Nucleotide Excision Repair and Transcription-Coupled Repair in Preventing Neuropathology

    Get PDF
    Neuronal degeneration is a hallmark of many DNA repair syndromes. Yet, how DNA damage causes neuronal degeneration and whether defects in different repair systems affect the brain differently is largely unknown. Here, we performed a systematic detailed analysis of neurodegenerative changes in mouse models deficient in nucleotide excision repair (NER) and transcription-coupled repair (TCR), two partially overlapping DNA repair systems that remove helix-distorting and transcription-blocking lesions, respectively, and that are associated with the UV-sensitive syndromes xeroderma pigmentosum (XP) and Cockayne syndrome (CS). TCR–deficient Csa−/− and Csb−/− CS mice showed activated microglia cells surrounding oligodendrocytes in regions with myelinated axons throughout the nervous system. This white matter microglia activation was not observed in NER–deficient Xpa−/− and Xpc−/− XP mice, but also occurred in XpdXPCS mice carrying a point mutation (G602D) in the Xpd gene that is associated with a combined XPCS disorder and causes a partial NER and TCR defect. The white matter abnormalities in TCR–deficient mice are compatible with focal dysmyelination in CS patients. Both TCR–deficient and NER–deficient mice showed no evidence for neuronal degeneration apart from p53 activation in sporadic (Csa−/−, Csb−/−) or highly sporadic (Xpa−/−, Xpc−/−) neurons and astrocytes. To examine to what extent overlap occurs between both repair systems, we generated TCR–deficient mice with selective inactivation of NER in postnatal neurons. These mice develop dramatic age-related cumulative neuronal loss indicating DNA damage substrate overlap and synergism between TCR and NER pathways in neurons, and they uncover the occurrence of spontaneous DNA injury that may trigger neuronal degeneration. We propose that, while Csa−/− and Csb−/− TCR–deficient mice represent powerful animal models to study the mechanisms underlying myelin abnormalities in CS, neuron-specific inactivation of NER in TCR–deficient mice represents a valuable model for the role of NER in neuronal maintenance and survival

    GWAS on longitudinal growth traits reveals different genetic factors influencing infant, child, and adult BMI

    Get PDF
    Early childhood growth patterns are associated with adult health, yet the genetic factors and the developmental stages involved are not fully understood. Here, we combine genome-wide association studies with modeling of longitudinal growth traits to study the genetics of infant and child growth, followed by functional, pathway, genetic correlation, risk score, and colocalization analyses to determine how developmental timings, molecular pathways, and genetic determinants of these traits overlap with those of adult health. We found a robust overlap between the genetics of child and adult body mass index (BMI), with variants associated with adult BMI acting as early as 4 to 6 years old. However, we demonstrated a completely distinct genetic makeup for peak BMI during infancy, influenced by variation at the LEPR/LEPROT locus. These findings suggest that different genetic factors control infant and child BMI. In light of the obesity epidemic, these findings are important to inform the timing and targets of prevention strategies.Peer reviewe

    Detection of Tuberculosis in HIV-Infected and -Uninfected African Adults Using Whole Blood RNA Expression Signatures: A Case-Control Study

    Get PDF
    BACKGROUND: A major impediment to tuberculosis control in Africa is the difficulty in diagnosing active tuberculosis (TB), particularly in the context of HIV infection. We hypothesized that a unique host blood RNA transcriptional signature would distinguish TB from other diseases (OD) in HIV-infected and -uninfected patients, and that this could be the basis of a simple diagnostic test. METHODS AND FINDINGS: Adult case-control cohorts were established in South Africa and Malawi of HIV-infected or -uninfected individuals consisting of 584 patients with either TB (confirmed by culture of Mycobacterium tuberculosis [M.TB] from sputum or tissue sample in a patient under investigation for TB), OD (i.e., TB was considered in the differential diagnosis but then excluded), or healthy individuals with latent TB infection (LTBI). Individuals were randomized into training (80%) and test (20%) cohorts. Blood transcriptional profiles were assessed and minimal sets of significantly differentially expressed transcripts distinguishing TB from LTBI and OD were identified in the training cohort. A 27 transcript signature distinguished TB from LTBI and a 44 transcript signature distinguished TB from OD. To evaluate our signatures, we used a novel computational method to calculate a disease risk score (DRS) for each patient. The classification based on this score was first evaluated in the test cohort, and then validated in an independent publically available dataset (GSE19491). In our test cohort, the DRS classified TB from LTBI (sensitivity 95%, 95% CI [87-100]; specificity 90%, 95% CI [80-97]) and TB from OD (sensitivity 93%, 95% CI [83-100]; specificity 88%, 95% CI [74-97]). In the independent validation cohort, TB patients were distinguished both from LTBI individuals (sensitivity 95%, 95% CI [85-100]; specificity 94%, 95% CI [84-100]) and OD patients (sensitivity 100%, 95% CI [100-100]; specificity 96%, 95% CI [93-100]). Limitations of our study include the use of only culture confirmed TB patients, and the potential that TB may have been misdiagnosed in a small proportion of OD patients despite the extensive clinical investigation used to assign each patient to their diagnostic group. CONCLUSIONS: In our study, blood transcriptional signatures distinguished TB from other conditions prevalent in HIV-infected and -uninfected African adults. Our DRS, based on these signatures, could be developed as a test for TB suitable for use in HIV endemic countries. Further evaluation of the performance of the signatures and DRS in prospective populations of patients with symptoms consistent with TB will be needed to define their clinical value under operational conditions. Please see later in the article for the Editors' Summary
    corecore