47 research outputs found

    Childhood tuberculosis is associated with decreased abundance of T cell gene transcripts and impaired T cell function

    Get PDF
    The WHO estimates around a million children contract tuberculosis (TB) annually with over 80 000 deaths from dissemination of infection outside of the lungs. The insidious onset and association with skin test anergy suggests failure of the immune system to both recognise and respond to infection. To understand the immune mechanisms, we studied genome-wide whole blood RNA expression in children with TB meningitis (TBM). Findings were validated in a second cohort of children with TBM and pulmonary TB (PTB), and functional T-cell responses studied in a third cohort of children with TBM, other extrapulmonary TB (EPTB) and PTB. The predominant RNA transcriptional response in children with TBM was decreased abundance of multiple genes, with 140/204 (68%) of all differentially regulated genes showing reduced abundance compared to healthy controls. Findings were validated in a second cohort with concordance of the direction of differential expression in both TBM (r2 = 0.78 p = 2x10-16) and PTB patients (r2 = 0.71 p = 2x10-16) when compared to a second group of healthy controls. Although the direction of expression of these significant genes was similar in the PTB patients, the magnitude of differential transcript abundance was less in PTB than in TBM. The majority of genes were involved in activation of leucocytes (p = 2.67E-11) and T-cell receptor signalling (p = 6.56E-07). Less abundant gene expression in immune cells was associated with a functional defect in T-cell proliferation that recovered after full TB treatment (p<0.0003). Multiple genes involved in T-cell activation show decreased abundance in children with acute TB, who also have impaired functional T-cell responses. Our data suggest that childhood TB is associated with an acquired immune defect, potentially resulting in failure to contain the pathogen. Elucidation of the mechanism causing the immune paresis may identify new treatment and prevention strategies

    Diagnosis of Kawasaki Disease Using a Minimal Whole-Blood Gene Expression Signature.

    Get PDF
    Importance: To date, there is no diagnostic test for Kawasaki disease (KD). Diagnosis is based on clinical features shared with other febrile conditions, frequently resulting in delayed or missed treatment and an increased risk of coronary artery aneurysms. Objective: To identify a whole-blood gene expression signature that distinguishes children with KD in the first week of illness from other febrile conditions. Design, Setting, and Participants: The case-control study comprised a discovery group that included a training and test set and a validation group of children with KD or comparator febrile illness. The setting was pediatric centers in the United Kingdom, Spain, the Netherlands, and the United States. The training and test discovery group comprised 404 children with infectious and inflammatory conditions (78 KD, 84 other inflammatory diseases, and 242 bacterial or viral infections) and 55 healthy controls. The independent validation group comprised 102 patients with KD, including 72 in the first 7 days of illness, and 130 febrile controls. The study dates were March 1, 2009, to November 14, 2013, and data analysis took place from January 1, 2015, to December 31, 2017. Main Outcomes and Measures: Whole-blood gene expression was evaluated using microarrays, and minimal transcript sets distinguishing KD were identified using a novel variable selection method (parallel regularized regression model search). The ability of transcript signatures (implemented as disease risk scores) to discriminate KD cases from controls was assessed by area under the curve (AUC), sensitivity, and specificity at the optimal cut point according to the Youden index. Results: Among 404 patients in the discovery set, there were 78 with KD (median age, 27 months; 55.1% male) and 326 febrile controls (median age, 37 months; 56.4% male). Among 202 patients in the validation set, there were 72 with KD (median age, 34 months; 62.5% male) and 130 febrile controls (median age, 17 months; 56.9% male). A 13-transcript signature identified in the discovery training set distinguished KD from other infectious and inflammatory conditions in the discovery test set, with AUC of 96.2% (95% CI, 92.5%-99.9%), sensitivity of 81.7% (95% CI, 60.0%-94.8%), and specificity of 92.1% (95% CI, 84.0%-97.0%). In the validation set, the signature distinguished KD from febrile controls, with AUC of 94.6% (95% CI, 91.3%-98.0%), sensitivity of 85.9% (95% CI, 76.8%-92.6%), and specificity of 89.1% (95% CI, 83.0%-93.7%). The signature was applied to clinically defined categories of definite, highly probable, and possible KD, resulting in AUCs of 98.1% (95% CI, 94.5%-100%), 96.3% (95% CI, 93.3%-99.4%), and 70.0% (95% CI, 53.4%-86.6%), respectively, mirroring certainty of clinical diagnosis. Conclusions and Relevance: In this study, a 13-transcript blood gene expression signature distinguished KD from other febrile conditions. Diagnostic accuracy increased with certainty of clinical diagnosis. A test incorporating the 13-transcript disease risk score may enable earlier diagnosis and treatment of KD and reduce inappropriate treatment in those with other diagnoses

    Statistical Methods for Replicated, High-Dimensional Biological Time Series

    No full text
    The processes which govern the function of biological organisms are inherently dynamic and studying their behaviour over time is critical for gaining insight into their underlying mechanisms. They are also incredibly complex with tens of thousands of interacting variables comprising their state. In recent years, the development of high-throughput assaying technologies such as microarrays and nuclear magnetic resonance spectroscopy have revolutionised the fields of genomics and metabolomics respectively with their ability to quickly and easily interrogate these states at a single moment in time. When these assaying technologies are used to collect measurements repeatedly on the same biological unit, such as a human patient, laboratory rat or cell line, then the temporal behaviour of the system can begin to emerge. Furthermore, when several of these units are studied simultaneously then the experiment is said to be biologically replicated and such data sets permit the inference of systemic behaviour in the population as a whole. The time series data sets arising from these replicated `omics experiments possess unique characteristics that make for challenging statistical analysis. They are very short (3-10 time points is typical), heterogeneous, noisy, frequently irregularly sampled and often have missing observations, in addition to being very highly dimensional. To overcome some of these difficulties, researchers in the field of genomics have turned to functional data analysis, which has proven to be successful in modelling unreplicated data sets. Replicated data sets, however, have received far less attention, due to the complexity introduced by the extremely small sample sizes and multiple levels of variation - the between-variable and the between-replicate. Furthermore, despite the remarkable similarities between genomics and metabolomics time series data sets, these methods have been far less successful at establishing themselves in the latter field. In this thesis we present a general statistical framework for the analysis of replicated, high-dimensional biological time series data sets. Supported by three case studies, we develop novel models and algorithms for tackling the unique challenges that each data set presents. We show how these fitted models can be used in dimensionality reduction, summarising the thousands of observed time series into a small number of representative temporal profiles that are eminently biologically interpretable. We introduce a novel moderated functional t -statistic that can be used for detecting variables that differ significantly between two biological groups, leveraging the high dimensionality of the data in order to increase power. In all instances detailed simulation studies are used to demonstrate that the methods outperform existing state-of-the-art approaches. With practical data analysis in mind, careful consideration is given to the implementation of the methods in software that is computationally efficient, with parallel programming exploited wherever possible. In most instances, the methods have resulted in novel biological findings when applied to real data, and represent, as far as we are aware, the first application of such functional data analysis models to metabolomics time series experiments

    Functional Modelling of Microarray Time Series

    No full text
    corecore