493 research outputs found

    Joint and individual variation explained (JIVE) for integrated analysis of multiple data types

    Get PDF
    Research in several fields now requires the analysis of data sets in which multiple high-dimensional types of data are available for a common set of objects. In particular, The Cancer Genome Atlas (TCGA) includes data from several diverse genomic technologies on the same cancerous tumor samples. In this paper we introduce Joint and Individual Variation Explained (JIVE), a general decomposition of variation for the integrated analysis of such data sets. The decomposition consists of three terms: a low-rank approximation capturing joint variation across data types, low-rank approximations for structured variation individual to each data type, and residual noise. JIVE quantifies the amount of joint variation between data types, reduces the dimensionality of the data and provides new directions for the visual exploration of joint and individual structures. The proposed method represents an extension of Principal Component Analysis and has clear advantages over popular two-block methods such as Canonical Correlation Analysis and Partial Least Squares. A JIVE analysis of gene expression and miRNA data on Glioblastoma Multiforme tumor samples reveals gene-miRNA associations and provides better characterization of tumor types. Data and software are available at https://genome.unc.edu/jive/Comment: Published in at http://dx.doi.org/10.1214/12-AOAS597 the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org

    Live Coding, Live Notation, Live Performance

    Get PDF
    This paper/demonstration explores relationships between code, notation including representation, visualisation and performance. Performative aspects of live coding activities are increasingly being investigated as the live coding movement continues to grow and develop. Although live instrumental performance is sometimes included as an accompaniment to live coding, it is often not a fully integrated part of the performance, relying on improvisation and/or basic indicative forms of notation with varying levels of sophistication and universality. Technologies are developing which enable the use of fully explicit music notations as well as more graphic ones, allowing more fully integrated systems of code in and as performance which can also include notations of arbitrary complexity. This itself allows the full skills of instrumental musicians to be utilised and synchronised in the process. This presentation/demonstration presents work and performances already undertaken with these technologies, including technologies for body sensing and data acquisition in the translation of the movements of dancers and musicians into synchronously performable notation, integrated by live and prepared coding. The author together with clarinetist Ian Mitchell present a short live performance utilising these techniques, discuss methods for the dissemination and interpretation of live generated notations and investigate how they take advantage of instrumental musicians’ training-related neuroplasticity skills

    Clinical outcome following acute ischaemic stroke relates to both activation and autoregulatory inhibition of cytokine production

    Get PDF
    BACKGROUND: As critical mediators of local and systemic inflammatory responses, cytokines are produced in the brain following ischaemic stroke. Some have been detected in the circulation of stroke patients, but their role and source is unclear. Focusing primarily on interleukin(IL)-1-related mechanisms, we serially measured plasma inflammatory markers, and the production of cytokines by whole blood, from 36 patients recruited within 12 h and followed up to 1 year after acute ischaemic stroke (AIS). RESULTS: Admission plasma IL-1 receptor antagonist (IL-1ra) concentration was elevated, relative to age-, sex-, and atherosclerosis-matched controls. IL-1β, soluble IL-1 receptor type II, tumour necrosis factor (TNF)-α, TNF-RII, IL-10 and leptin concentrations did not significantly differ from controls, but peak soluble TNF receptor type I (sTNF-RI) in the first week correlated strongly with computed tomography infarct volume at 5–7 days, mRS and BI at 3 and 12 months. Neopterin was raised in patients at 5–7 d, relative to controls, and in subjects with significant atherosclerosis. Spontaneous IL-1β, TNF-α and IL-6 gene and protein expression by blood cells was minimal, and induction of these cytokines by lipopolysaccharide (LPS) was significantly lower in patients than in controls during the first week. Minimum LPS-induced cytokine production correlated strongly with mRS and BI, and also with plasma cortisol. CONCLUSION: Absence of spontaneous whole blood gene activation or cytokine production suggests that peripheral blood cells are not the source of cytokines measured in plasma after AIS. Increased plasma IL-1ra within 12 h of AIS onset, the relationship between sTNF-RI and stroke severity, and suppressed cytokine induction suggests early activation of endogenous immunosuppressive mechanisms after AIS

    Improving statistical inference on pathogen densities estimated by quantitative molecular methods: malaria gametocytaemia as a case study

    Get PDF
    BACKGROUND: Quantitative molecular methods (QMMs) such as quantitative real-time polymerase chain reaction (q-PCR), reverse-transcriptase PCR (qRT-PCR) and quantitative nucleic acid sequence-based amplification (QT-NASBA) are increasingly used to estimate pathogen density in a variety of clinical and epidemiological contexts. These methods are often classified as semi-quantitative, yet estimates of reliability or sensitivity are seldom reported. Here, a statistical framework is developed for assessing the reliability (uncertainty) of pathogen densities estimated using QMMs and the associated diagnostic sensitivity. The method is illustrated with quantification of Plasmodium falciparum gametocytaemia by QT-NASBA. RESULTS: The reliability of pathogen (e.g. gametocyte) densities, and the accompanying diagnostic sensitivity, estimated by two contrasting statistical calibration techniques, are compared; a traditional method and a mixed model Bayesian approach. The latter accounts for statistical dependence of QMM assays run under identical laboratory protocols and permits structural modelling of experimental measurements, allowing precision to vary with pathogen density. Traditional calibration cannot account for inter-assay variability arising from imperfect QMMs and generates estimates of pathogen density that have poor reliability, are variable among assays and inaccurately reflect diagnostic sensitivity. The Bayesian mixed model approach assimilates information from replica QMM assays, improving reliability and inter-assay homogeneity, providing an accurate appraisal of quantitative and diagnostic performance. CONCLUSIONS: Bayesian mixed model statistical calibration supersedes traditional techniques in the context of QMM-derived estimates of pathogen density, offering the potential to improve substantially the depth and quality of clinical and epidemiological inference for a wide variety of pathogens

    Tumor innate immunity primed by specific interferon-stimulated endogenous retroviruses.

    Get PDF
    Mesenchymal tumor subpopulations secrete pro-tumorigenic cytokines and promote treatment resistance1-4. This phenomenon has been implicated in chemorefractory small cell lung cancer and resistance to targeted therapies5-8, but remains incompletely defined. Here, we identify a subclass of endogenous retroviruses (ERVs) that engages innate immune signaling in these cells. Stimulated 3 prime antisense retroviral coding sequences (SPARCS) are oriented inversely in 3' untranslated regions of specific genes enriched for regulation by STAT1 and EZH2. Derepression of these loci results in double-stranded RNA generation following IFN-γ exposure due to bi-directional transcription from the STAT1-activated gene promoter and the 5' long terminal repeat of the antisense ERV. Engagement of MAVS and STING activates downstream TBK1, IRF3, and STAT1 signaling, sustaining a positive feedback loop. SPARCS induction in human tumors is tightly associated with major histocompatibility complex class 1 expression, mesenchymal markers, and downregulation of chromatin modifying enzymes, including EZH2. Analysis of cell lines with high inducible SPARCS expression reveals strong association with an AXL/MET-positive mesenchymal cell state. While SPARCS-high tumors are immune infiltrated, they also exhibit multiple features of an immune-suppressed microenviroment. Together, these data unveil a subclass of ERVs whose derepression triggers pathologic innate immune signaling in cancer, with important implications for cancer immunotherapy

    Breast cancer PAM50 signature: Correlation and concordance between RNA-Seq and digital multiplexed gene expression technologies in a triple negative breast cancer series

    Get PDF
    Background: Full RNA-Seq is a fundamental research tool for whole transcriptome analysis. However, it is too costly and time consuming to be used in routine clinical practice. We evaluated the transcript quantification agreement between RNA-Seq and a digital multiplexed gene expression platform, and the subtype call after running the PAM50 assay in a series of breast cancer patients classified as triple negative by IHC/FISH. The goal of this study is to analyze the concordance between both expression platforms overall, and for calling PAM50 triple negative breast cancer intrinsic subtypes in particular. Results: The analyses were performed in paraffin-embedded tissues from 96 patients recruited in a multicenter, prospective, non-randomized neoadjuvant triple negative breast cancer trial (NCT01560663). Pre-treatment core biopsies were obtained following clinical practice guidelines and conserved as FFPE for further RNA extraction. PAM50 was performed on both digital multiplexed gene expression and RNA-Seq platforms. Subtype assignment was based on the nearest centroid classification following this procedure for both platforms and it was concordant on 96% of the cases (N = 96). In four cases, digital multiplexed gene expression analysis and RNA-Seq were discordant. The Spearman correlation to each of the centroids and the risk of recurrence were above 0.89 in both platforms while the agreement on Proliferation Score reached up to 0.97. In addition, 82% of the individual PAM50 genes showed a correlation coefficient > 0.80. Conclusions: In our analysis, the subtype calling in most of the samples was concordant in both platforms and the potential discordances had reduced clinical implications in terms of prognosis. If speed and cost are the main driving forces then the preferred technique is the digital multiplexed platform, while if whole genome patterns and subtype are the driving forces, then RNA-Seq is the preferred method

    An Integrated TCGA Pan-Cancer Clinical Data Resource to Drive High-Quality Survival Outcome Analytics

    Get PDF
    For a decade, The Cancer Genome Atlas (TCGA) program collected clinicopathologic annotation data along with multi-platform molecular profiles of more than 11,000 human tumors across 33 different cancer types. TCGA clinical data contain key features representing the democratized nature of the data collection process. To ensure proper use of this large clinical dataset associated with genomic features, we developed a standardized dataset named the TCGA Pan-Cancer Clinical Data Resource (TCGA-CDR), which includes four major clinical outcome endpoints. In addition to detailing major challenges and statistical limitations encountered during the effort of integrating the acquired clinical data, we present a summary that includes endpoint usage recommendations for each cancer type. These TCGA-CDR findings appear to be consistent with cancer genomics studies independent of the TCGA effort and provide opportunities for investigating cancer biology using clinical correlates at an unprecedented scale. Analysis of clinicopathologic annotations for over 11,000 cancer patients in the TCGA program leads to the generation of TCGA Clinical Data Resource, which provides recommendations of clinical outcome endpoint usage for 33 cancer types

    A UV-to-NIR Study of Molecular Gas in the Dust Cavity around RY Lupi

    Get PDF
    We present a study of molecular gas in the inner disk (r ∼ 0.4± 0.1 au; {r(narrow,H₂)} ∼ 3± 2 au). The 4.7 μm ¹²CO emission lines are also well fit by two-component profiles ( {{r}broad,CO} =0.4± 0.1 au; {{r}narrow,CO} =15± 2 au). We combine these results with 10 μm observations to form a picture of gapped structure within the mm-imaged dust cavity, providing the first such overview of the inner regions of a young disk. The HST SED of RY Lupi is available online for use in modeling efforts
    corecore