90 research outputs found

    Exploration of large molecular datasets using global gene networks : computational methods and tools

    Get PDF
    Defining gene expression profiles and mapping complex interactions between molecular regulators and proteins is a key for understanding biological processes and the functional properties of cells, which is therefore, the focus on numerous experimental studies. Small-scale biochemical analyses deliver high-quality data, but lack coverage, whereas high throughput sequencing reveals thousands of interactions which can be error-prone and require proper computational methods to discover true relations. Furthermore, all these approaches usually focus on one type of interaction at a time. This makes experimental mapping of the genome-wide network a cost and time-intensive procedure. In the first part of the thesis, I present the developed network analysis tools for exploring large- scale datasets in the context of a global network of functional coupling. Paper I introduces NEArender, a method for performing pathway analysis and determines the relations between gene sets using a global network. Traditionally, pathway analysis did not consider network relations, thereby covering a minor part of the whole picture. Placing the gene sets in the context of a network provides additional information for pathway analysis, which reveals a more comprehensive picture. Paper II presents EviNet, a user-friendly web interface for using NEArender algorithm. The user can either input gene lists or manage and integrate highly complex experimental designs via the interactive Venn diagram-based interface. The web resource provides access to biological networks and pathways from multiple public or users’ own resources. The analysis typically takes seconds or minutes, and the results are presented in a graphic and tabular format. Paper III describes NEAmarker, a method to predict anti-cancer drug targets from enrichment scores calculated by NEArender, thus presenting a practical usage of network enrichment tool. The method can integrate data from multiple omics platforms to model drug sensitivity with enrichment variables. In parallel, alternative methods for pathway enrichment analysis were benchmarked in the paper. The second part of the thesis is focused on identifying spatial and temporal mechanisms that govern the formation of neural cell diversity in the developing brain. High-throughput platforms for RNA- and ChIP-sequencing were applied to provide data for studying the underlying biological hypothesis at the genome-wide scale. In Paper IV, I defined the role of the transcription factor Foxa2 during the specification and differentiation of floor plate cells of the ventral neural tube. By RNA-seq analyses of Foxa2-/- cells, a large set of candidate genes involved in floor plate differentiation were identified. Analysis of Foxa2 ChIP-seq dataset suggested that Foxa2 directly regulated more than 250 genes expressed by the floor plate and identified Rfx4 and Ascl1 as co-regulators of many floor plate genes. Experimental studies suggested a cooperative activator function for Foxa2 and Rfx4 and a suppressive role for Ascl1 in spatially constraining floor plate induction. Paper V addresses how time is measured during sequential specification of neurons from multipotent progenitor cells during the development of ventral hindbrain. An underlying timer circuitry which leads to the sequential generation of motor neurons and serotonergic neurons has been identified by integrating experimental and computational data modeling

    Whole-genome informed circulating tumor DNA analysis by multiplex digital PCR for disease monitoring in B-cell lymphomas: a proof-of-concept study

    Get PDF
    IntroductionAnalyzing liquid biopsies for tumor-specific aberrations can facilitate detection of measurable residual disease (MRD) during treatment and at follow-up. In this study, we assessed the clinical potential of using whole-genome sequencing (WGS) of lymphomas at diagnosis to identify patient-specific structural (SVs) and single nucleotide variants (SNVs) to enable longitudinal, multi-targeted droplet digital PCR analysis (ddPCR) of cell-free DNA (cfDNA).MethodsIn 9 patients with B-cell lymphoma (diffuse large B-cell lymphoma and follicular lymphoma), comprehensive genomic profiling at diagnosis was performed by 30X WGS of paired tumor and normal specimens. Patient-specific multiplex ddPCR (m-ddPCR) assays were designed for simultaneous detection of multiple SNVs, indels and/or SVs, with a detection sensitivity of 0.0025% for SV assays and 0.02% for SNVs/indel assays. M-ddPCR was applied to analyze cfDNA isolated from serially collected plasma at clinically critical timepoints during primary and/or relapse treatment and at follow-up.ResultsA total of 164 SNVs/indels were identified by WGS including 30 variants known to be functionally relevant in lymphoma pathogenesis. The most frequently mutated genes included KMT2D, PIM1, SOCS1 and BCL2. WGS analysis further identified recurrent SVs including t(14;18)(q32;q21) (IGH::BCL2), and t(6;14)(p25;q32) (IGH::IRF4). Plasma analysis at diagnosis showed positive circulating tumor DNA (ctDNA) levels in 88% of patients and the ctDNA burden correlated with baseline clinical parameters (LDH and sedimentation rate, p-value <0.01). While clearance of ctDNA levels after primary treatment cycle 1 was observed in 3/6 patients, all patients analyzed at final evaluation of primary treatment showed negative ctDNA, hence correlating with PET-CT imaging. One patient with positive ctDNA at interim also displayed detectable ctDNA (average variant allele frequency (VAF) 6.9%) in the follow-up plasma sample collected 2 years after final evaluation of primary treatment and 25 weeks before clinical manifestation of relapse.ConclusionIn summary, we demonstrate that multi-targeted cfDNA analysis, using a combination of SNVs/indels and SVs candidates identified by WGS analysis, provides a sensitive tool for MRD monitoring and can detect lymphoma relapse earlier than clinical manifestation

    Additional file 1: Figure S1. of NEArender: an R package for functional interpretation of ‘omics’ data via network enrichment analysis

    No full text
    Sensitivity and sources of bias in randomization-based versus binomial calculation of network enrichment. Figure S2. Rank correlation coefficients between results of differential expression analysis on different sample pairs and groups (Additional file 2). Figure S3. Agreement between biological replicates in alternative approaches to differential expression analysis. (DOCX 1.3 mb

    Prediction of response to anti-cancer drugs becomes robust via network integration of molecular data

    No full text
    Abstract Despite the widening range of high-throughput platforms and exponential growth of generated data volume, the validation of biomarkers discovered from large-scale data remains a challenging field. In order to tackle cancer heterogeneity and comply with the data dimensionality, a number of network and pathway approaches were invented but rarely systematically applied to this task. We propose a new method, called NEAmarker, for finding sensitive and robust biomarkers at the pathway level. scores from network enrichment analysis transform the original space of altered genes into a lower-dimensional space of pathways. These dimensions are then correlated with phenotype variables. The method was first tested using in vitro data from three anti-cancer drug screens and then on clinical data of The Cancer Genome Atlas. It proved superior to the single-gene and alternative enrichment analyses in terms of (1) universal applicability to different data types with a possibility of cross-platform integration, (2) consistency of the discovered correlates between independent drug screens, and (3) ability to explain differential survival of treated patients. Our new screen of anti-cancer compounds validated the performance of multivariate models of drug sensitivity. The previously proposed methods of enrichment analysis could achieve comparable levels of performance in certain tests. However, only our method could discover predictors of both in vitro response and patient survival given administration of the same drug

    A Shh/Gli-driven three-node timer motif controls temporal identity and fate of neural stem cells

    No full text
    How time is measured by neural stem cells during temporal neurogenesis has remained unresolved. By combining experiments and computational modeling, we define a Shh/Gli-driven three-node timer underlying the sequential generation of motor neurons (MNs) and serotonergic neurons in the brainstem. The timer is founded on temporal decline of Gli-activator and Gli-repressor activities established through down-regulation of Gli transcription. The circuitry conforms an incoherent feed-forward loop, whereby Gli proteins not only promote expression of Phox2b and thereby MN-fate but also account for a delayed activation of a self-promoting transforming growth factor–β (Tgfβ) node triggering a fate switch by repressing Phox2b. Hysteresis and spatial averaging by diffusion of Tgfβ counteract noise and increase temporal accuracy at the population level, providing a functional rationale for the intrinsically programmed activation of extrinsic switch signals in temporal patterning. Our study defines how time is reliably encoded during the sequential specification of neurons.ISSN:2375-254

    A Shh/Gli-driven three-node timer motif controls temporal identity and fate of neural stem cells

    No full text
    How time is measured by neural stem cells during temporal neurogenesis has remained unresolved. By combining experiments and computational modelling, we here define a Shh/Gli-driven three-node timer underlying the sequential generation of motor neurons (MNs) and serotonergic neurons in the brainstem. The timer is founded on temporal decline of Gli-activator and Gli-repressor activities established through downregulation of Gli transcription. The circuitry conforms an incoherent feedforward loop, whereby Gli proteins promote expression of Phox2b and thereby MN-fate, but also account for a delayed activation of a self-promoting Tgfβ-node triggering a fate switch by repressing Phox2b. Hysteresis and spatial averaging by diffusion of Tgfβ counteracts noise and increases temporal accuracy at the population level. Our study defines how time is reliably encoded during the sequential specification of neurons

    Sensitive Detection of Cell-Free Tumour DNA Using Optimised Targeted Sequencing Can Predict Prognosis in Gastro-Oesophageal Cancer

    No full text
    In this longitudinal study, cell-free tumour DNA (a liquid biopsy) from plasma was explored as a prognostic biomarker for gastro-oesophageal cancer. Both tumour-informed and tumour-agnostic approaches for plasma variant filtering were evaluated in 47 participants. This was possible through sequencing of DNA from tissue biopsies from all participants and cell-free DNA from plasma sampled before and after surgery (n = 42), as well as DNA from white blood cells (n = 21) using a custom gene panel with and without unique molecular identifiers (UMIs). A subset of the plasma samples (n = 12) was also assayed with targeted droplet digital PCR (ddPCR). In 17/31 (55%) diagnostic plasma samples, tissue-verified cancer-associated variants could be detected by the gene panel. In the tumour-agnostic approach, 26 participants (59%) had cancer-associated variants, and UMIs were necessary to filter the true variants from the technical artefacts. Additionally, clonal haematopoietic variants could be excluded using the matched white blood cells or follow-up plasma samples. ddPCR detected its targets in 10/12 (83%) and provided an ultra-sensitive method for follow-up. Detectable cancer-associated variants in plasma correlated to a shorter overall survival and shorter time to progression, with a significant correlation for the tumour-informed approaches. In summary, liquid biopsy gene panel sequencing using a tumour-agnostic approach can be applied to all patients regardless of the presence of a tissue biopsy, although this requires UMIs and the exclusion of clonal haematopoietic variants. However, if sequencing data from tumour biopsies are available, a tumour-informed approach improves the value of cell-free tumour DNA as a negative prognostic biomarker in gastro-oesophageal cancer patients
    corecore