18 research outputs found

    A review of estimation of distribution algorithms in bioinformatics

    Get PDF
    Evolutionary search algorithms have become an essential asset in the algorithmic toolbox for solving high-dimensional optimization problems in across a broad range of bioinformatics problems. Genetic algorithms, the most well-known and representative evolutionary search technique, have been the subject of the major part of such applications. Estimation of distribution algorithms (EDAs) offer a novel evolutionary paradigm that constitutes a natural and attractive alternative to genetic algorithms. They make use of a probabilistic model, learnt from the promising solutions, to guide the search process. In this paper, we set out a basic taxonomy of EDA techniques, underlining the nature and complexity of the probabilistic model of each EDA variant. We review a set of innovative works that make use of EDA techniques to solve challenging bioinformatics problems, emphasizing the EDA paradigm's potential for further research in this domain

    Validating module network learning algorithms using simulated data

    Get PDF
    In recent years, several authors have used probabilistic graphical models to learn expression modules and their regulatory programs from gene expression data. Here, we demonstrate the use of the synthetic data generator SynTReN for the purpose of testing and comparing module network learning algorithms. We introduce a software package for learning module networks, called LeMoNe, which incorporates a novel strategy for learning regulatory programs. Novelties include the use of a bottom-up Bayesian hierarchical clustering to construct the regulatory programs, and the use of a conditional entropy measure to assign regulators to the regulation program nodes. Using SynTReN data, we test the performance of LeMoNe in a completely controlled situation and assess the effect of the methodological changes we made with respect to an existing software package, namely Genomica. Additionally, we assess the effect of various parameters, such as the size of the data set and the amount of noise, on the inference performance. Overall, application of Genomica and LeMoNe to simulated data sets gave comparable results. However, LeMoNe offers some advantages, one of them being that the learning process is considerably faster for larger data sets. Additionally, we show that the location of the regulators in the LeMoNe regulation programs and their conditional entropy may be used to prioritize regulators for functional validation, and that the combination of the bottom-up clustering strategy with the conditional entropy-based assignment of regulators improves the handling of missing or hidden regulators.Comment: 13 pages, 6 figures + 2 pages, 2 figures supplementary informatio

    Stabilization of cytokine mRNAs in iNKT cells requires the serine-threonine kinase IRE1alpha

    Get PDF
    Activated invariant natural killer T (iNKT) cells rapidly produce large amounts of cytokines, but how cytokine mRNAs are induced, stabilized and mobilized following iNKT activation is still unclear. Here we show that an endoplasmic reticulum stress sensor, inositol-requiring enzyme 1α (IRE1α), links key cellular processes required for iNKT cell effector functions in specific iNKT subsets, in which TCR-dependent activation of IRE1α is associated with downstream activation of p38 MAPK and the stabilization of preformed cytokine mRNAs. Importantly, genetic deletion of IRE1α in iNKT cells reduces cytokine production and protects mice from oxazolone colitis. We therefore propose that an IRE1α-dependent signaling cascade couples constitutive cytokine mRNA expression to the rapid induction of cytokine secretion and effector functions in activated iNKT cells

    NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms

    Get PDF
    One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available

    High dimensional profiling identifies specific immune types along the recovery trajectories of critically ill COVID19 patients

    No full text
    The COVID-19 pandemic poses a major burden on healthcare and economic systems across the globe. Even though a majority of the population develops only minor symptoms upon SARS-CoV-2 infection, a significant number are hospitalized at intensive care units (ICU) requiring critical care. While insights into the early stages of the disease are rapidly expanding, the dynamic immunological processes occurring in critically ill patients throughout their recovery at ICU are far less understood. Here, we have analysed whole blood samples serially collected from 40 surviving COVID-19 patients throughout their recovery in ICU using high-dimensional cytometry by time-of-flight (CyTOF) and cytokine multiplexing. Based on the neutrophil-to-lymphocyte ratio (NLR), we defined four sequential immunotypes during recovery that correlated to various clinical parameters, including the level of respiratory support at concomitant sampling times. We identified classical monocytes as the first immune cell type to recover by restoration of HLA-DR-positivity and the reduction of immunosuppressive CD163 + monocytes, followed by the recovery of CD8 + and CD4 + T cell and non-classical monocyte populations. The identified immunotypes also correlated to aberrant cytokine and acute-phase reactant levels. Finally, integrative analysis of cytokines and immune cell profiles showed a shift from an initially dysregulated immune response to a more coordinated immunogenic interplay, highlighting the importance of longitudinal sampling to understand the pathophysiology underlying recovery from severe COVID-19

    Single-cell molecular profiling using ex vivo functional readouts fuels precision oncology in glioblastoma

    No full text
    Background Functional profiling of freshly isolated glioblastoma (GBM) cells is being evaluated as a next-generation method for precision oncology. While promising, its success largely depends on the method to evaluate treatment activity which requires sufficient resolution and specificity. Methods Here, we describe the 'precision oncology by single-cell profiling using ex vivoreadouts of functionality' (PROSPERO) assay to evaluate the intrinsic susceptibility of high-grade brain tumor cells to respond to therapy. Different from other assays, PROSPERO extends beyond life/death screening by rapidly evaluating acute molecular drug responses at single-cell resolution. Results The PROSPERO assay was developed by correlating short-term single-cell molecular signatures using mass cytometry by time-of-flight (CyTOF) to long-term cytotoxicity readouts in representative patient-derived glioblastoma cell cultures (n = 14) that were exposed to radiotherapy and the small-molecule p53/MDM2 inhibitor AMG232. The predictive model was subsequently projected to evaluate drug activity in freshly resected GBM samples from patients (n = 34). Here, PROSPERO revealed an overall limited capacity of tumor cells to respond to therapy, as reflected by the inability to induce key molecular markers upon ex vivo treatment exposure, while retaining proliferative capacity, insights that were validated in patient-derived xenograft (PDX) models. This approach also allowed the investigation of cellular plasticity, which in PDCLs highlighted therapy-induced proneural-to-mesenchymal (PMT) transitions, while in patients' samples this was more heterogeneous. Conclusion PROSPERO provides a precise way to evaluate therapy efficacy by measuring molecular drug responses using specific biomarker changes in freshly resected brain tumor samples, in addition to providing key functional insights in cellular behavior, which may ultimately complement standard, clinical biomarker evaluations

    Computational flow cytometry: helping to make sense of high-dimensional immunology data

    No full text
    Recent advances in flow cytometry allow scientists to measure an increasing number of parameters per cell, generating huge and high-dimensional datasets. To analyse, visualize and interpret these data, newly available computational techniques should be adopted, evaluated and improved upon by the immunological community. Computational flow cytometry is emerging as an important new field at the intersection of immunology and computational biology; it allows new biological knowledge to be extracted from high-throughput single-cell data. This Review provides non-experts with a broad and practical overview of the many recent developments in computational flow cytometry
    corecore