18 research outputs found
A review of estimation of distribution algorithms in bioinformatics
Evolutionary search algorithms have become an essential asset in the algorithmic toolbox for solving high-dimensional optimization problems in across a broad range of bioinformatics problems. Genetic algorithms, the most well-known and representative evolutionary search technique, have been the subject of the major part of such applications. Estimation of distribution algorithms (EDAs) offer a novel evolutionary paradigm that constitutes a natural and attractive alternative to genetic algorithms. They make use of a probabilistic model, learnt from the promising solutions, to guide the search process. In this paper, we set out a basic taxonomy of EDA techniques, underlining the nature and complexity of the probabilistic model of each EDA variant. We review a set of innovative works that make use of EDA techniques to solve challenging bioinformatics problems, emphasizing the EDA paradigm's potential for further research in this domain
Validating module network learning algorithms using simulated data
In recent years, several authors have used probabilistic graphical models to
learn expression modules and their regulatory programs from gene expression
data. Here, we demonstrate the use of the synthetic data generator SynTReN for
the purpose of testing and comparing module network learning algorithms. We
introduce a software package for learning module networks, called LeMoNe, which
incorporates a novel strategy for learning regulatory programs. Novelties
include the use of a bottom-up Bayesian hierarchical clustering to construct
the regulatory programs, and the use of a conditional entropy measure to assign
regulators to the regulation program nodes. Using SynTReN data, we test the
performance of LeMoNe in a completely controlled situation and assess the
effect of the methodological changes we made with respect to an existing
software package, namely Genomica. Additionally, we assess the effect of
various parameters, such as the size of the data set and the amount of noise,
on the inference performance. Overall, application of Genomica and LeMoNe to
simulated data sets gave comparable results. However, LeMoNe offers some
advantages, one of them being that the learning process is considerably faster
for larger data sets. Additionally, we show that the location of the regulators
in the LeMoNe regulation programs and their conditional entropy may be used to
prioritize regulators for functional validation, and that the combination of
the bottom-up clustering strategy with the conditional entropy-based assignment
of regulators improves the handling of missing or hidden regulators.Comment: 13 pages, 6 figures + 2 pages, 2 figures supplementary informatio
Stabilization of cytokine mRNAs in iNKT cells requires the serine-threonine kinase IRE1alpha
Activated invariant natural killer T (iNKT) cells rapidly produce large amounts of cytokines, but how cytokine mRNAs are induced, stabilized and mobilized following iNKT activation is still unclear. Here we show that an endoplasmic reticulum stress sensor, inositol-requiring enzyme 1α (IRE1α), links key cellular processes required for iNKT cell effector functions in specific iNKT subsets, in which TCR-dependent activation of IRE1α is associated with downstream activation of p38 MAPK and the stabilization of preformed cytokine mRNAs. Importantly, genetic deletion of IRE1α in iNKT cells reduces cytokine production and protects mice from oxazolone colitis. We therefore propose that an IRE1α-dependent signaling cascade couples constitutive cytokine mRNA expression to the rapid induction of cytokine secretion and effector functions in activated iNKT cells
NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms
One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available
High dimensional profiling identifies specific immune types along the recovery trajectories of critically ill COVID19 patients
The COVID-19 pandemic poses a major burden on healthcare and economic systems across the globe. Even though a majority of the population develops only minor symptoms upon SARS-CoV-2 infection, a significant number are hospitalized at intensive care units (ICU) requiring critical care. While insights into the early stages of the disease are rapidly expanding, the dynamic immunological processes occurring in critically ill patients throughout their recovery at ICU are far less understood. Here, we have analysed whole blood samples serially collected from 40 surviving COVID-19 patients throughout their recovery in ICU using high-dimensional cytometry by time-of-flight (CyTOF) and cytokine multiplexing. Based on the neutrophil-to-lymphocyte ratio (NLR), we defined four sequential immunotypes during recovery that correlated to various clinical parameters, including the level of respiratory support at concomitant sampling times. We identified classical monocytes as the first immune cell type to recover by restoration of HLA-DR-positivity and the reduction of immunosuppressive CD163 + monocytes, followed by the recovery of CD8 + and CD4 + T cell and non-classical monocyte populations. The identified immunotypes also correlated to aberrant cytokine and acute-phase reactant levels. Finally, integrative analysis of cytokines and immune cell profiles showed a shift from an initially dysregulated immune response to a more coordinated immunogenic interplay, highlighting the importance of longitudinal sampling to understand the pathophysiology underlying recovery from severe COVID-19
Single-cell molecular profiling using ex vivo functional readouts fuels precision oncology in glioblastoma
Background
Functional profiling of freshly isolated glioblastoma (GBM) cells is being evaluated as a next-generation method for precision oncology. While promising, its success largely depends on the method to evaluate treatment activity which requires sufficient resolution and specificity.
Methods
Here, we describe the 'precision oncology by single-cell profiling using ex vivoreadouts of functionality' (PROSPERO) assay to evaluate the intrinsic susceptibility of high-grade brain tumor cells to respond to therapy. Different from other assays, PROSPERO extends beyond life/death screening by rapidly evaluating acute molecular drug responses at single-cell resolution.
Results
The PROSPERO assay was developed by correlating short-term single-cell molecular signatures using mass cytometry by time-of-flight (CyTOF) to long-term cytotoxicity readouts in representative patient-derived glioblastoma cell cultures (n = 14) that were exposed to radiotherapy and the small-molecule p53/MDM2 inhibitor AMG232. The predictive model was subsequently projected to evaluate drug activity in freshly resected GBM samples from patients (n = 34). Here, PROSPERO revealed an overall limited capacity of tumor cells to respond to therapy, as reflected by the inability to induce key molecular markers upon ex vivo treatment exposure, while retaining proliferative capacity, insights that were validated in patient-derived xenograft (PDX) models. This approach also allowed the investigation of cellular plasticity, which in PDCLs highlighted therapy-induced proneural-to-mesenchymal (PMT) transitions, while in patients' samples this was more heterogeneous.
Conclusion
PROSPERO provides a precise way to evaluate therapy efficacy by measuring molecular drug responses using specific biomarker changes in freshly resected brain tumor samples, in addition to providing key functional insights in cellular behavior, which may ultimately complement standard, clinical biomarker evaluations
Author Correction: A human immune dysregulation syndrome characterized by severe hyperinflammation with a homozygous nonsense Roquin-1 mutation (Nature Communications, (2019), 10, 1, (4779), 10.1038/s41467-019-12704-6)
The original version of the Supplementary Information associated with this Article included an incorrect Supplementary Information file, in which only the first page of the file was included. The HTML has been updated to include a corrected and complete version of the Supplementary Information file
Computational flow cytometry: helping to make sense of high-dimensional immunology data
Recent advances in flow cytometry allow scientists to measure an increasing number of parameters per cell, generating huge and high-dimensional datasets. To analyse, visualize and interpret these data, newly available computational techniques should be adopted, evaluated and improved upon by the immunological community. Computational flow cytometry is emerging as an important new field at the intersection of immunology and computational biology; it allows new biological knowledge to be extracted from high-throughput single-cell data. This Review provides non-experts with a broad and practical overview of the many recent developments in computational flow cytometry