318 research outputs found

    Regression with Linear Factored Functions

    Full text link
    Many applications that use empirically estimated functions face a curse of dimensionality, because the integrals over most function classes must be approximated by sampling. This paper introduces a novel regression-algorithm that learns linear factored functions (LFF). This class of functions has structural properties that allow to analytically solve certain integrals and to calculate point-wise products. Applications like belief propagation and reinforcement learning can exploit these properties to break the curse and speed up computation. We derive a regularized greedy optimization scheme, that learns factored basis functions during training. The novel regression algorithm performs competitively to Gaussian processes on benchmark tasks, and the learned LFF functions are with 4-9 factored basis functions on average very compact.Comment: Under review as conference paper at ECML/PKDD 201

    Predicting Fluid Intelligence of Children using T1-weighted MR Images and a StackNet

    Full text link
    In this work, we utilize T1-weighted MR images and StackNet to predict fluid intelligence in adolescents. Our framework includes feature extraction, feature normalization, feature denoising, feature selection, training a StackNet, and predicting fluid intelligence. The extracted feature is the distribution of different brain tissues in different brain parcellation regions. The proposed StackNet consists of three layers and 11 models. Each layer uses the predictions from all previous layers including the input layer. The proposed StackNet is tested on a public benchmark Adolescent Brain Cognitive Development Neurocognitive Prediction Challenge 2019 and achieves a mean squared error of 82.42 on the combined training and validation set with 10-fold cross-validation. In addition, the proposed StackNet also achieves a mean squared error of 94.25 on the testing data. The source code is available on GitHub.Comment: 8 pages, 2 figures, 3 tables, Accepted by MICCAI ABCD-NP Challenge 2019; Added ND

    ABCD Neurocognitive Prediction Challenge 2019: Predicting individual fluid intelligence scores from structural MRI using probabilistic segmentation and kernel ridge regression

    Get PDF
    We applied several regression and deep learning methods to predict fluid intelligence scores from T1-weighted MRI scans as part of the ABCD Neurocognitive Prediction Challenge (ABCD-NP-Challenge) 2019. We used voxel intensities and probabilistic tissue-type labels derived from these as features to train the models. The best predictive performance (lowest mean-squared error) came from Kernel Ridge Regression (KRR; λ=10\lambda=10), which produced a mean-squared error of 69.7204 on the validation set and 92.1298 on the test set. This placed our group in the fifth position on the validation leader board and first place on the final (test) leader board.Comment: Winning entry in the ABCD Neurocognitive Prediction Challenge at MICCAI 2019. 7 pages plus references, 3 figures, 1 tabl

    Reinforcement learning or active inference?

    Get PDF
    This paper questions the need for reinforcement learning or control theory when optimising behaviour. We show that it is fairly simple to teach an agent complicated and adaptive behaviours using a free-energy formulation of perception. In this formulation, agents adjust their internal states and sampling of the environment to minimize their free-energy. Such agents learn causal structure in the environment and sample it in an adaptive and self-supervised fashion. This results in behavioural policies that reproduce those optimised by reinforcement learning and dynamic programming. Critically, we do not need to invoke the notion of reward, value or utility. We illustrate these points by solving a benchmark problem in dynamic programming; namely the mountain-car problem, using active perception or inference under the free-energy principle. The ensuing proof-of-concept may be important because the free-energy formulation furnishes a unified account of both action and perception and may speak to a reappraisal of the role of dopamine in the brain

    Effect of fulvic acids on lead-induced oxidative stress to metal sensitive Vicia faba L. plant

    Get PDF
    Lead (Pb) is a ubiquitous environmental pollutant capable to induce various morphological, physiological, and biochemical functions in plants. Only few publications focus on the influence of Pb speciation both on its phytoavailability and phytotoxicity. Therefore, Pb toxicity (in terms of lipid peroxidation, hydrogen peroxide induction, and photosynthetic pigments contents) was studied in Vicia faba plants in relation with Pb uptake and speciation. V. faba seedlings were exposed to Pb supplied as Pb(NO3)2 or complexed by two fulvic acids (FAs), i.e. Suwannee River fulvic acid (SRFA) and Elliott Soil fulvic acid (ESFA), for 1, 12, and 24 h under controlled hydroponic conditions. For both FAs, Pb uptake and translocation by Vicia faba increased at low level (5 mg l−1), whereas decreased at high level of application (25 mg l−1). Despite the increased Pb uptake with FAs at low concentrations, there was no influence on the Pb toxicity to the plants. However, at high concentrations, FAs reduced Pb toxicity by reducing its uptake. These results highlighted the role of the dilution factor for FAs reactivity in relation with structure; SRFA was more effective than ESFA in reducing Pb uptake and alleviating Pb toxicity to V. faba due to comparatively strong binding affinity for the heavy metal

    Evaluation of O2PLS in Omics data integration

    Get PDF
    Background: Rapid computational and technological developments made large amounts of omics data available in different biological levels. It is becoming clear that simultaneous data analysis methods are needed for better interpretation and understanding of the underlying systems biology. Different methods have been proposed for this task, among them Partial Least Squares (PLS) related methods. To also deal with orthogonal variation, systematic variation in the data unrelated to one another, we consider the Two-way Orthogonal PLS (O2PLS): an integrative data analysis method which is capable of modeling systematic variation, while providing more parsimonious models aiding interpretation. Results: A simulation study to assess the performance of O2PLS showed positive results in both low and higher dimensions. More noise (50 % of the data) only affected the systematic part estimates. A data analysis was conducted using data on metabolomics and transcriptomics from a large Finnish cohort (DILGOM). A previous sequential study, using the same data, showed significant correlations between the Lipo-Leukocyte (LL) module and lipoprotein metabolites. The O2PLS results were in agreement with these findings, identifying almost the same set of co-varying variables. Moreover, our integrative approach identified other associative genes and metabolites, while taking into account systematic variation in the data. Including orthogonal components enhanced overall fit, but the orthogonal variation was difficult to interpret. Conclusions: Simulations showed that the O2PLS estimates were close to the true parameters in both low and higher dimensions. In the presence of more noise (50 %), the orthogonal part estimates could not distinguish well between joint and unique variation. The joint estimates were not systematically affected. Simultaneous analysis with O2PLS on metabolome and transcriptome data showed that the LL module, together with VLDL and HDL metabolites, were important for the metabolomic and transcriptomic relation. This is in agreement with an earlier study. In addition more gene expression and metabolites are identified being important for the joint covariation

    Predictive gene lists for breast cancer prognosis: A topographic visualisation study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The controversy surrounding the non-uniqueness of predictive gene lists (PGL) of small selected subsets of genes from very large potential candidates as available in DNA microarray experiments is now widely acknowledged <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Many of these studies have focused on constructing discriminative semi-parametric models and as such are also subject to the issue of random correlations of sparse model selection in high dimensional spaces. In this work we outline a different approach based around an unsupervised patient-specific nonlinear topographic projection in predictive gene lists.</p> <p>Methods</p> <p>We construct nonlinear topographic projection maps based on inter-patient gene-list relative dissimilarities. The Neuroscale, the Stochastic Neighbor Embedding(SNE) and the Locally Linear Embedding(LLE) techniques have been used to construct two-dimensional projective visualisation plots of 70 dimensional PGLs per patient, classifiers are also constructed to identify the prognosis indicator of each patient using the resulting projections from those visualisation techniques and investigate whether <it>a-posteriori </it>two prognosis groups are separable on the evidence of the gene lists.</p> <p>A literature-proposed predictive gene list for breast cancer is benchmarked against a separate gene list using the above methods. Generalisation ability is investigated by using the mapping capability of Neuroscale to visualise the follow-up study, but based on the projections derived from the original dataset.</p> <p>Results</p> <p>The results indicate that small subsets of patient-specific PGLs have insufficient prognostic dissimilarity to permit a distinction between two prognosis patients. Uncertainty and diversity across multiple gene expressions prevents unambiguous or even confident patient grouping. Comparative projections across different PGLs provide similar results.</p> <p>Conclusion</p> <p>The random correlation effect to an arbitrary outcome induced by small subset selection from very high dimensional interrelated gene expression profiles leads to an outcome with associated uncertainty. This continuum and uncertainty precludes any attempts at constructing discriminative classifiers.</p> <p>However a patient's gene expression profile could possibly be used in treatment planning, based on knowledge of other patients' responses.</p> <p>We conclude that many of the patients involved in such medical studies are <it>intrinsically unclassifiable </it>on the basis of provided PGL evidence. This additional category of 'unclassifiable' should be accommodated within medical decision support systems if serious errors and unnecessary adjuvant therapy are to be avoided.</p

    Γ-stochastic neighbour embedding for feed-forward data visualization

    Get PDF
    t-distributed Stochastic Neighbour Embedding (t-SNE) is one of the most popular nonlinear dimension reduction techniques used in multiple application domains. In this paper we propose a variation on the embedding neighbourhood distribution, resulting in Γ-SNE, which can construct a feed-forward mapping using an RBF network. We compare the visualizations generated by Γ-SNE with those of t-SNE and provide empirical evidence suggesting the network is capable of robust interpolation and automatic weight regularization

    Fast empirical Bayesian LASSO for multiple quantitative trait locus mapping

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The Bayesian shrinkage technique has been applied to multiple quantitative trait loci (QTLs) mapping to estimate the genetic effects of QTLs on quantitative traits from a very large set of possible effects including the main and epistatic effects of QTLs. Although the recently developed empirical Bayes (EB) method significantly reduced computation comparing with the fully Bayesian approach, its speed and accuracy are limited by the fact that numerical optimization is required to estimate the variance components in the QTL model.</p> <p>Results</p> <p>We developed a fast empirical Bayesian LASSO (EBLASSO) method for multiple QTL mapping. The fact that the EBLASSO can estimate the variance components in a closed form along with other algorithmic techniques render the EBLASSO method more efficient and accurate. Comparing with the EB method, our simulation study demonstrated that the EBLASSO method could substantially improve the computational speed and detect more QTL effects without increasing the false positive rate. Particularly, the EBLASSO algorithm running on a personal computer could easily handle a linear QTL model with more than 100,000 variables in our simulation study. Real data analysis also demonstrated that the EBLASSO method detected more reasonable effects than the EB method. Comparing with the LASSO, our simulation showed that the current version of the EBLASSO implemented in Matlab had similar speed as the LASSO implemented in Fortran, and that the EBLASSO detected the same number of true effects as the LASSO but a much smaller number of false positive effects.</p> <p>Conclusions</p> <p>The EBLASSO method can handle a large number of effects possibly including both the main and epistatic QTL effects, environmental effects and the effects of gene-environment interactions. It will be a very useful tool for multiple QTL mapping.</p

    Bezlotoxumab for Prevention of Recurrent Clostridium difficile Infection

    No full text
    BACKGROUND Clostridium difficile is the most common cause of infectious diarrhea in hospitalized patients. Recurrences are common after antibiotic therapy. Actoxumab and bezlotoxumab are human monoclonal antibodies against C. difficile toxins A and B, respectively. METHODS We conducted two double-blind, randomized, placebo-controlled, phase 3 trials, MODIFY I and MODIFY II, involving 2655 adults receiving oral standard-of-care antibiotics for primary or recurrent C. difficile infection. Participants received an infusion of bezlotoxumab (10 mg per kilogram of body weight), actoxumab plus bezlotoxumab (10 mg per kilogram each), or placebo; actoxumab alone (10 mg per kilogram) was given in MODIFY I but discontinued after a planned interim analysis. The primary end point was recurrent infection (new episode after initial clinical cure) within 12 weeks after infusion in the modified intention-to-treat population. RESULTS In both trials, the rate of recurrent C. difficile infection was significantly lower with bezlotoxumab alone than with placebo (MODIFY I: 17% [67 of 386] vs. 28% [109 of 395]; adjusted difference, −10.1 percentage points; 95% confidence interval [CI], −15.9 to −4.3; P<0.001; MODIFY II: 16% [62 of 395] vs. 26% [97 of 378]; adjusted difference, −9.9 percentage points; 95% CI, −15.5 to −4.3; P<0.001) and was significantly lower with actoxumab plus bezlotoxumab than with placebo (MODIFY I: 16% [61 of 383] vs. 28% [109 of 395]; adjusted difference, −11.6 percentage points; 95% CI, −17.4 to −5.9; P<0.001; MODIFY II: 15% [58 of 390] vs. 26% [97 of 378]; adjusted difference, −10.7 percentage points; 95% CI, −16.4 to −5.1; P<0.001). In prespecified subgroup analyses (combined data set), rates of recurrent infection were lower in both groups that received bezlotoxumab than in the placebo group in subpopulations at high risk for recurrent infection or for an adverse outcome. The rates of initial clinical cure were 80% with bezlotoxumab alone, 73% with actoxumab plus bezlotoxumab, and 80% with placebo; the rates of sustained cure (initial clinical cure without recurrent infection in 12 weeks) were 64%, 58%, and 54%, respectively. The rates of adverse events were similar among these groups; the most common events were diarrhea and nausea. CONCLUSIONS Among participants receiving antibiotic treatment for primary or recurrent C. difficile infection, bezlotoxumab was associated with a substantially lower rate of recurrent infection than placebo and had a safety profile similar to that of placebo. The addition of actoxumab did not improve efficacy. (Funded by Merck; MODIFY I and MODIFY II ClinicalTrials.gov numbers, NCT01241552 and NCT01513239.
    corecore