20,857 research outputs found
Machine Learning and the Future of Realism
The preceding three decades have seen the emergence, rise, and proliferation
of machine learning (ML). From half-recognised beginnings in perceptrons,
neural nets, and decision trees, algorithms that extract correlations (that is,
patterns) from a set of data points have broken free from their origin in
computational cognition to embrace all forms of problem solving, from voice
recognition to medical diagnosis to automated scientific research and
driverless cars, and it is now widely opined that the real industrial
revolution lies less in mobile phone and similar than in the maturation and
universal application of ML. Among the consequences just might be the triumph
of anti-realism over realism
The Applicability of Nonlinear Systems Dynamics Chaos Measures to Cardiovascular Physiology Variables
Three measures of nonlinear chaos (fractal dimension, Approximate Entropy (ApEn), and Lyapunov exponents) were studied as potential measures of cardiovascular condition. It is suggested that these measures have potential in the assessment of cardiovascular condition in environments of normal cardiovascular stress (normal gravity on the Earth surface), cardiovascular deconditioning (microgravity of space), and increased cardiovascular stress (lower body negative pressure (LBNP) treatments)
Formal Hypothesis Tests for Additive Structure in Random Forests
While statistical learning methods have proved powerful tools for predictive
modeling, the black-box nature of the models they produce can severely limit
their interpretability and the ability to conduct formal inference. However,
the natural structure of ensemble learners like bagged trees and random forests
has been shown to admit desirable asymptotic properties when base learners are
built with proper subsamples. In this work, we demonstrate that by defining an
appropriate grid structure on the covariate space, we may carry out formal
hypothesis tests for both variable importance and underlying additive model
structure. To our knowledge, these tests represent the first statistical tools
for investigating the underlying regression structure in a context such as
random forests. We develop notions of total and partial additivity and further
demonstrate that testing can be carried out at no additional computational cost
by estimating the variance within the process of constructing the ensemble.
Furthermore, we propose a novel extension of these testing procedures utilizing
random projections in order to allow for computationally efficient testing
procedures that retain high power even when the grid size is much larger than
that of the training set
- …
