100,493 research outputs found
High-Dimensional Feature Selection by Feature-Wise Kernelized Lasso
The goal of supervised feature selection is to find a subset of input
features that are responsible for predicting output values. The least absolute
shrinkage and selection operator (Lasso) allows computationally efficient
feature selection based on linear dependency between input features and output
values. In this paper, we consider a feature-wise kernelized Lasso for
capturing non-linear input-output dependency. We first show that, with
particular choices of kernel functions, non-redundant features with strong
statistical dependence on output values can be found in terms of kernel-based
independence measures. We then show that the globally optimal solution can be
efficiently computed; this makes the approach scalable to high-dimensional
problems. The effectiveness of the proposed method is demonstrated through
feature selection experiments with thousands of features.Comment: 18 page
Deep Learning versus Classical Regression for Brain Tumor Patient Survival Prediction
Deep learning for regression tasks on medical imaging data has shown
promising results. However, compared to other approaches, their power is
strongly linked to the dataset size. In this study, we evaluate
3D-convolutional neural networks (CNNs) and classical regression methods with
hand-crafted features for survival time regression of patients with high grade
brain tumors. The tested CNNs for regression showed promising but unstable
results. The best performing deep learning approach reached an accuracy of
51.5% on held-out samples of the training set. All tested deep learning
experiments were outperformed by a Support Vector Classifier (SVC) using 30
radiomic features. The investigated features included intensity, shape,
location and deep features. The submitted method to the BraTS 2018 survival
prediction challenge is an ensemble of SVCs, which reached a cross-validated
accuracy of 72.2% on the BraTS 2018 training set, 57.1% on the validation set,
and 42.9% on the testing set. The results suggest that more training data is
necessary for a stable performance of a CNN model for direct regression from
magnetic resonance images, and that non-imaging clinical patient information is
crucial along with imaging information.Comment: Contribution to The International Multimodal Brain Tumor Segmentation
(BraTS) Challenge 2018, survival prediction tas
Detecting Generalized Synchronization Between Chaotic Signals: A Kernel-based Approach
A unified framework for analyzing generalized synchronization in coupled
chaotic systems from data is proposed. The key of the proposed approach is the
use of the kernel methods recently developed in the field of machine learning.
Several successful applications are presented, which show the capability of the
kernel-based approach for detecting generalized synchronization. It is also
shown that the dynamical change of the coupling coefficient between two chaotic
systems can be captured by the proposed approach.Comment: 20 pages, 15 figures. massively revised as a full paper; issues on
the choice of parameters by cross validation, tests by surrogated data, etc.
are added as well as additional examples and figure
Generalization in Deep Learning
This paper provides theoretical insights into why and how deep learning can
generalize well, despite its large capacity, complexity, possible algorithmic
instability, nonrobustness, and sharp minima, responding to an open question in
the literature. We also discuss approaches to provide non-vacuous
generalization guarantees for deep learning. Based on theoretical observations,
we propose new open problems and discuss the limitations of our results.Comment: To appear in Mathematics of Deep Learning, Cambridge University
Press. All previous results remain unchange
Automated Lensing Learner: Automated Strong Lensing Identification with a Computer Vision Technique
Forthcoming surveys such as the Large Synoptic Survey Telescope (LSST) and
Euclid necessitate automatic and efficient identification methods of strong
lensing systems. We present a strong lensing identification approach that
utilizes a feature extraction method from computer vision, the Histogram of
Oriented Gradients (HOG), to capture edge patterns of arcs. We train a
supervised classifier model on the HOG of mock strong galaxy-galaxy lens images
similar to observations from the Hubble Space Telescope (HST) and LSST. We
assess model performance with the area under the curve (AUC) of a Receiver
Operating Characteristic (ROC) curve. Models trained on 10,000 lens and
non-lens containing images images exhibit an AUC of 0.975 for an HST-like
sample, 0.625 for one exposure of LSST, and 0.809 for 10-year mock LSST
observations. Performance appears to continually improve with the training set
size. Models trained on fewer images perform better in absence of the lens
galaxy light. However, with larger training data sets, information from the
lens galaxy actually improves model performance, indicating that HOG captures
much of the morphological complexity of the arc finding problem. We test our
classifier on data from the Sloan Lens ACS Survey and find that small scale
image features reduces the efficiency of our trained model. However, these
preliminary tests indicate that some parameterizations of HOG can compensate
for differences between observed mock data. One example best-case
parameterization results in an AUC of 0.6 in the F814 filter image with other
parameterization results equivalent to random performance.Comment: 18 pages, 14 figures, summarizing results in figure
- …