820 research outputs found
Improving Viewpoint Robustness for Visual Recognition via Adversarial Training
Viewpoint invariance remains challenging for visual recognition in the 3D
world, as altering the viewing directions can significantly impact predictions
for the same object. While substantial efforts have been dedicated to making
neural networks invariant to 2D image translations and rotations, viewpoint
invariance is rarely investigated. Motivated by the success of adversarial
training in enhancing model robustness, we propose Viewpoint-Invariant
Adversarial Training (VIAT) to improve the viewpoint robustness of image
classifiers. Regarding viewpoint transformation as an attack, we formulate VIAT
as a minimax optimization problem, where the inner maximization characterizes
diverse adversarial viewpoints by learning a Gaussian mixture distribution
based on the proposed attack method GMVFool. The outer minimization obtains a
viewpoint-invariant classifier by minimizing the expected loss over the
worst-case viewpoint distributions that can share the same one for different
objects within the same category. Based on GMVFool, we contribute a large-scale
dataset called ImageNet-V+ to benchmark viewpoint robustness. Experimental
results show that VIAT significantly improves the viewpoint robustness of
various image classifiers based on the diversity of adversarial viewpoints
generated by GMVFool. Furthermore, we propose ViewRS, a certified viewpoint
robustness method that provides a certified radius and accuracy to demonstrate
the effectiveness of VIAT from the theoretical perspective.Comment: 14 pages, 12 figures. arXiv admin note: substantial text overlap with
arXiv:2307.1023
Breast MRI radiomics and machine learning radiomics-based predictions of response to neoadjuvant chemotherapy -- how are they affected by variations in tumour delineation?
Manual delineation of volumes of interest (VOIs) by experts is considered the
gold-standard method in radiomics analysis. However, it suffers from inter- and
intra-operator variability. A quantitative assessment of the impact of
variations in these delineations on the performance of the radiomics predictors
is required to develop robust radiomics based prediction models. In this study,
we developed radiomics models for the prediction of pathological complete
response to neoadjuvant chemotherapy in patients with two different breast
cancer subtypes based on contrast-enhanced magnetic resonance imaging acquired
prior to treatment (baseline MRI scans). Different mathematical operations such
as erosion, smoothing, dilation, randomization, and ellipse fitting were
applied to the original VOIs delineated by experts to simulate variations of
segmentation masks. The effects of such VOI modifications on various steps of
the radiomics workflow, including feature extraction, feature selection, and
prediction performance, were evaluated. Using manual tumor VOIs and radiomics
features extracted from baseline MRI scans, an AUC of up to 0.96 and 0.89 was
achieved for human epidermal growth receptor 2 positive and triple-negative
breast cancer, respectively. For smoothing and erosion, VOIs yielded the
highest number of robust features and the best prediction performance, while
ellipse fitting and dilation lead to the lowest robustness and prediction
performance for both breast cancer subtypes. At most 28% of the selected
features were similar to manual VOIs when different VOI delineation data were
used. Differences in VOI delineation affects different steps of radiomics
analysis, and their quantification is therefore important for development of
standardized radiomics research
Advances on Time Series Analysis using Elastic Measures of Similarity
A sequence is a collection of data instances arranged in a structured manner. When this arrangement is held in the time domain, sequences are instead referred to as time series. As such, each observation in a time series represents an observation drawn from an underlying process, produced at a specific time instant. However, other type of data indexing structures, such as space- or threshold-based arrangements are possible. Data points that compose a time series are often correlated with each other. To account for this correlation in data mining tasks, time series are usually studied as a whole data object rather than as a collection of independent observations. In this context, techniques for time series analysis aim at analyzing this type of data structures by applying specific approaches developed to leverage intrinsic properties of the time series for a wide range of problems, such as classification, clustering and other tasks alike.
The development of monitoring and storage devices has made time se- ries analysis proliferate in numerous application fields, including medicine, economics, manufacturing and telecommunications, among others. Over the years, the community has gathered efforts towards the development of new data-based techniques for time series analysis suited to address the problems and needs of such application fields. In the related literature, such techniques can be divided in three main groups: feature-, model- and distance-based methods. The first group (feature-based) transforms time series into a collection of features, which are then used by conventional learning algorithms to provide solutions to the task under consideration. In contrast, methods belonging to the second group (model-based) assume that each time series is drawn from a generative model, which is then har- nessed to elicit knowledge from data. Finally, distance-based techniques operate directly on raw time series. To this end, these methods resort to specially defined measures of distance or similarity for comparing time series, without requiring any further processing. Among them, elastic sim- ilarity measures (e.g., dynamic time warping and edit distance) compute the closeness between two sequences by finding the best alignment between them, disregarding differences in time, and thus focusing exclusively on shape differences.
This Thesis presents several contributions to the field of distance-based techniques for time series analysis, namely: i) a novel multi-dimensional elastic similarity learning method for time series classification; ii) an adap- tation of elastic measures to streaming time series scenarios; and iii) the use of distance-based time series analysis to make machine learning meth- ods for image classification robust against adversarial attacks. Throughout the Thesis, each contribution is framed within its related state of the art, explained in detail and empirically evaluated. The obtained results lead to new insights on the application of distance-based time series methods for the considered scenarios, and motivates research directions that highlight the vibrant momentum of this research area
- …