280 research outputs found
Action Class Relation Detection and Classification Across Multiple Video Datasets
The Meta Video Dataset (MetaVD) provides annotated relations between action
classes in major datasets for human action recognition in videos. Although
these annotated relations enable dataset augmentation, it is only applicable to
those covered by MetaVD. For an external dataset to enjoy the same benefit, the
relations between its action classes and those in MetaVD need to be determined.
To address this issue, we consider two new machine learning tasks: action class
relation detection and classification. We propose a unified model to predict
relations between action classes, using language and visual information
associated with classes. Experimental results show that (i) pre-trained recent
neural network models for texts and videos contribute to high predictive
performance, (ii) the relation prediction based on action label texts is more
accurate than based on videos, and (iii) a blending approach that combines
predictions by both modalities can further improve the predictive performance
in some cases.Comment: Accepted to Pattern Recognition Letters. 12 pages, 4 figure
Learning Decorrelated Representations Efficiently Using Fast Fourier Transform
Barlow Twins and VICReg are self-supervised representation learning models
that use regularizers to decorrelate features. Although these models are as
effective as conventional representation learning models, their training can be
computationally demanding if the dimension d of the projected embeddings is
high. As the regularizers are defined in terms of individual elements of a
cross-correlation or covariance matrix, computing the loss for n samples takes
O(n d^2) time. In this paper, we propose a relaxed decorrelating regularizer
that can be computed in O(n d log d) time by Fast Fourier Transform. We also
propose an inexpensive technique to mitigate undesirable local minima that
develop with the relaxation. The proposed regularizer exhibits accuracy
comparable to that of existing regularizers in downstream tasks, whereas their
training requires less memory and is faster for large d. The source code is
available.Comment: Accepted for CVPR 202
A nonlinear parametric model based on a power law relationship for predicting the coastal tsunami height
When a subduction-zone earthquake occurs, the tsunami height must be predicted to cope with the damage generated by the tsunami. Therefore, tsunami height prediction methods have been studied using simulation data acquired by large-scale calculations. In this research, we consider the existence of a nonlinear power law relationship between the water pressure gauge data observed by the Dense Oceanfloor Network System for Earthquakes and Tsunamis (DONET) and the coastal tsunami height. Using this relationship, we propose a nonlinear parametric model and conduct a prediction experiment to compare the accuracy of the proposed method with those of previous methods and implement particular improvements to the extrapolation accuracy
- …