
    Multi-label Class-imbalanced Action Recognition in Hockey Videos via 3D Convolutional Neural Networks

    Automatic analysis of video is one of the most complex problems in the fields of computer vision and machine learning. A significant part of this research deals with human activity recognition (HAR), since humans, and the activities that they perform, generate most of the video semantics. Video-based HAR has applications in various domains, but one of the most important and challenging is HAR in sports videos. Some of the major issues include high inter- and intra-class variations, large class imbalance, the presence of both group actions and single-player actions, and recognizing simultaneous actions, i.e., the multi-label learning problem. Keeping in mind these challenges and the recent success of CNNs in solving various computer vision problems, in this work we implement a 3D CNN-based multi-label deep HAR system for multi-label class-imbalanced action recognition in hockey videos. We test our system in two different scenarios, an ensemble of k binary networks vs. a single k-output network, on a publicly available dataset. We also compare our results with the system that was originally designed for the chosen dataset. Experimental results show that the proposed approach performs better than the existing solution. Comment: Accepted to IEEE/ACIS SNPD 2018, 6 pages, 3 figures

    Action Recognition in Videos: from Motion Capture Labs to the Web

    This paper presents a survey of human action recognition approaches based on visual data recorded from a single video camera. We propose an organizing framework that highlights the evolution of the area, with techniques moving from heavily constrained motion capture scenarios towards more challenging, realistic, "in the wild" videos. The proposed organization is based on the representation used as input for the recognition task, emphasizing the hypotheses assumed and thus the constraints imposed on the type of video that each technique is able to address. Making the hypotheses and constraints explicit makes the framework particularly useful for selecting a method, given an application. Another advantage of the proposed organization is that it allows the newest approaches to be categorized seamlessly with traditional ones, while providing an insightful perspective on the evolution of the action recognition task up to now. That perspective is the basis for the discussion at the end of the paper, where we also present the main open issues in the area. Comment: Preprint submitted to CVIU, survey paper, 46 pages, 2 figures, 4 tables

    Event detection in field sports video using audio-visual features and a support vector machine

    In this paper, we propose a novel audio-visual feature-based framework for event detection in broadcast video of multiple field sports. Features indicating significant events are selected and robust detectors are built. These features are rooted in characteristics common to all genres of field sports. The evidence gathered by the feature detectors is combined by means of a support vector machine, which infers the occurrence of an event based on a model generated during a training phase. The system is tested generically across multiple genres of field sports, including soccer, rugby, hockey, and Gaelic football, and the results suggest that high event retrieval and content rejection statistics are achievable.

    Real-time event classification in field sport videos

    The paper presents a novel approach to real-time event detection in sports broadcasts. We show how the same underlying audio-visual feature extraction algorithm, based on new global image descriptors, is robust across a range of different sports, alleviating the need to tailor it to a particular sport. In addition, we propose and evaluate three different classifiers to detect events using these features: a feed-forward neural network, an Elman neural network, and a decision tree. Each is investigated and evaluated in terms of its usefulness for real-time event classification. We also propose a ground truth dataset, together with an annotation technique for performance evaluation of each classifier, which may be useful to others interested in this problem.

    Semantic Based Sport Video Browsing


    The tactics of successful attacks in professional association football: large-scale spatiotemporal analysis of dynamic subgroups using position tracking data

    Association football teams can be considered complex dynamical systems of individuals grouped in subgroups (defenders, midfielders and attackers), coordinating their behaviour to achieve a shared goal. As research often focusses on collective behaviour, or on static subgroups, the current study aims to analyse the spatiotemporal behaviour of dynamic subgroups in relation to successful attacks. We collected position tracking data from 118 Dutch Eredivisie matches, containing 12424 attacks. Attacks were classified as successful (N = 1237) or non-successful (N = 11187) based on the potential of creating a scoring opportunity. Using unsupervised machine learning, we automatically identified dynamic formations based on position tracking data, and identified dynamic subgroups for every timeframe in a match. We then compared the subgroup centroids to assess the intra- and inter-team spatiotemporal synchronisation during successful and non-successful attacks, using circular statistics. Our results indicated that subgroup-level variables provided more information, and were more sensitive to disruption, than team-level variables. When comparing successful and non-successful attacks, we found decreases (p < .01) in longitudinal inter- and intra-team synchrony of interactions involving the defenders of the attacking team during successful attacks. This study provides the first large-scale dynamic subgroup analysis and reveals insights additional to those of team-level analyses.