148 research outputs found

Semantic Based Sport Video Browsing

Author: Xueming Qian
Publication venue: 'IntechOpen'
Publication date: 25/04/2012

IntechOpen

Semantic Model Vectors for Complex Video Event Recognition

Authors: Apostol Natsev; Bert Huang; Gang Hua; Lexing Xie; Michele Merler
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'

Crossref

Automated classification of cricket pitch frames in cricket video

Authors: Bananki Jayanth Sandesh; Srinivasa Gowri
Publication venue: 'Universitat Autonoma de Barcelona'
Publication date: 01/01/2014

The automated detection of the cricket pitch in a video recording of a cricket match is a fundamental step in content-based indexing and summarization of cricket videos. In this paper, we propose visual-content-based algorithms to automate the extraction of video frames with the cricket pitch in focus. As a preprocessing step, we first select a subset of frames with a view of the cricket field, of which the cricket pitch forms a part. This filtering process reduces the search space by eliminating frames that contain a view of the audience, close-up shots of specific players, advertisements, etc. The subset of frames containing the cricket field is then subjected to statistical modeling of the grayscale (brightness) histogram (SMoG). Since SMoG does not utilize color or domain-specific information, such as the region in the frame where the pitch is expected to be located, we propose an alternative algorithm: component quantization based region of interest extraction (CQRE) for the extraction of pitch frames. Experimental results demonstrate that, regardless of the quality of the input, successive application of the two methods outperforms either one applied exclusively. The SMoG-CQRE combination for pitch frame classification yields an average accuracy of 98.6% in the best case (a high-resolution video with good contrast) and an average accuracy of 87.9% in the worst case (a low-resolution video with poor contrast). Since the extraction of pitch frames forms the first step in analyzing the important events in a match, we also present a post-processing step, viz., an algorithm to detect players in the extracted pitch frames.

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Crossref

Directory of Open Access Journals

Revistes Catalanes amb Accés Obert

Electronic Letters on Computer Vision and Image Analysis (ELCVIA - Universitat Autònoma de Barcelona)

Diposit Digital de Documents de la UAB

Integrated analysis of audiovisual signals and external information sources for event detection in team sports video

Author: XU HUAXIN
Publication date: 28/04/2008

Ph.D. (Doctor of Philosophy)

ScholarBank@NUS

A Literature Study On Video Retrieval Approaches

Author: S PADMAKALA
Publication venue: International Journal of Innovative Technology and Research
Publication date: 13/09/2019

A detailed survey has been carried out to identify the research articles available in the literature across all categories of video retrieval and to analyze their major contributions and advantages. The following literature is used to assess the state of the art in video retrieval. A large number of papers have been studied.

International Journal of Innovative Technology and Research (IJITR)

Object and event recognition in multimedia archives using local visual features

Author: Ballan Lamberto
Publication date: 01/01/2011

Florence Research

A Neuro-Symbolic Approach for Real-World Event Recognition from Weak Supervision

Authors: Apriceno Gianluca; Passerini Andrea; Serafini Luciano
Publication venue: LIPIcs - Leibniz International Proceedings in Informatics. 29th International Symposium on Temporal Representation and Reasoning (TIME 2022)
Publication date: 01/01/2022

Events are structured entities involving different components (e.g., the participants, their roles, etc.) and their relations. Structured events are typically defined in terms of (a subset of) simpler, atomic events and a set of temporal relations between them. Temporal Event Detection (TED) is the task of detecting structured and atomic events within data streams, most often text or video sequences, and has numerous applications, from video surveillance to sports analytics. Existing deep learning approaches solve the TED task by implicitly learning the temporal correlations among events from data. As a consequence, these approaches often fail to ensure consistent predictions with respect to the relationship between structured and atomic events. On the other hand, neuro-symbolic approaches have shown their capability to constrain the output of neural networks to be consistent with the background knowledge of the domain. In this paper, we propose a neuro-symbolic approach for TED in a real-world scenario involving sports activities. We show how, by incorporating simple knowledge involving the relative order of atomic events and constraints on their duration, the approach substantially outperforms a fully neural solution in terms of recognition accuracy when little or even no supervision is available on the atomic events.

Archivio della ricerca - Fondazione Bruno Kessler

Dagstuhl Research Online Publication Server

Content-based video indexing for sports applications using integrated multi-modal approach

Author: Tjondronegoro Dian W.
Publication venue: Deakin University, Faculty of Science and Technology, School of Information Technology
Publication date: 01/01/2005

This thesis presents research based on an integrated multi-modal approach for sports video indexing and retrieval. By combining specific features extractable from multiple (audio-visual) modalities, generic structure and specific events can be detected and classified. During browsing and retrieval, users will benefit from the integration of high-level semantics and some descriptive mid-level features, such as a whistle and a close-up view of player(s). The main objective is to contribute to the three major components of sports video indexing systems. The first component is a set of powerful techniques to extract audio-visual features and semantic content automatically. The main purposes are to reduce manual annotation and to summarize the lengthy content into a compact, meaningful and more enjoyable presentation. The second component is an expressive and flexible indexing technique that supports gradual index construction. An indexing scheme is essential because it determines the methods by which users can access a video database. The third and last component is a query language that can generate dynamic video summaries for smart browsing and support user-oriented retrieval.

Deakin Research Online