Search CORE

542 research outputs found

Content-based Video Retrieval by Integrating Spatio-Temporal and Stochastic Recognition of Events

Author: Jonker W.
Petkovic M.
Publication venue: IEEE
Publication date: 01/01/2001
Field of study

As amounts of publicly available video data grow the need to query this data efficiently becomes significant. Consequently content-based retrieval of video data turns out to be a challenging and important problem. We address the specific aspect of inferring semantics automatically from raw video data. In particular, we introduce a new video data model that supports the integrated use of two different approaches for mapping low-level features to high-level concepts. Firstly, the model is extended with a rule-based approach that supports spatio-temporal formalization of high-level concepts, and then with a stochastic approach. Furthermore, results on real tennis video data are presented, demonstrating the validity of both approaches, as well us advantages of their integrated us

CiteSeerX

Pure OAI Repository

University of Twente Research Information

A spatio-temporal and a probabilistic approach for video retrieval

Author: Jonker Willem
Petkovic M.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

University of Twente Research Information

An Overview of Video Shot Clustering and Summarization Techniques for Mobile Applications

Author: Adami N.
Benini S.
Leonardi R.
Publication venue: Mobimedia
Publication date: 01/01/2006
Field of study

The problem of content characterization of video programmes is of great interest because video appeals to large audiences and its efficient distribution over various networks should contribute to widespread usage of multimedia services. In this paper we analyze several techniques proposed in literature for content characterization of video programmes, including movies and sports, that could be helpful for mobile media consumption. In particular we focus our analysis on shot clustering methods and effective video summarization techniques since, in the current video analysis scenario, they facilitate the access to the content and help in quick understanding of the associated semantics. First we consider the shot clustering techniques based on low-level features, using visual, audio and motion information, even combined in a multi-modal fashion. Then we concentrate on summarization techniques, such as static storyboards, dynamic video skimming and the extraction of sport highlights. Discussed summarization methods can be employed in the development of tools that would be greatly useful to most mobile users: in fact these algorithms automatically shorten the original video while preserving most events by highlighting only the important content. The effectiveness of each approach has been analyzed, showing that it mainly depends on the kind of video programme it relates to, and the type of summary or highlights we are focusing on

Crossref

Archivio istituzionale della ricerca - Università di Brescia

A novel framework for semantic analysis of an illumination-variant soccer video

Author
Publication venue: Springer
Publication date: 03/11/2014
Field of study

Springer - Publisher Connector

An HMM-Based Framework for Video Semantic Analysis

Author: Ma Yu-Fei
Xu Gu
Yang Shiqiang
Zhang Hong-Jiang
Publication venue: Scholars\u27 Mine
Publication date: 01/01/2005
Field of study

Video semantic analysis is essential in video indexing and structuring. However, due to the lack of robust and generic algorithms, most of the existing works on semantic analysis are limited to specific domains. In this paper, we present a novel hidden Markove model (HMM)-based framework as a general solution to video semantic analysis. In the proposed framework, semantics in different granularities are mapped to a hierarchical model space, which is composed of detectors and connectors. In this manner, our model decomposes a complex analysis problem into simpler subproblems during the training process and automatically integrates those subproblems for recognition. The proposed framework is not only suitable for a broad range of applications, but also capable of modeling semantics in different semantic granularities. Additionally, we also present a new motion representation scheme, which is robust to different motion vector sources. The applications of the proposed framework in basketball event detection, soccer shot classification, and volleyball sequence analysis have demonstrated the effectiveness of the proposed framework on video semantic analysis

Missouri University of Science and Technology (Missouri S&T): Scholars' Mine

A novel framework for semantic analysis of an illumination-variant soccer video

Author: AG Money
B Li
B Lucas
C Huang
C Wu
D Zhang
DA Sadlier
Devang S Pandya
DW Tjondronegoro
G Xu
G Yue
H Chen
H Chung-Lin
J Assfalg
J Basak
J Liu
K Ren
L Ying
M Petkovic
M Roach
MS Hosseini
Mukesh A Zaveri
R Leonardi
Ren
S Alipour
S Carrato
S Ling
S Wei
V Kiani
V Mihajlovic
W Jinjun
W Wolf
W Zhou
X Lexing
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Automated classification of cricket pitch frames in cricket video

Author: Bananki Jayanth Sandesh
Srinivasa Gowri
Publication venue: 'Universitat Autonoma de Barcelona'
Publication date: 01/01/2014
Field of study

The automated detection of the cricket pitch in a video recording of a cricket match is a fundamental step in content-based indexing and summarization of cricket videos. In this paper, we propose visualcontent based algorithms to automate the extraction of video frames with the cricket pitch in focus. As a preprocessing step, we first select a subset of frames with a view of the cricket field, of which the cricket pitch forms a part. This filtering process reduces the search space by eliminating frames that contain a view of the audience, close-up shots of specific players, advertisements, etc. The subset of frames containing the cricket field is then subject to statistical modeling of the grayscale (brightness) histogram (SMoG). Since SMoG does not utilize color or domain-specific information such as the region in the frame where the pitch is expected to be located, we propose an alternative algorithm: component quantization based region of interest extraction (CQRE) for the extraction of pitch frames. Experimental results demonstrate that, regardless of the quality of the input, successive application of the two methods outperforms either one applied exclusively. The SMoG-CQRE combination for pitch frame classification yields an average accuracy of 98:6% in the best case (a high resolution video with good contrast) and an average accuracy of 87:9% in the worst case (a low resolution video with poor contrast). Since, the extraction of pitch frames forms the first step in analyzing the important events in a match, we also present a post-processing step, viz. , an algorithm to detect players in the extracted pitch frames

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Directory of Open Access Journals

Revistes Catalanes amb Accés Obert

Electronic Letters on Computer Vision and Image Analysis (ELCVIA - Universitat Autònoma de Barcelona)

Diposit Digital de Documents de la UAB

Tracking interacting targets in multi-modal sensors

Author: Taj Murtaza
Publication venue
Publication date: 01/01/2009
Field of study

PhDObject tracking is one of the fundamental tasks in various applications such as surveillance, sports, video conferencing and activity recognition. Factors such as occlusions, illumination changes and limited field of observance of the sensor make tracking a challenging task. To overcome these challenges the focus of this thesis is on using multiple modalities such as audio and video for multi-target, multi-modal tracking. Particularly, this thesis presents contributions to four related research topics, namely, pre-processing of input signals to reduce noise, multi-modal tracking, simultaneous detection and tracking, and interaction recognition. To improve the performance of detection algorithms, especially in the presence of noise, this thesis investigate filtering of the input data through spatio-temporal feature analysis as well as through frequency band analysis. The pre-processed data from multiple modalities is then fused within Particle filtering (PF). To further minimise the discrepancy between the real and the estimated positions, we propose a strategy that associates the hypotheses and the measurements with a real target, using a Weighted Probabilistic Data Association (WPDA). Since the filtering involved in the detection process reduces the available information and is inapplicable on low signal-to-noise ratio data, we investigate simultaneous detection and tracking approaches and propose a multi-target track-beforedetect Particle filtering (MT-TBD-PF). The proposed MT-TBD-PF algorithm bypasses the detection step and performs tracking in the raw signal. Finally, we apply the proposed multi-modal tracking to recognise interactions between targets in regions within, as well as outside the cameras’ fields of view. The efficiency of the proposed approaches are demonstrated on large uni-modal, multi-modal and multi-sensor scenarios from real world detections, tracking and event recognition datasets and through participation in evaluation campaigns

Queen Mary Research Online

OpenGrey Repository

Multimodal framework based on audio‐visual features for summarisation of cricket videos

Author: Adnan Syed
Irtaza Aun
Javed Ali
Mahmood Muhammad Tariq
Malik Hafiz
Publication venue: 'Institution of Engineering and Technology (IET)'
Publication date: 01/03/2019
Field of study

Peer Reviewedhttp://deepblue.lib.umich.edu/bitstream/2027.42/166171/1/ipr2bf02094.pd

Deep Blue Documents at the University of Michigan