Search CORE

3,168 research outputs found

Object tracking using level set and MPEG 7 color features

Author: Oussalah M.
Shabash M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/10/2012
Field of study

Crossref

University of Birmingham Research Portal

Video semantic content analysis framework based on ontology combined MPEG-7

Author: A. Artale
A. Ekin
A. Hanjalic
A. Jaimes
A. Yoshitaka
H.X. Xu
I. Kompatsiaris
S. Dasiopoulou
S.-F. Chang
V. Mezaris
W. Zhou
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/07/2007
Field of study

The rapid increase in the available amount of video data is creating a growing demand for efficient methods for understanding and managing it at the semantic level. New multimedia standard, MPEG-7, provides the rich functionalities to enable the generation of audiovisual descriptions and is expressed solely in XML Schema which provides little support for expressing semantic knowledge. In this paper, a video semantic content analysis framework based on ontology combined MPEG-7 is presented. Domain ontology is used to define high level semantic concepts and their relations in the context of the examined domain. MPEG-7 metadata terms of audiovisual descriptions and video content analysis algorithms are expressed in this ontology to enrich video semantic analysis. OWL is used for the ontology description. Rules in Description Logic are defined to describe how low-level features and algorithms for video analysis should be applied according to different perception content. Temporal Description Logic is used to describe the semantic events, and a reasoning algorithm is proposed for events detection. The proposed framework is demonstrated in sports video domain and shows promising results

Crossref

DCU Online Research Access Service

Real-time detection and tracking of multiple objects with partial decoding in H.264/AVC bitstream domain

Author: Kim Munchurl
Sabirin M. S. Houari
You Wonsang
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 21/02/2012
Field of study

In this paper, we show that we can apply probabilistic spatiotemporal macroblock filtering (PSMF) and partial decoding processes to effectively detect and track multiple objects in real time in H.264|AVC bitstreams with stationary background. Our contribution is that our method cannot only show fast processing time but also handle multiple moving objects that are articulated, changing in size or internally have monotonous color, even though they contain a chaotic set of non-homogeneous motion vectors inside. In addition, our partial decoding process for H.264|AVC bitstreams enables to improve the accuracy of object trajectories and overcome long occlusion by using extracted color information.Comment: SPIE Real-Time Image and Video Processing Conference 200

arXiv.org e-Print Archive

Crossref

Vision-Based Production of Personalized Video

Author: Chatzis S.
Doulamis A.
Doulamis N.
Kosmopoulos D.I.
Makris A.
Middleton S.E.
Publication venue
Publication date: 01/01/2008
Field of study

In this paper we present a novel vision-based system for the automated production of personalised video souvenirs for visitors in leisure and cultural heritage venues. Visitors are visually identified and tracked through a camera network. The system produces a personalized DVD souvenir at the end of a visitor’s stay allowing visitors to relive their experiences. We analyze how we identify visitors by fusing facial and body features, how we track visitors, how the tracker recovers from failures due to occlusions, as well as how we annotate and compile the final product. Our experiments demonstrate the feasibility of the proposed approach

CiteSeerX

Southampton (e-Prints Soton)

DSpace at NTUA

Video semantic content analysis based on ontology

Author: Bai Liang
Jones Gareth J.F.
Lao Songyang
Smeaton Alan F.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2007
Field of study

The rapid increase in the available amount of video data is creating a growing demand for efficient methods for understanding and managing it at the semantic level. New multimedia standards, such as MPEG-4 and MPEG-7, provide the basic functionalities in order to manipulate and transmit objects and metadata. But importantly, most of the content of video data at a semantic level is out of the scope of the standards. In this paper, a video semantic content analysis framework based on ontology is presented. Domain ontology is used to define high level semantic concepts and their relations in the context of the examined domain. And low-level features (e.g. visual and aural) and video content analysis algorithms are integrated into the ontology to enrich video semantic analysis. OWL is used for the ontology description. Rules in Description Logic are defined to describe how features and algorithms for video analysis should be applied according to different perception content and low-level features. Temporal Description Logic is used to describe the semantic events, and a reasoning algorithm is proposed for events detection. The proposed framework is demonstrated in a soccer video domain and shows promising results

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

Advanced content-based semantic scene analysis and information retrieval: the SCHEMA project

Author: E. Izquierdo
I. Kompatsiaris
J. R. Casas
M. G. Strintzis
Noel E
Noel E. O&apos
P. Migliorati
R. Leonardi
Publication venue: 'World Scientific Pub Co Pte Lt'
Publication date: 01/01/2003
Field of study

The aim of the SCHEMA Network of Excellence is to bring together a critical mass of universities, research centers, industrial partners and end users, in order to design a reference system for content-based semantic scene analysis, interpretation and understanding. Relevant research areas include: content-based multimedia analysis and automatic annotation of semantic multimedia content, combined textual and multimedia information retrieval, semantic -web, MPEG-7 and MPEG-21 standards, user interfaces and human factors. In this paper, recent advances in content-based analysis, indexing and retrieval of digital media within the SCHEMA Network are presented. These advances will be integrated in the SCHEMA module-based, expandable reference system

CiteSeerX

Archivio istituzionale della ricerca - Università di Brescia

DCU Online Research Access Service

A morphological approach for segmentation and tracking of human faces

Author: Marqués Acosta Fernando
Vilaplana Besler Verónica
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2000
Field of study

A new technique for segmenting and tracking human faces in video sequences is presented. The technique relies on morphological tools such as using connected operators to extract the connected component that more likely belongs to a face, and partition projection to track this component through the sequence. A binary partition tree (BPT) is used to implement the connected operator. The BPT is constructed based on the chrominance criteria and its nodes are analyzed so that the selected node maximizes an estimation of the likelihood of being part of a face. The tracking is performed using a partition projection approach. Images are divided into face and non-face parts, which are tracked through the sequence. The technique has been successfully assessed using several test sequences from the MPEG-4 (raw format) and the MPEG-7 databases (MPEG-1 format).Peer ReviewedPostprint (published version

UPCommons. Portal del coneixement obert de la UPC

Combining textual and visual information processing for interactive video retrieval: SCHEMA's participation in TRECVID 2004

Author: Doulaverakis Haralambos
Herrmann Stephan
Lehane Bart
Mezaris Vasileios
O'Connor Noel E.
Publication venue: 'University of Aden - Faculty of Economics and Administration'
Publication date: 01/11/2004
Field of study

In this paper, the two different applications based on the Schema Reference System that were developed by the SCHEMA NoE for participation to the search task of TRECVID 2004 are illustrated. The first application, named ”Schema-Text”, is an interactive retrieval application that employs only textual information while the second one, named ”Schema-XM”, is an extension of the former, employing algorithms and methods for combining textual, visual and higher level information. Two runs for each application were submitted, I A 2 SCHEMA-Text 3, I A 2 SCHEMA-Text 4 for Schema-Text and I A 2 SCHEMA-XM 1, I A 2 SCHEMA-XM 2 for Schema-XM. The comparison of these two applications in terms of retrieval efficiency revealed that the combination of information from different data sources can provide higher efficiency for retrieval systems. Experimental testing additionally revealed that initially performing a text-based query and subsequently proceeding with visual similarity search using one of the returned relevant keyframes as an example image is a good scheme for combining visual and textual information

Irish Universities

DCU Online Research Access Service