Video Analysis Tools for Annotating User-Generated Content from Social Events
In this presentation we describe how low-level metadata extraction tools have been applied in the context of the pan-European project Together Anywhere, Together Anytime (TA2). The TA2 project studies new forms of computer-mediated social communication between spatially and temporally distant people. In particular, we concentrate on automatic video analysis tools in an asynchronous community-based video-sharing environment called MyVideos, in which users can experience and share personalized music concert videos within their social group.
Automatic Time Skew Detection and Correction
In this paper, we propose a new approach to automatic time skew detection and correction for multisource audiovisual data recorded by different cameras and recorders during the same event. All recorded data are tested for potential time skew and corrected based on ASR-related features. The core of the algorithm is perceptual time-quefrency analysis with a precision of 10 ms. The results show correct time skew detection and elimination in 100% of cases on a real-life dataset of 32 broken sessions, and surpass the performance of fast cross-correlation while keeping lower system requirements.
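The skew-estimation idea can be sketched as follows. This is a minimal illustration only: it cross-correlates per-frame energy envelopes at 10 ms resolution, whereas the paper's method relies on perceptual, ASR-related features; all function names are invented for the sketch.

```python
# Sketch: estimate the time skew between two recordings of the same event
# by cross-correlating their per-frame energy envelopes (10 ms frames).
# Frame energy is a stand-in for the ASR-related features used in the paper.

def frame_energies(samples, sr, frame_ms=10):
    """Sum of squared samples per fixed-length frame (10 ms by default)."""
    hop = int(sr * frame_ms / 1000)
    return [sum(x * x for x in samples[i:i + hop])
            for i in range(0, len(samples) - hop + 1, hop)]

def estimate_skew_frames(ref, other, max_lag=100):
    """Lag (in frames) maximising the cross-correlation of two feature
    sequences; a positive lag means `other` started later than `ref`."""
    best_lag, best_score = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            pairs = zip(ref[lag:], other)
        else:
            pairs = zip(ref, other[-lag:])
        score = sum(r * o for r, o in pairs)
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag
```

With 10 ms frames, the detected skew in milliseconds is simply `lag * 10`; a real implementation would use perceptual features rather than raw energy to stay robust to gain differences between devices.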
Social Focus of Attention as a Time Function Derived from Multimodal Signals
In this paper, we present the results of a study on the social focus of attention as a time function derived from multisource multimodal signals recorded by different personal capturing devices during social events. The core of the approach is the fission and fusion of multichannel audio, video and social modalities to derive the social focus of attention. The results achieved to date on more than 16 hours of real-life data demonstrate the feasibility of the approach.
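One simple way to fuse per-modality evidence into a focus-of-attention track is a weighted score combination per time frame. The sketch below is purely illustrative; the modality names, weights and score format are assumptions, and the paper's actual fission/fusion scheme is not specified here.

```python
# Hypothetical sketch: combine per-modality attention scores into a single
# social focus-of-attention time function by weighted late fusion.

def fuse_focus(scores_by_modality, weights):
    """scores_by_modality maps a modality name to a list of per-frame
    dicts {target: score}; returns, per frame, the target whose weighted
    combined score is highest."""
    n_frames = len(next(iter(scores_by_modality.values())))
    focus = []
    for t in range(n_frames):
        combined = {}
        for modality, frames in scores_by_modality.items():
            w = weights[modality]
            for target, s in frames[t].items():
                combined[target] = combined.get(target, 0.0) + w * s
        focus.append(max(combined, key=combined.get))
    return focus
```

For example, with audio weighted 0.6 and video 0.4, a frame where audio strongly favours one participant can still be overridden by sufficiently confident visual evidence.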
Socially-Aware Multimedia Authoring
Bulterman, D.C.A. [Promotor]; Cesar, P.S. [Copromotor]
Automatic Temporal Alignment of AV Data with Confidence Estimation
In this paper, we propose a new approach to automatic audio-based temporal alignment, with confidence estimation, of audiovisual data recorded by different cameras, camcorders or mobile phones during social events. All recorded data are temporally aligned with a common master track, recorded by a reference camera, based on ASR-related features, and the corresponding alignment confidence is estimated. The core of the algorithm is perceptual time-frequency analysis with a precision of 10 ms. The results show correct alignment in 99% of cases on a real-life dataset and surpass the performance of cross-correlation while keeping lower system requirements. Index Terms: time-frequency analysis, time synchronisation, pattern matching, reliability estimation
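A common way to attach a confidence to such an alignment is to compare the best correlation peak against the best competing peak; the sketch below does this over per-frame feature sequences. It is an assumption-laden stand-in: the paper's actual features and confidence estimator are not reproduced here, and the guard-window heuristic is invented for illustration.

```python
# Sketch: align a track against a master feature sequence and report a
# confidence equal to the ratio of the best correlation score to the best
# score found outside a small guard window around the winning lag.

def align_with_confidence(ref, track, max_lag=100, guard=2):
    """Return (best_lag, confidence) for aligning `track` to `ref`.
    A sharp, isolated correlation peak yields high confidence."""
    scores = {}
    for lag in range(-max_lag, max_lag + 1):
        if lag >= 0:
            pairs = zip(ref[lag:], track)
        else:
            pairs = zip(ref, track[-lag:])
        scores[lag] = sum(r * t for r, t in pairs)
    best_lag = max(scores, key=scores.get)
    runner_up = max(s for lag, s in scores.items()
                    if abs(lag - best_lag) > guard)
    if runner_up <= 0:
        return best_lag, float("inf")
    return best_lag, scores[best_lag] / runner_up
```

Alignments whose confidence falls below a threshold could then be flagged for manual review rather than applied blindly, which is the practical point of estimating confidence at all.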