TRECVID 2008 - goals, tasks, data, evaluation mechanisms and metrics

Author: Awad George M.
Fiscus Jon
Kraaij Wessel
Over Paul
Rose Travis
Smeaton Alan F.
Publication venue: National Institute for Standards and Technology (NIST)
Publication date: 17/11/2008

The TREC Video Retrieval Evaluation (TRECVID) 2008 is a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in content-based exploitation of digital video via open, metrics-based evaluation. Over the last 7 years this effort has yielded a better understanding of how systems can effectively accomplish such processing and how one can reliably benchmark their performance. In 2008, 77 teams (see Table 1) from various research organizations (24 from Asia, 39 from Europe, 13 from North America, and 1 from Australia) participated in one or more of five tasks: high-level feature extraction, search (fully automatic, manually assisted, or interactive), pre-production video (rushes) summarization, copy detection, or surveillance event detection. The copy detection and surveillance event detection tasks are being run for the first time in TRECVID. This paper presents an overview of TRECVID in 2008.

Irish Universities

DCU Online Research Access Service

Radboud Repository

AXES at TRECVID 2012: KIS, INS, and MED

Author: Aly Robin
Arandjelovic Relja
Chatfield Ken
Chen Shu
Douze Matthijs
Fernando Basura
Harchaoui Zaid
McGuinness Kevin
O'Connor Noel E.
Oneata Dan
Parkhi Omkar M.
Potapov Danila
Revaud Jérôme
Schmid Cordelia
Schwenninger Jochen
Tuytelaars Tinne
Verbeek Jakob
Wang Heng
Zisserman Andrew
Publication date: 01/01/2012

The AXES project participated in the interactive instance search task (INS), the known-item search task (KIS), and the multimedia event detection task (MED) for TRECVid 2012. As in our TRECVid 2011 system, we used nearly identical search systems and user interfaces for both INS and KIS. Our interactive INS and KIS systems focused this year on using classifiers trained at query time with positive examples collected from external search engines. Participants in our KIS experiments were media professionals from the BBC; our INS experiments were carried out by students and researchers at Dublin City University. We performed comparatively well in both experiments. Our best KIS run found 13 of the 25 topics, and our best INS runs outperformed all other submitted runs in terms of P@100. For MED, the system presented was based on a minimal number of low-level descriptors, which we chose to be as large as computationally feasible. These descriptors are aggregated to produce high-dimensional video-level signatures, which are used to train a set of linear classifiers. Our MED system achieved the second-best score of all submitted runs in the main track, and best score in the ad-hoc track, suggesting that a simple system based on state-of-the-art low-level descriptors can give relatively high performance. This paper describes in detail our KIS, INS, and MED systems and the results and findings of our experiments.

Hal - Université Grenoble Alpes

Fraunhofer-ePrints

Irish Universities

INRIA a CCSD electronic archive server

DCU Online Research Access Service

HAL-Rennes 1

TRECVID 2007 - Overview

Author: Awad George M.
Kraaij Wessel
Over Paul
Smeaton Alan F.
Publication venue: 'University of Aden - Faculty of Economics and Administration'
Publication date: 01/11/2007

DCU Online Research Access Service

SAVASA project @ TRECVID 2012: interactive surveillance event detection

Author: Clawson Kathy
Direkoglu Cem
Gimenez Roberto
Jargalsaikhan Iveel
Li Hao
Little Suzanne
Martinez Llorens Ana
Mereu Anna
Nieto Marcos
O'Connor Noel E.
Rodriguez Aitor
Sanchez Pedro
Santos de la Camara Raul
Smeaton Alan F.
Villarroel Peniza Karina
Publication date: 26/11/2012

In this paper we describe our participation in the interactive surveillance event detection task at TRECVid 2012. The system we developed comprised individual classifiers brought together behind a simple video search interface that enabled users to select relevant segments based on downsampled animated GIFs. Two types of user, 'experts' and 'end users', performed the evaluations. Due to time constraints we focussed on three events (ObjectPut, PersonRuns and Pointing) and two of the five available cameras (1 and 3). Results from the interactive runs, as well as discussion of the performance of the underlying retrospective classifiers, are presented.

DCU Online Research Access Service

Search and hyperlinking task at MediaEval 2012

Author: Aly Robin
Chen Shu
Eskevich Maria
Jones Gareth J.F.
Larson Martha
Ordelman Roeland
Publication venue: CEUR-WS.org
Publication date: 04/10/2012

The Search and Hyperlinking Task was one of the Brave New Tasks at MediaEval 2012. The Task consisted of two subtasks which focused on search and linking in retrieval from a collection of semi-professional video content. These tasks followed up on research carried out within the MediaEval 2011 Rich Speech Retrieval (RSR) Task and the VideoCLEF 2009 Linking Task.

DCU Online Research Access Service

University of Twente Research Information

A database and challenge for acoustic scene classification and event detection

Author: Benetos E.
Giannoulis D.
Lagrange M.
Plumbley M. D.
Rossignol M.
Stowell D.
Publication date: 01/01/2013

City Research Online

Unified Embedding and Metric Learning for Zero-Exemplar Event Detection

Author: Gavves Efstratios
Hussein Noureldien
Smeulders Arnold W. M.
Publication date: 01/01/2017

Event detection in unconstrained videos is conceived as content-based video retrieval with two modalities: textual and visual. Given a text describing a novel event, the goal is to rank related videos accordingly. This task is zero-exemplar: no video examples are given for the novel event. Related works train a bank of concept detectors on external data sources. These detectors predict confidence scores for test videos, which are ranked and retrieved accordingly. In contrast, we learn a joint space in which the visual and textual representations are embedded. The space casts a novel event as a probability of pre-defined events. Also, it learns to measure the distance between an event and its related videos. Our model is trained end-to-end on the publicly available EventNet. When applied to the TRECVID Multimedia Event Detection dataset, it outperforms the state-of-the-art by a considerable margin. Comment: IEEE CVPR 201

arXiv.org e-Print Archive

UvA-DARE

International Migration, Integration and Social Cohesion online publications

TRECVID 2014 -- An Overview of the Goals, Tasks, Data, Evaluation Mechanisms and Metrics

Author: Awad George
Fiscus Jon
Joy David
Kraaij Wessel
Michel Martial
Over Paul
Quénot Georges
Sanders Greg
Smeaton Alan
Publication venue: HAL CCSD
Publication date: 01/01/2014

The TREC Video Retrieval Evaluation (TRECVID) 2014 was a TREC-style video analysis and retrieval evaluation, the goal of which remains to promote progress in content-based exploitation of digital video via open, metrics-based evaluation. Over the last dozen years this effort has yielded a better understanding of how systems can effectively accomplish such processing and how one can reliably benchmark their performance. TRECVID is funded by NIST with support from other US government agencies. Many organizations and individuals worldwide contribute significant time and effort.

Hal - Université Grenoble Alpes

Blip10000: a social video dataset containing SPUG content for tagging and retrieval

Author: Eskevich Maria
Estève Yannick
Ferrané Isabelle
Jones Gareth J.F.
Kofler Christoph
Lamel Lori
Larson Martha
Schmiedeke Sebastian
Sikora Thomas
Xu Peng
Publication venue: 'American College of Medical Physics (ACMP)'
Publication date: 27/02/2013

The increasing amount of digital multimedia content available is inspiring potential new types of user interaction with video data. Users want to easily find the content by searching and browsing. For this reason, techniques are needed that allow automatic categorisation, searching the content and linking to related information. In this work, we present a dataset that contains comprehensive semi-professional user generated (SPUG) content, including audiovisual content, user-contributed metadata, automatic speech recognition transcripts, automatic shot boundary files, and social information for multiple 'social levels'. We describe the principal characteristics of this dataset and present results that have been achieved on different tasks.

DCU Online Research Access Service