Search CORE

15,871 research outputs found

The CAMOMILE collaborative annotation platform for multi-modal, multi-lingual and multi-media documents

Author: Adda Gilles
Barras Claude
Bredin Herve
Budnik Mateusz
Hernando Pericás Francisco Javier
Mariani Joseph
Morros Rubió Josep Ramon
Poignant Johann
Publication venue: European Language Resources Association
Publication date: 01/01/2016
Field of study

In this paper, we describe the organization and the implementation of the CAMOMILE collaborative annotation framework for multimodal, multimedia, multilingual (3M) data. Given the versatile nature of the analysis which can be performed on 3M data, the structure of the server was kept intentionally simple in order to preserve its genericity, relying on standard Web technologies. Layers of annotations, defined as data associated to a media fragment from the corpus, are stored in a database and can be managed through standard interfaces with authentication. Interfaces tailored specifically to the needed task can then be developed in an agile way, relying on simple but reliable services for the management of the centralized annotations. We then present our implementation of an active learning scenario for person annotation in video, relying on the CAMOMILE server; during a dry run experiment, the manual annotation of 716 speech segments was thus propagated to 3504 labeled tracks. The code of the CAMOMILE framework is distributed in open source.Peer ReviewedPostprint (author's final draft

UPCommons. Portal del coneixement obert de la UPC

Crowdsourcing in Computer Vision

Author: Fei-Fei Li
Grauman Kristen
Kovashka Adriana
Russakovsky Olga
Publication venue: 'Now Publishers'
Publication date: 01/01/2016
Field of study

Computer vision systems require large amounts of manually annotated data to properly learn challenging visual concepts. Crowdsourcing platforms offer an inexpensive method to capture human knowledge and understanding, for a vast number of visual perception tasks. In this survey, we describe the types of annotations computer vision researchers have collected using crowdsourcing, and how they have ensured that this data is of high quality while annotation effort is minimized. We begin by discussing data collection on both classic (e.g., object recognition) and recent (e.g., visual story-telling) vision tasks. We then summarize key design decisions for creating effective data collection interfaces and workflows, and present strategies for intelligently selecting the most important data instances to annotate. Finally, we conclude with some thoughts on the future of crowdsourcing in computer vision.Comment: A 69-page meta review of the field, Foundations and Trends in Computer Graphics and Vision, 201

arXiv.org e-Print Archive

Crossref

A decade of work on semantic concept detection from video: TRECVid’s contribution

Author: Kraaij Wessel
Quénot Georges
Smeaton Alan F.
Publication venue
Publication date: 13/12/2012
Field of study

DCU Online Research Access Service

High-level feature detection from video in TRECVid: a 5-year retrospective of achievements

Author: A. F. Smeaton
A. F. Smeaton
A. Loui
A. P. Natsev
A. Smeulders
C. G. M. Snoek
C. G. Snoek
E. Yilmaz
M. G. Christel
M. Naphade
M. R. Naphade
P. Joly
P. Over
T. Volkmer
W. Kraaij
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Successful and effective content-based access to digital video requires fast, accurate and scalable methods to determine the video content automatically. A variety of contemporary approaches to this rely on text taken from speech within the video, or on matching one video frame against others using low-level characteristics like colour, texture, or shapes, or on determining and matching objects appearing within the video. Possibly the most important technique, however, is one which determines the presence or absence of a high-level or semantic feature, within a video clip or shot. By utilizing dozens, hundreds or even thousands of such semantic features we can support many kinds of content-based video navigation. Critically however, this depends on being able to determine whether each feature is or is not present in a video clip. The last 5 years have seen much progress in the development of techniques to determine the presence of semantic features within video. This progress can be tracked in the annual TRECVid benchmarking activity where dozens of research groups measure the effectiveness of their techniques on common data and using an open, metrics-based approach. In this chapter we summarise the work done on the TRECVid high-level feature task, showing the progress made year-on-year. This provides a fairly comprehensive statement on where the state-of-the-art is regarding this important task, not just for one research group or for one approach, but across the spectrum. We then use this past and on-going work as a basis for highlighting the trends that are emerging in this area, and the questions which remain to be addressed before we can achieve large-scale, fast and reliable high-level feature detection on video

CiteSeerX

Crossref

Irish Universities

DCU Online Research Access Service

Query independent measures of annotation and annotator impact

Author: Lanagan James
Smeaton Alan F.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2009
Field of study

The modern-day web-user plays a far more active role in the creation of content for the web as a whole. In this paper we present Annoby, a free-text annotation system built to give users a more interactive experience of the events of the Rugby World Cup 2007. Annotations can be used for query-independent ranking of both the annotations and the original recorded video footage (or documents) which has been annotated, based on the social interactions of a community of users. We present two algorithms, AuthorRank and MessageRank, designed to take advantage of these interactions so as to provide a means of ranking documents by their social impact

CiteSeerX

Irish Universities

DCU Online Research Access Service

Adaptive Information Cluster at Dublin City University

Author: Byrne Daragh
Lee Hyowon
McGivney Sinéad
Smeaton Alan F.
Publication venue: 'Glucksman Library University of Limerick'
Publication date: 01/05/2007
Field of study

The Adaptive Information Cluster (AIC) is a collaboration between Dublin City University and University College Dublin, and in the AIC at DCU, we investigate and develop as one stream of our research activities, various content analysis tools that can automatically index and structure video information. This includes movies or CCTV footage and the motivation is to support useful searching and browsing features for the envisaged end-users of such systems. We bring in the HCI perspective to this highly-technically-oriented research by brainstorming, generating scenarios, sketching and prototyping the user-interfaces to the resulting video retrieval systems we develop, and we conduct usability studies to better understand the usage and opinions of such systems so as to guide the future direction of our technological research

DCU Online Research Access Service

Recommended from our members

Semantic Concept Co-Occurrence Patterns for Image Annotation and Retrieval.

Author: Bhanu Bir
Feng Linan
Publication venue: eScholarship, University of California
Publication date: 01/04/2016
Field of study

Describing visual image contents by semantic concepts is an effective and straightforward way to facilitate various high level applications. Inferring semantic concepts from low-level pictorial feature analysis is challenging due to the semantic gap problem, while manually labeling concepts is unwise because of a large number of images in both online and offline collections. In this paper, we present a novel approach to automatically generate intermediate image descriptors by exploiting concept co-occurrence patterns in the pre-labeled training set that renders it possible to depict complex scene images semantically. Our work is motivated by the fact that multiple concepts that frequently co-occur across images form patterns which could provide contextual cues for individual concept inference. We discover the co-occurrence patterns as hierarchical communities by graph modularity maximization in a network with nodes and edges representing concepts and co-occurrence relationships separately. A random walk process working on the inferred concept probabilities with the discovered co-occurrence patterns is applied to acquire the refined concept signature representation. Through experiments in automatic image annotation and semantic image retrieval on several challenging datasets, we demonstrate the effectiveness of the proposed concept co-occurrence patterns as well as the concept signature representation in comparison with state-of-the-art approaches

eScholarship - University of California

Evaluating Interaction Techniques in an Interactive Workspace: Comparing the Effectiveness of a Textual Interface, Virtual Paths Interface, and ARIS

Author: Bailey Brian P.
Biehl Jacob T.
Publication venue
Publication date: 01/09/2004
Field of study

ARIS is an interface that enables users to visually relocate applications and redirect input among myriad devices in an interactive workspace. While we previously claimed that ARIS is more effective than other interfaces for performing these tasks, this work seeks to empirically validate our claim. We compared the use of ARIS to an interaction design of a text-based and virtual paths interface for relocating applications and redirecting input in an interactive workspace. Results show that (i) users can relocate applications and redirect input faster with ARIS than a text-based interface, (ii) users commit fewer errors with ARIS than a text-based interface, (iii) users experience less workload and are more satisfied with ARIS than a text-based interface, and (iv) ARIS was comparable to the use of a virtual paths interface. ARIS is more effective than an interaction design that requires a user to mentally map and select textual identifiers, while supporting functionality beyond that of a virtual paths interface

Illinois Digital Environment for Access to Learning and Scholarship Repository