The TREC2001 video track: information retrieval on digital video information
The development of techniques to support content-based access to archives of digital video information has recently started to receive much attention from the research community. During 2001, the annual TREC activity, which has been benchmarking the performance of information retrieval techniques on a range of media for 10 years, included a "track" or activity which allowed investigation into approaches to support searching through a video library. This paper is not intended to provide a comprehensive picture of the different approaches taken by the TREC2001 video track participants; instead, we give an overview of the TREC video search task and a thumbnail sketch of the approaches taken by different groups. The reason for writing this paper is to highlight the message from the TREC video track that there are now a variety of approaches available for searching and browsing through digital video archives, that these approaches do work, are scalable to larger archives and can yield useful retrieval performance for users. This has important implications in making digital libraries of video information attainable.
Físchlár-DiamondTouch: collaborative video searching on a table
In this paper we present the system we have developed for our participation in the annual TRECVid benchmarking activity, specifically the system, Físchlár-DT, developed for participation in the interactive search task of TRECVid 2005. Our back-end search engine uses a combination of a text search, which operates over the automatically speech-recognised text, and an image search, which uses low-level image features matched against video keyframes. The two novel aspects of our work are that we evaluate collaborative, team-based search among groups of users working together, and that we use a novel touch-sensitive tabletop interface and interaction device, known as the DiamondTouch, to support this collaborative search. The paper summarises the back-end search systems and presents, in detail, the interface we have developed.
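The fusion of text and image evidence described above can be sketched as a simple weighted combination. This is an illustrative sketch, not Físchlár-DT's actual implementation: the scoring functions, feature vectors, and the `alpha` weight are all assumptions.

```python
# Illustrative sketch of fusing two evidence sources for video shot retrieval:
# a text score over ASR transcripts and an image score over keyframe features.
# All function names and the fusion weight are hypothetical.
import math

def text_score(query_terms, transcript):
    # Simple term-frequency overlap over the ASR transcript.
    words = transcript.lower().split()
    return sum(words.count(t.lower()) for t in query_terms) / max(len(words), 1)

def image_score(query_feat, keyframe_feat):
    # Cosine similarity between low-level image feature vectors.
    dot = sum(a * b for a, b in zip(query_feat, keyframe_feat))
    na = math.sqrt(sum(a * a for a in query_feat))
    nb = math.sqrt(sum(b * b for b in keyframe_feat))
    return dot / (na * nb) if na and nb else 0.0

def fused_score(query_terms, transcript, query_feat, keyframe_feat, alpha=0.5):
    # Linear weighted fusion of the two evidence sources.
    return (alpha * text_score(query_terms, transcript)
            + (1 - alpha) * image_score(query_feat, keyframe_feat))
```

Shots would then be ranked by `fused_score`; in practice the weight between text and image evidence is usually tuned on held-out queries.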
Everything You Wanted to Know About MPEG-7: Part 1
Part I of this article provides an overview of the development, functionality, and applicability of MPEG-7. We'll first present the role of MPEG-7 within the context of past MPEG standards. We then outline ideas of what should be possible using MPEG-7 technology. In Part II, we'll discuss the description of MPEG-7's concepts, terminology, and requirements. We'll then compare MPEG-7 to other approaches to multimedia content description.
Multimedia Vocabularies on the Semantic Web
This document gives an overview of the state of the art in multimedia metadata formats. First, practically relevant vocabularies for developers of Semantic Web applications are listed according to their modality scope. In the second part of this document, the focus is on the integration of the multimedia vocabularies into the Semantic Web, that is, on formal representations of the vocabularies.
Digital Image Access & Retrieval
The 33rd Annual Clinic on Library Applications of Data Processing, held at the University of Illinois at Urbana-Champaign in March 1996, addressed the theme of "Digital Image Access & Retrieval." The papers from this conference cover a wide range of topics concerning digital imaging technology for visual resource collections, in three general areas: (1) systems, planning, and implementation; (2) automatic and semi-automatic indexing; and (3) preservation, with the bulk of the conference focusing on indexing and retrieval.
CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines
Based on the information provided by European projects and national initiatives related to multimedia search, as well as by domain experts who participated in the CHORUS Think-Tanks and workshops, this document reports on the state of the art in multimedia content search from a technical and socio-economic perspective.
The technical perspective includes an up-to-date view of content-based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmarking initiatives that measure the performance of multimedia search engines.
From a socio-economic perspective, we survey the impact and legal consequences of these technical advances and point out future directions of research.
Content Based Image Retrieval (CBIR) in Remote Clinical Diagnosis and Healthcare
Content-Based Image Retrieval (CBIR) locates, retrieves and displays images similar to one given as a query, using a set of features. It demands accessible data in medical archives and from medical equipment, from which meaning is inferred after some processing. A case similar in some sense to the target image can aid clinicians. CBIR complements text-based retrieval and improves evidence-based diagnosis, administration, teaching, and research in healthcare. It facilitates visual/automatic diagnosis and decision-making in real-time remote consultation/screening, store-and-forward tests, home care assistance and overall patient surveillance. Metrics help compare visual data and improve diagnosis. Specially designed architectures can benefit from the application scenario. CBIR use calls for file storage standardization, querying procedures, efficient image transmission, realistic databases, global availability, access simplicity, and Internet-based structures. This chapter recommends important and complex aspects required to handle visual content in healthcare.
Comment: 28 pages, 6 figures, book chapter from "Encyclopedia of E-Health and Telemedicine"
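At its core, the retrieval step the abstract describes is a nearest-neighbour search over image feature vectors. The following is a minimal sketch under that assumption; the feature representation (e.g., a colour histogram) and the distance measure are illustrative choices, not the chapter's prescribed method.

```python
# Minimal CBIR sketch: rank stored images by Euclidean distance between
# their feature vectors and the query's. Feature choice is illustrative.
import math

def euclidean(a, b):
    # Euclidean distance between two feature vectors of equal length.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def retrieve(query_feat, database, k=3):
    # database: list of (image_id, feature_vector) pairs.
    # Returns the ids of the k images most similar to the query.
    ranked = sorted(database, key=lambda item: euclidean(query_feat, item[1]))
    return [image_id for image_id, _ in ranked[:k]]
```

Real medical CBIR systems replace the toy distance with domain-tuned similarity metrics and index structures that scale to large archives.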
Who is the director of this movie? Automatic style recognition based on shot features
We show how low-level formal features, such as shot duration, meant as the length of camera takes, and shot scale, i.e. the distance between the camera and the subject, are distinctive of a director's style in art movies. So far such features were thought not to have enough variety to be distinctive of an author. However, our investigation of the full filmographies of six different authors (Scorsese, Godard, Tarr, Fellini, Antonioni, and Bergman), a total of 120 movies analysed second by second, confirms that these shot-related features do not appear as random patterns in movies from the same director. For feature extraction we adopt methods based on both conventional and deep learning techniques. Our findings suggest that feature sequential patterns, i.e. how features evolve in time, are at least as important as the related feature distributions. To the best of our knowledge this is the first study dealing with automatic attribution of movie authorship, which opens up interesting lines of cross-disciplinary research on the impact of style on the aesthetic and emotional effects on viewers.
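A heavily simplified version of attribution from shot features can be sketched as nearest-centroid classification on per-film statistics. This is not the paper's method (which also exploits sequential patterns and deep features); the statistics, the scale coding, and the centroid values are all made up for illustration.

```python
# Hypothetical sketch: attribute a film to a director by comparing simple
# per-film shot statistics to per-director centroids. Not the paper's method.

def film_stats(shot_durations, shot_scales):
    # shot_durations in seconds; shot_scales as numeric codes,
    # e.g. 1 = close-up ... 5 = long shot (coding is an assumption).
    n = len(shot_durations)
    return (sum(shot_durations) / n, sum(shot_scales) / n)

def nearest_director(stats, centroids):
    # centroids: {director_name: (mean_duration, mean_scale)}
    def sq_dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda d: sq_dist(stats, centroids[d]))
```

The paper's finding that temporal evolution matters suggests replacing these bag-of-shots statistics with sequence models over the shot streams.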
Interactive Image Data Labeling Using Self-Organizing Maps in an Augmented Reality Scenario
Bekel H, Heidemann G, Ritter H. Interactive Image Data Labeling Using Self-Organizing Maps in an Augmented Reality Scenario. Neural Networks. 2005;18(2005 Special Issue):566-574.
We present an approach for the convenient labeling of image patches gathered from an unrestricted environment. The system is employed with a mobile Augmented Reality (AR) gear: while the user walks around with the head-mounted AR gear, context-free modules for focus-of-attention permanently sample the most "interesting" image patches. After this acquisition phase, a Self-Organizing Map (SOM) is trained on the complete set of patches, using combinations of MPEG-7 features as a data representation. The SOM allows visualization of the sampled patches and easy manual sorting into categories. With very little effort, the user can compose a training set for a classifier; thus, unknown objects can be made known to the system. We evaluate the system on COIL imagery and demonstrate that a user can reach satisfying categorization within a few steps, even for image data sampled while walking in an office environment.