Search CORE

12 research outputs found

Is the Reign of Interactive Search Eternal? Findings from the Video Browser Showdown 2020

Author: Bailer Werner
Gurrin Cathal
Jónsson Björn Thór
Kovalčík Gregor
Lokoč Jakub
Mejzlík František
Rossetto Luca
Sauter Loris
Schoeffmann Klaus
Song Jaeyub
Souček Tomáš
Veselý Patrik
Vrochidis Stefanos
Wu Jiaxin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/07/2021
Field of study

The IT University of Copenhagen's Repository

ZORA

Interactive video retrieval evaluation at a distance: comparing sixteen interactive video search systems in a remote setting at the 10th Video Browser Showdown

Author: Bailer Werner
Gsteiger Viktor
Gurrin Cathal
Heiko Schuldt
Heller Silvan
Jónsson Björn Þór
Leibetseder Andreas
Lokoč Jakub
Mejzlík František
Peska Ladislav
Rossetto Luca
Schall Konstantin
Schoeffmann Klaus
Schuldt Spiess
Tran Ly-Duyen
Vadicamo Lucia
Veselý Patrik
Vrochidis Stefanos
Wu Jiaxin
Publication venue: Springer
Publication date: 01/01/2022
Field of study

The Video Browser Showdown addresses difficult video search challenges through an annual interactive evaluation campaign attracting research teams focusing on interactive video retrieval. The campaign aims to provide insights into the performance of participating interactive video retrieval systems, tested by selected search tasks on large video collections. For the first time in its ten year history, the Video Browser Showdown 2021 was organized in a fully remote setting and hosted a record number of sixteen scoring systems. In this paper, we describe the competition setting, tasks and results and give an overview of state-of-the-art methods used by the competing systems. By looking at query result logs provided by ten systems, we analyze differences in retrieval model performances and browsing times before a correct submission. Through advances in data gathering methodology and tools, we provide a comprehensive analysis of ad-hoc video search tasks, discuss results, task design and methodological challenges. We highlight that almost all top performing systems utilize some sort of joint embedding for text-image retrieval and enable specification of temporal context in queries for known-item search. Whereas a combination of these techniques drive the currently top performing systems, we identify several future challenges for interactive video search engines and the Video Browser Showdown competition itself

NEUROSURGERY ENTHUSIASTIC WOMEN SOCIETY

Exquisitor at the Video Browser Showdown 2021: Relationships Between Semantic Classifiers

Author: Jónsson Björn Thór
Khan Omar Shahbaz
Koelma Dennis C.
Larsen Mathias Dybkjær
Poulsen Liam Alex Sonto
Rudinac Stevan
Worring Marcel
Zahálka Jan
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/08/2021
Field of study

The IT University of Copenhagen's Repository

Exquisitor at the Lifelog Search Challenge 2020

Author: Jónsson Björn Thór
Khan Omar Shahbaz
Koelma Dennis C.
Larsen Mathias Dybkjær
Poulsen Liam Alex Sonto
Rudinac Stevan
Worring Marcel
Zahálka Jan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2020
Field of study

We present an enhanced version of Exquisitor, our interactive and scalable media exploration system. At its core, Exquisitor is an interactive learning system using relevance feedback on media items to build a model of the users' information need. Relying on efficient media representation and indexing, it facilitates real-time user interaction. The new features for the Lifelog Search Challenge 2020 include support for timeline browsing, search functionality for finding positive examples, and significant interface improvements. Participation in the Lifelog Search Challenge allows us to compare our paradigm, relying predominantly on interactive learning, with more traditional search-based multimedia retrieval systems

Crossref

The IT University of Copenhagen's Repository

International Migration, Integration and Social Cohesion online publications

UvA-DARE

A VR interface for browsing visual spaces at VBS2021

Author: Caputo Annalina
Gurrin Cathal
Healy Graham
Nguyen Binh T.
Nguyen Manh-Duy
Nguyen Thao-Nhu
Tran Ly-Duyen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 21/01/2021
Field of study

The Video Browser Showdown (VBS) is an annual competition in which each participant prepares an interactive video retrieval system and partakes in a live comparative evaluation at the annual MMMConference. In this paper, we introduce Eolas, which is a prototype video/image retrieval system incorporating a novel virtual reality (VR)interface. For VBS’21, Eolas represented each keyframe of the collection by an embedded feature in a latent vector space, into which a query would also be projected to facilitate retrieval within a VR environment. A user could then explore the space and perform one of a number of filter operations to traverse the space and locate the correct result

Irish Universities

DCU Online Research Access Service

Temporal multimodal video and lifelog retrieval

Author: Heller Silvan
Publication venue
Publication date: 01/01/2023
Field of study

The past decades have seen exponential growth of both consumption and production of data, with multimedia such as images and videos contributing significantly to said growth. The widespread proliferation of smartphones has provided everyday users with the ability to consume and produce such content easily. As the complexity and diversity of multimedia data has grown, so has the need for more complex retrieval models which address the information needs of users. Finding relevant multimedia content is central in many scenarios, from internet search engines and medical retrieval to querying one's personal multimedia archive, also called lifelog. Traditional retrieval models have often focused on queries targeting small units of retrieval, yet users usually remember temporal context and expect results to include this. However, there is little research into enabling these information needs in interactive multimedia retrieval. In this thesis, we aim to close this research gap by making several contributions to multimedia retrieval with a focus on two scenarios, namely video and lifelog retrieval. We provide a retrieval model for complex information needs with temporal components, including a data model for multimedia retrieval, a query model for complex information needs, and a modular and adaptable query execution model which includes novel algorithms for result fusion. The concepts and models are implemented in vitrivr, an open-source multimodal multimedia retrieval system, which covers all aspects from extraction to query formulation and browsing. vitrivr has proven its usefulness in evaluation campaigns and is now used in two large-scale interdisciplinary research projects. We show the feasibility and effectiveness of our contributions in two ways: firstly, through results from user-centric evaluations which pit different user-system combinations against one another. Secondly, we perform a system-centric evaluation by creating a new dataset for temporal information needs in video and lifelog retrieval with which we quantitatively evaluate our models. The results show significant benefits for systems that enable users to specify more complex information needs with temporal components. Participation in interactive retrieval evaluation campaigns over multiple years provides insight into possible future developments and challenges of such campaigns

edoc

LifeSeeker 2.0: interactive lifelog search engine at LSC 2020

Author: Gurrin Cathal
Healy Graham
Le Tu-Khiem
Nguyen Hai-Dang
Nguyen Thanh-An
Ninh Van-Tu
Tran Minh-Triet
Zhou Liting
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/06/2020
Field of study

In this paper we present our interactive lifelog retrieval engine in the LSC’20 comparative benchmarking challenge. The LifeSeeker 2.0 interactive lifelog retrieval engine is developed by both Dublin City University and Ho Chi Minh University of Science, which represents an enhanced version of the two corresponding interactive lifelog retrieval engines in LSC’19. The implementation of LifeSeeker 2.0 has been designed to focus on the searching by text query using a Bag-of-Words model with visual concept augmentation and additional improvements in query processing time, enhanced result display and browsing support, and interacting with visual graphs for both query and filter purposes

DCU Online Research Access Service

Exquisitor:Interactive Learning for Multimedia

Author: Khan Omar Shahbaz
Publication venue: IT-Universitetet i København
Publication date: 01/01/2022
Field of study

The IT University of Copenhagen's Repository

FIRST - Flexible interactive retrieval SysTem for visual lifelog exploration at LSC 2020

Author: Do Trong-Le
Gurrin Cathal
Le Hoang-Anh
Le Tu-Khiem
Nguyen Hai-Dang
Nguyen Khanh
Nguyen Thanh-An
Ninh Van-Tu
Tran Mai-Khiem
Tran Minh-Triet
Tran Quoc-Cuong
Trang-Trung Hoang-Phuc
Vo-Ho Viet-Khoa
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/06/2020
Field of study

Lifelog can provide useful insights of our daily activities. It is essential to provide a flexible way for users to retrieve certain events or moments of interest, corresponding to a wide variation of query types. This motivates us to develop FIRST, a Flexible Interactive Retrieval SysTem, to help users to combine or integrate various query components in a flexible manner to handle different query scenarios, such as visual clustering data based on color histogram, visual similarity, GPS location, or scene attributes. We also employ personalized concept detection and image captioning to enhance image understanding from visual lifelog data, and develop an autoencoderlike approach for query text and image feature mapping. Furthermore, we refine the user interface of the retrieval system to better assist users in query expansion and verifying sequential events in a flexible temporal resolution to control the navigation speed through sequences of images

Crossref

DCU Online Research Access Service