Region-Based Image Retrieval Revisited
The region-based image retrieval (RBIR) technique is revisited. In early attempts
at RBIR in the late 90s, researchers found many ways to specify region-based
queries and spatial relationships; however, the means of characterizing the
regions, such as color histograms, were very poor at that time. Here,
we revisit RBIR by incorporating semantic specification of objects and
intuitive specification of spatial relationships. Our contributions are the
following. First, to support multiple aspects of semantic object specification
(category, instance, and attribute), we propose a multitask CNN feature that
allows us to use deep learning techniques and to jointly handle multi-aspect
object specification. Second, to help users specify spatial relationships among
objects in an intuitive way, we propose techniques for recommending spatial
relationships. In particular, by mining the search results, a system can
recommend feasible spatial relationships among the objects. The system can also
recommend likely spatial relationships from the assigned object category names
based on a language prior. Moreover, object-level inverted indexing supports very
fast shortlist generation, and re-ranking based on spatial constraints provides
users with an instant RBIR experience.
Comment: To appear in ACM Multimedia 2017 (Oral)
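The shortlist-then-re-rank idea from the abstract can be illustrated with a minimal sketch. This is an assumed, simplified illustration, not the paper's implementation: a toy object-level inverted index maps each object label to the images (and bounding boxes) containing it, shortlisting is a set intersection over the queried labels, and re-ranking applies a simple spatial predicate. All class and function names (`ObjectInvertedIndex`, `left_of`) and the example data are hypothetical.

```python
# Sketch of object-level inverted indexing for shortlist generation,
# followed by spatial-constraint re-ranking (illustrative, not the paper's code).
from collections import defaultdict

class ObjectInvertedIndex:
    def __init__(self):
        # label -> {image_id: [bounding boxes as (x, y, w, h)]}
        self.postings = defaultdict(lambda: defaultdict(list))

    def add(self, image_id, label, box):
        self.postings[label][image_id].append(box)

    def shortlist(self, labels):
        """Images containing ALL queried object labels (set intersection)."""
        sets = [set(self.postings[label]) for label in labels]
        return set.intersection(*sets) if sets else set()

def left_of(box_a, box_b):
    # Toy spatial predicate: centre of A lies left of centre of B.
    return box_a[0] + box_a[2] / 2 < box_b[0] + box_b[2] / 2

index = ObjectInvertedIndex()
index.add("img1", "person", (10, 20, 30, 60))
index.add("img1", "dog", (60, 40, 25, 20))
index.add("img2", "person", (80, 10, 30, 60))
index.add("img2", "dog", (10, 50, 25, 20))
index.add("img3", "person", (5, 5, 30, 60))

# Shortlist: images with both a person and a dog.
candidates = index.shortlist(["person", "dog"])  # {'img1', 'img2'}
# Re-rank: keep only images satisfying "person left of dog".
hits = [i for i in sorted(candidates)
        if any(left_of(p, d)
               for p in index.postings["person"][i]
               for d in index.postings["dog"][i])]
print(hits)  # ['img1']
```

The point of the inverted index is that shortlist generation touches only the postings for the queried labels, so the (more expensive) spatial check runs on a small candidate set rather than the whole collection.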
Prototypes for Content-Based Image Retrieval in Clinical Practice
Content-based image retrieval (CBIR) has been proposed as a key technology for computer-aided diagnostics (CAD). This paper reviews the state of the art and future challenges in CBIR for CAD applied to clinical practice.
Spatio-temporal Video Re-localization by Warp LSTM
The need to efficiently find the video content a user wants is growing with the
explosion of user-generated videos on the Web. Existing
keyword-based or content-based video retrieval methods usually determine what
occurs in a video, but not when and where. In this paper, we answer
the question of when and where by formulating a new task, namely
spatio-temporal video re-localization. Specifically, given a query video and a
reference video, spatio-temporal video re-localization aims to localize
tubelets in the reference video such that the tubelets semantically correspond
to the query. To accurately localize the desired tubelets in the reference
video, we propose a novel warp LSTM network, which propagates
spatio-temporal information over a long period and thereby captures the
corresponding long-term dependencies. Another issue for spatio-temporal video
re-localization is the lack of properly labeled video datasets. Therefore, we
reorganize the videos in the AVA dataset to form a new dataset for
spatio-temporal video re-localization research. Extensive experimental results
show that the proposed model achieves superior performance over the designed
baselines on the spatio-temporal video re-localization task.
Spott : on-the-spot e-commerce for television using deep learning-based video analysis techniques
Spott is an innovative second-screen mobile multimedia application which offers viewers relevant information on objects (e.g., clothing, furniture, food) they see and like on their television screens. The application enables interaction between TV audiences and brands, so producers and advertisers can offer potential consumers tailored promotions, e-shop items, and/or free samples. In line with the current views on innovation management, the technological excellence of the Spott application is coupled with iterative user involvement throughout the entire development process. This article discusses both of these aspects and how they impact each other. First, we focus on the technological building blocks that facilitate the (semi-)automatic interactive tagging process of objects in the video streams. Most of these building blocks make extensive use of novel, state-of-the-art deep learning concepts and methodologies. We show how these deep learning-based video analysis techniques facilitate video summarization, semantic keyframe clustering, and (similar) object retrieval. Second, we provide insights into the user tests that have been performed to evaluate and optimize the application's user experience. The lessons learned from these open field tests have already been an essential input to the technology development and will further shape future modifications to the Spott application.
Detecting regions of interest using eye tracking for CBIR
Identifying Regions of Interest (ROIs) in images has been shown to be an effective way to enhance the performance of Content-Based Image Retrieval (CBIR). Most existing ROI identification methods are based on salience detection, and the identified ROIs may not be the regions that users are really interested in. While manual selection of ROIs can directly reflect users' interests, it places extra cognitive overhead on users. To alleviate these limitations, in this paper we propose a novel eye-tracking-based method to detect ROIs for CBIR in an unobtrusive way. Experimental results demonstrate that our model performs effectively compared with various state-of-the-art methods.
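The core step of turning gaze data into ROIs can be sketched as follows. This is an assumed illustration rather than the paper's method: fixation points are grouped by naive distance-threshold clustering, and each cluster's bounding box is taken as a candidate ROI to feed into CBIR. The function names (`cluster_fixations`, `bounding_box`), the 50-pixel radius, and the sample fixations are all hypothetical.

```python
# Illustrative sketch (not the paper's method): group gaze fixation points
# into ROIs by naive distance-threshold clustering, then take each cluster's
# bounding box as a region of interest for CBIR.
def cluster_fixations(points, radius=50):
    """Assign each fixation to the first cluster within `radius` (Chebyshev
    distance) of any member, otherwise start a new cluster."""
    clusters = []
    for p in points:
        for c in clusters:
            if any(abs(p[0] - q[0]) <= radius and abs(p[1] - q[1]) <= radius
                   for q in c):
                c.append(p)
                break
        else:
            clusters.append([p])
    return clusters

def bounding_box(cluster):
    """Axis-aligned bounding box (x_min, y_min, x_max, y_max) of a cluster."""
    xs = [p[0] for p in cluster]
    ys = [p[1] for p in cluster]
    return (min(xs), min(ys), max(xs), max(ys))

# Hypothetical fixation points in image coordinates (pixels).
fixations = [(100, 100), (110, 95), (105, 120), (400, 300), (410, 310)]
rois = [bounding_box(c) for c in cluster_fixations(fixations)]
print(rois)  # [(100, 95, 110, 120), (400, 300, 410, 310)]
```

In a real system the fixations would come from an eye tracker's gaze samples, and a proper clustering or saliency-weighted method would replace this greedy grouping; the sketch only shows the overall data flow from gaze points to candidate regions.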
Information extraction from multimedia web documents: an open-source platform and testbed
The LivingKnowledge project aimed to enhance the current state of the art in search, retrieval and knowledge management on the web by advancing the use of sentiment and opinion analysis within multimedia applications. To achieve this aim, a diverse set of novel and complementary analysis techniques has been integrated into a single but extensible software platform on which such applications can be built. The platform combines state-of-the-art techniques for extracting facts, opinions and sentiment from multimedia documents, and unlike earlier platforms, it exploits both visual and textual techniques to support multimedia information retrieval. Foreseeing the usefulness of this software to the wider community, the platform has been made generally available as an open-source project. This paper describes the platform design, gives an overview of the analysis algorithms integrated into the system and describes two applications that utilise the system for multimedia information retrieval.