14,764 research outputs found

Visual Features with Semantic Combination Using Bayesian Network for a More Effective Image Retrieval

Author: Barrat Sabine
Tabbone Salvatore
Publication venue: HAL CCSD
Publication date: 08/12/2008

In many vision problems, it is easier to obtain annotations for just a subset of the training data than to annotate it fully, because this is less restrictive for the user. For this reason, this paper considers the problem of weakly-annotated image retrieval, where only a small subset of the database is annotated with keywords. We present and evaluate a new method that improves the effectiveness of content-based image retrieval by integrating semantic concepts extracted from text. Our model is inspired by probabilistic graphical model theory: we propose a hierarchical mixture model that can handle missing values and capture the user's preference through a relevance feedback process. Results of visual-textual retrieval combined with relevance feedback, reported on a partially and manually annotated database of images collected from the Web, show an improvement of about 44.5% in recognition rate over content-based retrieval.

INRIA a CCSD electronic archive server

HAL Descartes

Hal-Diderot

Interactive retrieval of video using pre-computed shot-shot similarities

Author: Boldareva L.
Hiemstra D.
Publication venue: Institution of Engineering and Technology
Publication date: 01/01/2005

A probabilistic framework for content-based interactive video retrieval is described. The indexing of video fragments is derived from the probability of the user's positive judgment about key-frames of video shots. Initial estimates of the probabilities are obtained from a low-level feature representation. Only statistically significant estimates are kept; the rest are replaced by an appropriate constant, which allows efficient access at search time without loss of search quality and leads to improvement in most experiments. Over time, these probability estimates are updated from the relevance judgments of users performing searches, resulting in further substantial increases in mean average precision.
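
The idea of keeping only the significant estimates and replacing the rest with a constant can be illustrated with a small sketch. This is not the authors' implementation: the class name `SparseShotSimilarity`, the threshold-based "significance" test, and the sample values are all assumptions made for illustration.

```python
from typing import Dict, Tuple


class SparseShotSimilarity:
    """Stores only selected shot-pair estimates; all other pairs share one constant.

    Hypothetical illustration, not the indexing structure from the paper.
    """

    def __init__(self, default: float) -> None:
        self.default = default
        self._kept: Dict[Tuple[int, int], float] = {}

    def fit(self, estimates: Dict[Tuple[int, int], float], threshold: float) -> None:
        # Assumed stand-in for a statistical significance test: keep an
        # estimate only if it differs from the default by more than `threshold`.
        self._kept = {
            pair: p
            for pair, p in estimates.items()
            if abs(p - self.default) > threshold
        }

    def get(self, i: int, j: int) -> float:
        # Pairs that were not kept fall back to the shared constant.
        return self._kept.get((i, j), self.default)

    def __len__(self) -> int:
        return len(self._kept)


# Made-up estimates for four shot pairs.
estimates = {(0, 1): 0.90, (0, 2): 0.12, (1, 2): 0.08, (2, 3): 0.55}
index = SparseShotSimilarity(default=0.10)
index.fit(estimates, threshold=0.05)
print(len(index), index.get(0, 2), index.get(0, 1))
```

Under these assumptions only two of the four estimates are stored, and lookups for the other pairs return the shared constant.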

Radboud Repository

University of Twente Research Information

The relationship between IR and multimedia databases

Author: Blanken H.M.
Vries A.P. de
Publication venue: British Computer Society (BCS)
Publication date: 01/01/1998

Modern extensible database systems support multimedia data through ADTs. However, because of the problems with multimedia query formulation, this support is not sufficient. Multimedia querying requires an iterative search process involving many different representations of the objects in the database. The support that is needed is very similar to the processes in information retrieval. Based on this observation, we develop the miRRor architecture for multimedia query processing. We design a layered framework based on information retrieval techniques, to provide a usable query interface to the multimedia database. First, we introduce a concept layer to enable reasoning over low-level concepts in the database. Second, we add an evidential reasoning layer as an intermediate between the user and the concept layer. Third, we add the functionality to process the users' relevance feedback. We then adapt the inference network model from text retrieval to an evidential reasoning model for multimedia query processing. We conclude with an outline for implementation of miRRor on top of the Monet extensible database system.

CiteSeerX

Crossref

CWI's Institutional Repository

University of Twente Research Information

Beyond English text: Multilingual and multimedia information retrieval.

Author: Jones Gareth J.F.
Publication venue: Springer Fachmedien Wiesbaden GmbH
Publication date: 01/01/2005

Non

CiteSeerX

DCU Online Research Access Service

Formal models, usability and related work in IR (editorial for special edition)

Author: Dunlop M.D.
Lalmas M.
Publication venue
Publication date: 01/01/2000

The Glasgow IR group has carried out both theoretical and empirical work, aimed at giving end users efficient and effective access to large collections of multimedia data

University of Strathclyde Institutional Repository

The Mirror MMDBMS architecture

Author: Apers P.M.G.
Blanken H.M.
Doorn M.G.L.M. van
Vries A.P. de
Publication venue: Morgan Kaufmann Publishers Inc.
Publication date: 01/01/1999

Handling large collections of digitized multimedia data, usually referred to as multimedia digital libraries, is a major challenge for information technology. The Mirror DBMS is a research database system developed to better understand the kind of data management that is required in the context of multimedia digital libraries (see also http://www.cs.utwente.nl/~arjen/mmdb.html). Its main features are an integrated approach to both content management and (traditional) structured data management, and the implementation of an extensible object-oriented logical data model on a binary relational physical data model. This work focuses on design for scalability.

CWI's Institutional Repository

University of Twente Research Information

An adaptive technique for content-based image retrieval

Author: AHM Hofstede ter
Cornelis J. van Rijsbergen
DM Squire
G Salton
I Campbell
IJ Cox
J Peng
Jana Urban
JJ Rocchio
Joemon M. Jose
M Chalmers
M Dunlop
M Markkula
MJ Swain
MK Hu
P Ingwersen
S Santini
XS Zhou
Publication venue: Springer Science and Business Media LLC
Publication date: 27/07/2006

We discuss an adaptive approach towards Content-Based Image Retrieval. It is based on the Ostensive Model of developing information needs—a special kind of relevance feedback model that learns from implicit user feedback and adds a temporal notion to relevance. The ostensive approach supports content-assisted browsing through visualising the interaction by adding user-selected images to a browsing path, which ends with a set of system recommendations. The suggestions are based on an adaptive query learning scheme, in which the query is learnt from previously selected images. Our approach is an adaptation of the original Ostensive Model based on textual features only, to include content-based features to characterise images. In the proposed scheme textual and colour features are combined using the Dempster-Shafer theory of evidence combination. Results from a user-centred, work-task oriented evaluation show that the ostensive interface is preferred over a traditional interface with manual query facilities. This is due to its ability to adapt to the user's need, its intuitiveness and the fluid way in which it operates. Studying and comparing the nature of the underlying information need, it emerges that our approach elicits changes in the user's need based on the interaction, and is successful in adapting the retrieval to match the changes. In addition, a preliminary study of the retrieval performance of the ostensive relevance feedback scheme shows that it can outperform a standard relevance feedback strategy in terms of image recall in category search
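
As a rough illustration of the evidence combination mentioned above, the following sketch applies Dempster's rule of combination to two mass functions over a two-element frame. The frame (`relevant` / `non_relevant`), the mass values, and the function name `combine` are assumptions for illustration; the abstract does not specify how the textual and colour evidence are actually modelled.

```python
from itertools import product
from typing import Dict, FrozenSet

Mass = Dict[FrozenSet[str], float]


def combine(m1: Mass, m2: Mass) -> Mass:
    """Dempster's rule: multiply masses of intersecting focal sets, then renormalise."""
    combined: Mass = {}
    conflict = 0.0
    for (a, wa), (b, wb) in product(m1.items(), m2.items()):
        inter = a & b
        if inter:
            combined[inter] = combined.get(inter, 0.0) + wa * wb
        else:
            # Mass assigned to contradictory hypotheses is counted as conflict.
            conflict += wa * wb
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    return {s: w / (1.0 - conflict) for s, w in combined.items()}


REL = frozenset({"relevant"})
NON = frozenset({"non_relevant"})
EITHER = REL | NON  # mass left uncommitted ("don't know")

# Made-up masses for one image from two feature types.
text_evidence = {REL: 0.6, EITHER: 0.4}
colour_evidence = {REL: 0.5, NON: 0.2, EITHER: 0.3}

fused = combine(text_evidence, colour_evidence)
for focal_set, mass in fused.items():
    print(sorted(focal_set), round(mass, 4))
```

With these made-up inputs the conflicting mass is 0.12, so the surviving masses are divided by 0.88 and the combined belief in `relevant` is higher than either source gave on its own.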

Crossref

Enlighten

Information-Theoretic Active Learning for Content-Based Image Retrieval

Author: A Freytag
A Genz
A Lütz
AW Smeulders
B Demir
E Rodner
IJ Cox
O Russakovsky
S Ayache
TN Cardoso
Y Yang
Publication venue: Springer Science and Business Media LLC
Publication date: 19/03/2019

We propose Information-Theoretic Active Learning (ITAL), a novel batch-mode active learning method for binary classification, and apply it for acquiring meaningful user feedback in the context of content-based image retrieval. Instead of combining different heuristics such as uncertainty, diversity, or density, our method is based on maximizing the mutual information between the predicted relevance of the images and the expected user feedback regarding the selected batch. We propose suitable approximations to this computationally demanding problem and also integrate an explicit model of user behavior that accounts for possible incorrect labels and unnameable instances. Furthermore, our approach takes into account not only the structure of the data but also the expected model output change caused by the user feedback. In contrast to other methods, ITAL turns out to be highly flexible and provides state-of-the-art performance across various datasets, such as MIRFLICKR and ImageNet. Comment: GCPR 2018 paper (14 pages text + 2 pages references + 6 pages appendix).
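
To make the mutual-information criterion more concrete, here is a heavily simplified sketch. It is not ITAL: it scores each image independently rather than a whole batch, and it assumes a user whose binary label is flipped with a fixed probability. The names `feedback_information`, `select_batch`, and `flip_rate` are hypothetical.

```python
import math
from typing import List


def binary_entropy(q: float) -> float:
    """Entropy in bits of a Bernoulli(q) variable."""
    if q <= 0.0 or q >= 1.0:
        return 0.0
    return -q * math.log2(q) - (1.0 - q) * math.log2(1.0 - q)


def feedback_information(p_relevant: float, flip_rate: float) -> float:
    """I(R; F) = H(F) - H(F | R) for relevance R and noisy feedback F.

    Assumes the user reports the true label but flips it with
    probability `flip_rate` (a simplifying assumption).
    """
    p_positive = p_relevant * (1.0 - flip_rate) + (1.0 - p_relevant) * flip_rate
    return binary_entropy(p_positive) - binary_entropy(flip_rate)


def select_batch(probs: List[float], batch_size: int, flip_rate: float = 0.1) -> List[int]:
    # Independent per-image ranking; a true batch criterion would also
    # model dependencies between the selected images.
    ranked = sorted(
        range(len(probs)),
        key=lambda i: feedback_information(probs[i], flip_rate),
        reverse=True,
    )
    return ranked[:batch_size]


# Made-up predicted relevance probabilities for four images.
predicted = [0.95, 0.5, 0.1, 0.6]
print(select_batch(predicted, batch_size=2))
```

In this simplified setting the most informative images are those whose predicted relevance is closest to 0.5, and the information gained shrinks to zero as the assumed flip rate approaches 0.5.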

arXiv.org e-Print Archive

Crossref