Thematico-Visual Image Retrieval: How to Deal With Partially Indexed Corpora

Abstract

Peer-reviewed international conference with proceedings. International audience.

It has become very easy to access large numbers of images when surfing the Internet. Not all of these images, however, are thematically indexed. We think that partially thematically indexed corpora can be organised in a way that facilitates retrieval. We assume that, with respect to visual properties, the corpus is fully indexed by means of "generic" features. Based on these indexes, a hierarchical clustering technique is used to bring together images that share some similarities: two distinct structures ("dendrograms") are built. We propose a new retrieval strategy based on a virtual image that captures the user's need throughout the retrieval session, taking into account both thematic and visual aspects. Clusters are successively selected in each dendrogram. A combined method, called tunnels, allows the dendrograms to cooperate. Images are then ranked according to the virtual image. After each retrieval step, the virtual image is enriched through a relevance feedback process: the theme, colour and general layout of each image can be rated, and the query is updated accordingly. In our experiments, we used two different corpora (2470 and 1100 images) to assess the performance of our thematico-visual approach under different indexing conditions. The experimental results confirm the relevance of our approach and suggest possible improvements.
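
The feedback loop described in the abstract (rate images, update the virtual image, re-rank) can be illustrated with a minimal sketch. The abstract does not give the update rule or the distance measure, so everything here is an assumption: the Rocchio-style update, the Euclidean distance, the 2-D feature vectors, and the names update_virtual_image and rank_images are all hypothetical and are not taken from the paper. The sketch also uses a single feature vector, whereas the paper rates theme, colour and layout separately.

```python
from math import dist


def update_virtual_image(virtual, rated, rate=0.5):
    """Move the virtual image toward liked images and away from disliked ones.

    rated is a list of (features, score) pairs with score in [-1, 1].
    This Rocchio-style rule is an assumption, not the paper's method.
    """
    if not rated:
        return list(virtual)
    updated = list(virtual)
    for features, score in rated:
        for i, value in enumerate(features):
            # Each rated image contributes an averaged, signed step.
            updated[i] += rate * score * (value - virtual[i]) / len(rated)
    return updated


def rank_images(virtual, corpus):
    """Return image ids sorted by Euclidean distance to the virtual image."""
    return sorted(corpus, key=lambda image_id: dist(virtual, corpus[image_id]))


# Hypothetical toy corpus: image id -> 2-D feature vector.
corpus = {
    "sunset": [0.9, 0.1],
    "forest": [0.1, 0.9],
    "beach": [0.7, 0.3],
}

virtual = [0.5, 0.5]  # neutral starting query
feedback = [(corpus["sunset"], 1.0), (corpus["forest"], -1.0)]
virtual = update_virtual_image(virtual, feedback)
ranking = rank_images(virtual, corpus)
print(ranking)
```

In this toy run the user likes "sunset" and rejects "forest", so the virtual image drifts toward the first feature and "beach" ends up ranked ahead of "forest". A fuller version would restrict the ranking to the clusters selected in each dendrogram rather than scanning the whole corpus.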
