Search CORE

14,970 research outputs found

Towards an All-Purpose Content-Based Multimedia Information Retrieval System

Author: Gasser Ralph
Rossetto Luca
Schuldt Heiko
Publication venue
Publication date: 01/01/2019
Field of study

The growth of multimedia collections - in terms of size, heterogeneity, and variety of media types - necessitates systems that are able to conjointly deal with several forms of media, especially when it comes to searching for particular objects. However, existing retrieval systems are organized in silos and treat different media types separately. As a consequence, retrieval across media types is either not supported at all or subject to major limitations. In this paper, we present vitrivr, a content-based multimedia information retrieval stack. As opposed to the keyword search approach implemented by most media management systems, vitrivr makes direct use of the object's content to facilitate different types of similarity search, such as Query-by-Example or Query-by-Sketch, for and, most importantly, across different media types - namely, images, audio, videos, and 3D models. Furthermore, we introduce a new web-based user interface that enables easy-to-use, multimodal retrieval from and browsing in mixed media collections. The effectiveness of vitrivr is shown on the basis of a user study that involves different query and media types. To the best of our knowledge, the full vitrivr stack is unique in that it is the first multimedia retrieval system that seamlessly integrates support for four different types of media. As such, it paves the way towards an all-purpose, content-based multimedia information retrieval system

arXiv.org e-Print Archive

edoc

Visual Information Retrieval in Endoscopic Video Archives

Author: Anagnostopoulos Nektarios
Giró-i-Nieto Xavier
Lux Mathias
Muñoz Pia
Roldan-Carlos Jennifer
Publication venue
Publication date: 01/01/2015
Field of study

In endoscopic procedures, surgeons work with live video streams from the inside of their subjects. A main source for documentation of procedures are still frames from the video, identified and taken during the surgery. However, with growing demands and technical means, the streams are saved to storage servers and the surgeons need to retrieve parts of the videos on demand. In this submission we present a demo application allowing for video retrieval based on visual features and late fusion, which allows surgeons to re-find shots taken during the procedure.Comment: Paper accepted at the IEEE/ACM 13th International Workshop on Content-Based Multimedia Indexing (CBMI) in Prague (Czech Republic) between 10 and 12 June 201

arXiv.org e-Print Archive

Crossref

UPCommons. Portal del coneixement obert de la UPC

Extending, trimming and fusing WordNet for technical documents

Author: Vossen P.
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2001
Field of study

This paper describes a tool for the automatic extension and trimming of a multilingual WordNet database for cross-lingual retrieval and multilingual ontology building in intranets and domain-specific document collections. Hierarchies, built from automatically extracted terms and combined with the WordNet relations, are trimmed with a disambiguation method based on the document salience of the words in the glosses. The disambiguation is tested in a cross-lingual retrieval task, showing considerable improvement (7%-11%). The condensed hierarchies can be used as browse-interfaces to the documents complementary to retrieval

CiteSeerX

VU Research Portal

The relationship between IR and multimedia databases

Author: Blanken H.M.
Vries A.P. de
Publication venue: British Computer Society (BCS)
Publication date: 01/01/1998
Field of study

Modern extensible database systems support multimedia data through ADTs. However, because of the problems with multimedia query formulation, this support is not sufficient.\ud \ud Multimedia querying requires an iterative search process involving many different representations of the objects in the database. The support that is needed is very similar to the processes in information retrieval.\ud \ud Based on this observation, we develop the miRRor architecture for multimedia query processing. We design a layered framework based on information retrieval techniques, to provide a usable query interface to the multimedia database.\ud \ud First, we introduce a concept layer to enable reasoning over low-level concepts in the database.\ud \ud Second, we add an evidential reasoning layer as an intermediate between the user and the concept layer.\ud \ud Third, we add the functionality to process the users' relevance feedback.\ud \ud We then adapt the inference network model from text retrieval to an evidential reasoning model for multimedia query processing.\ud \ud We conclude with an outline for implementation of miRRor on top of the Monet extensible database system

CiteSeerX

Crossref

CWI's Institutional Repository

University of Twente Research Information

VITALAS at TRECVID-2008

Author: Aly Robin
Delopoulos Anastasios
Dimitriou Nikos
Diou Christos
Panagiotopoulos Panagiotis
Papachristou Christos
Rode Henning
Stephanopoulos George
Tsikrika Theodora
Vries Arjen P. de
Publication venue: National Institute of Standards and Technology, NIST
Publication date: 01/11/2008
Field of study

In this paper, we present our experiments in TRECVID 2008 about High-Level feature extraction task. This is the first year for our participation in TRECVID, our system adopts some popular approaches that other workgroups proposed before. We proposed 2 advanced low-level features NEW Gabor texture descriptor and the Compact-SIFT Codeword histogram. Our system applied well-known LIBSVM to train the SVM classifier for the basic classifier. In fusion step, some methods were employed such as the Voting, SVM-base, HCRF and Bootstrap Average AdaBoost(BAAB)

CWI's Institutional Repository

University of Twente Research Information