In this paper we describe an approach that merges Semantic Web technologies (RDF, SKOS) and content based multimedia information annotation and retrieval techniques (MPEG-7). We demonstrate our approach with a system that allows a user to discover and select pieces of relevant information from video clips. The system uses both textual metadata such as duration of video, producer's name, main title and subtitle, copyright owner, keywords, production and broadcast dates, etc., and MPEG-7 low-level visual color and texture features that are extracted from key frames of each shot. The visual features are clustered using self-organizing maps and the centroid key frames are selected as visual "words". The relationships among visual words are represented using the RDF and SKOS standards and are used, together with textual metadata, for efficient faceted navigation. 1
To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.