521 research outputs found

iCLEF 2006 Overview: Searching the Flickr WWW photo-sharing repository

Author: Clough Paul
Gonzalo Julio
Karlgren Jussi
Publication date: 01/01/2006

This paper summarizes the task design for iCLEF 2006 (the CLEF interactive track). Compared to previous years, we have proposed a radically new task: searching images in a naturally multilingual database, Flickr, which has millions of photographs shared by people all over the planet, tagged and described in a wide variety of languages. Participants are expected to build a multilingual search front-end to Flickr (using Flickr’s search API) and study the behaviour of users for a given set of search tasks. The emphasis is on studying the process rather than evaluating its outcome.

Crossref

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

White Rose Research Online

Living Knowledge

Author: Baldry Anthony
Dutta Biswanath
Giunchiglia Fausto
Maltese Vincenzo
Publication venue: Università degli Studi di Trento
Publication date: 01/03/2012

Diversity, especially as manifested in language and knowledge, is a function of local goals, needs, competences, beliefs, culture, opinions and personal experience. The Living Knowledge project considers diversity an asset rather than a problem. Within the project, foundational ideas that emerged from the synergistic contribution of different disciplines, methodologies (with which many partners were previously unfamiliar) and technologies flowed into concrete diversity-aware applications, such as the Future Predictor and the Media Content Analyser, which provide users with better structured information while coping with Web-scale complexities. The key notions of diversity, fact, opinion and bias have been defined in relation to three methodologies: Media Content Analysis (MCA), which operates from a social sciences perspective; Multimodal Genre Analysis (MGA), which operates from a semiotic perspective; and Facet Analysis (FA), which operates from a knowledge representation and organization perspective. A conceptual architecture that pulls all of them together has become the core of the tools for automatic extraction and of the way they interact. In particular, the conceptual architecture has been implemented in the Media Content Analyser application. The scientific and technological results obtained are described in the following.

Unitn-eprints Research

Promoting Social Media Dissemination of Digital Images Through CBR-Based Tag Recommendation

Author: Cordero-Gutiérrez Rebeca
De La Iglesia Daniel H.
Martín-Gómez Lucía
Pérez-Marcos Javier
Publication venue: Universidad Internacional de La Rioja
Publication date: 13/12/2022

Multimedia content has become an essential tool to share knowledge, sell products or disseminate messages. Some social networks use multimedia content to promote information and create social communities. To increase the impact of digital content, images or videos are labeled with words called tags. In this paper, we propose a recommender system that analyzes multimedia content and suggests tags to maximize its influence in the social community. It implements a Case-Based Reasoning (CBR) architecture, which allows it to learn from previously tagged content. The system has been evaluated through cross-fold validation with training and validation sets carefully constructed and extracted from Instagram. The results demonstrate that the system can suggest good options to label an image and maximize the influence of the multimedia content.

Re-UNIR

Enrichment and ranking of the YouTube tag space and integration with the Linked Data cloud

Author: A. Hotho
M. Naphade
M.R. Quillian
P. Mika
S. Angeletou
S. Golder
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2009

The spread of personal digital cameras with video functionality and of video-enabled camera phones has increased the amount of user-generated video on the Web. People are spending more and more time viewing online videos as a major source of entertainment and “infotainment”. Social websites allow users to assign shared free-form tags to user-generated multimedia resources, thus generating annotations for objects with a minimum amount of effort. Tagging allows communities to organise their multimedia items into browseable sets, but these tags may be poorly chosen and related tags may be omitted. Current techniques to retrieve, integrate and present this media to users are deficient and could be improved. In this paper, we describe a framework for semantic enrichment, ranking and integration of web video tags using Semantic Web technologies. Semantic enrichment of folksonomies can bridge the gap between the uncontrolled, flat structures typically found in user-generated content and the structures provided by the Semantic Web. The enhancement of tag spaces with semantics has been accomplished through two major tasks: a tag space expansion and ranking step, and concept matching and integration with the Linked Data cloud. We have explored social, temporal and spatial contexts to enrich and extend the existing tag space. The resulting semantic tag space is modelled via a local graph based on co-occurrence distances for ranking. A ranked tag list is mapped and integrated with the Linked Data cloud through the DBpedia resource repository. Multi-dimensional context filtering for tag expansion makes tag ranking much easier and provides less ambiguous tag-to-concept matching.

CiteSeerX

Crossref

Open Research Online (The Open University)
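
As a rough illustration of the co-occurrence-based ranking mentioned in the abstract of "Enrichment and ranking of the YouTube tag space and integration with the Linked Data cloud", the sketch below ranks candidate tags by how often they co-occur with a set of seed tags. It is a simplification under stated assumptions, not the authors' method: raw co-occurrence counts stand in for the paper's graph of co-occurrence distances, and the function names and toy corpus are hypothetical.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(tagged_items):
    """Count how often each unordered pair of tags appears on the same item."""
    counts = Counter()
    for tags in tagged_items:
        # Sorting makes each pair canonical, so (a, b) and (b, a) share a key.
        for a, b in combinations(sorted(set(tags)), 2):
            counts[(a, b)] += 1
    return counts


def rank_candidates(seed_tags, tagged_items):
    """Rank tags outside seed_tags by their total co-occurrence with the seeds."""
    counts = cooccurrence_counts(tagged_items)
    seeds = set(seed_tags)
    scores = Counter()
    for (a, b), n in counts.items():
        if a in seeds and b not in seeds:
            scores[b] += n
        elif b in seeds and a not in seeds:
            scores[a] += n
    # Highest score first; ties are broken alphabetically for a stable order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical toy corpus: each inner list is the tag set of one video.
corpus = [
    ["paris", "eiffel", "tower"],
    ["paris", "louvre"],
    ["paris", "eiffel", "night"],
    ["london", "tower"],
]
ranked = rank_candidates(["paris"], corpus)
print(ranked)
```

In this toy corpus, "eiffel" ranks first for the seed "paris" because it shares two videos with it, while the other candidates share one each.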

A Data-Driven Approach for Tag Refinement and Localization in Web Videos

Author: Ballan Lamberto
Bertini Marco
Del Bimbo Alberto
Serra Giuseppe
Publication venue: Elsevier BV
Publication date: 01/01/2015

Tagging of visual content is becoming more and more widespread as web-based services and social networks have popularized tagging functionalities among their users. These user-generated tags are used to ease browsing and exploration of media collections, e.g. using tag clouds, or to retrieve multimedia content. However, not all media are equally tagged by users. With current systems it is easy to tag a single photo, and even tagging a part of a photo, like a face, has become common on sites like Flickr and Facebook. On the other hand, tagging a video sequence is more complicated and time-consuming, so users just tag the overall content of a video. In this paper we present a method for automatic video annotation that increases the number of tags originally provided by users and localizes them temporally, associating tags with keyframes. Our approach exploits collective knowledge embedded in user-generated tags and web sources, and visual similarity of keyframes and images uploaded to social sites like YouTube and Flickr, as well as web sources like Google and Bing. Given a keyframe, our method is able to select on the fly from these visual sources the training exemplars that should be the most relevant for this test sample, and proceeds to transfer labels across similar images. Compared to existing video tagging approaches that require training classifiers for each tag, our system has few parameters, is easy to implement and can deal with an open vocabulary scenario. We demonstrate the approach on tag refinement and localization on DUT-WEBV, a large dataset of web videos, and show state-of-the-art results. Comment: Preprint submitted to Computer Vision and Image Understanding (CVIU).

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Università degli Studi di Udine

Florence Research

Archivio istituzionale della ricerca - Università di Modena e Reggio Emilia

Archivio istituzionale della ricerca - Università di Padova
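
The label-transfer idea in the abstract of "A Data-Driven Approach for Tag Refinement and Localization in Web Videos" (pick the most visually similar exemplars for a keyframe, then carry their tags over) can be sketched roughly as follows. This is a minimal nearest-neighbour illustration under stated assumptions, not the authors' system: the two-dimensional feature vectors, the cosine similarity measure, the similarity-weighted voting and all names are hypothetical.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def transfer_tags(keyframe, exemplars, k=3, top_n=2):
    """Suggest tags for a keyframe from its k most similar tagged exemplars.

    exemplars is a list of (feature_vector, tags) pairs.
    """
    # Score every exemplar once, then keep the k most similar.
    scored = [(cosine(keyframe, features), tags) for features, tags in exemplars]
    neighbours = sorted(scored, key=lambda st: st[0], reverse=True)[:k]
    votes = Counter()
    for similarity, tags in neighbours:
        for tag in set(tags):
            # Each neighbour votes for its tags, weighted by similarity.
            votes[tag] += similarity
    ranked = sorted(votes.items(), key=lambda kv: (-kv[1], kv[0]))
    return [tag for tag, _ in ranked[:top_n]]


# Hypothetical exemplars: toy 2-D features with user-provided tags.
exemplars = [
    ([1.0, 0.0], ["beach", "sea"]),
    ([0.9, 0.1], ["beach", "sand"]),
    ([0.0, 1.0], ["city", "night"]),
    ([0.1, 0.9], ["city"]),
]
suggested = transfer_tags([1.0, 0.05], exemplars, k=2, top_n=1)
print(suggested)
```

Because no per-tag classifier is trained, new tags enter the vocabulary simply by appearing on exemplars, which is the open-vocabulary property the abstract highlights.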

Multimedia Annotation Interoperability Framework

Author: Pan J.Z. (Jeff)
Troncy R. (Raphael)
Tzouvaras V.
Publication venue: W3C
Publication date: 01/01/2007

Multimedia systems typically contain digital documents of mixed media types, which are indexed on the basis of strongly divergent metadata standards. This severely hampers the inter-operation of such systems. Therefore, machine understanding of metadata coming from different applications is a basic requirement for the inter-operation of distributed multimedia systems. In this document, we present how interoperability among metadata, vocabularies/ontologies and services is enhanced using Semantic Web technologies. In addition, the document provides guidelines for semantic interoperability, illustrated by use cases. Finally, it presents an overview of the most commonly used metadata standards and tools, and provides the general research direction for semantic interoperability using Semantic Web technologies.

CWI's Institutional Repository

Utilising semantic technologies for intelligent indexing and retrieval of digital images

Author: A Smeulders
C Fellbau
C Lee
Dhavalkumar Thakker
F Gaihua
Gerald Schaefer
J Bhogal
J Vogel
K Wu
L Ying
N Noy
N Shadbolt
R Datta
Taha Osman
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2013

The proliferation of digital media has led to a huge interest in classifying and indexing media objects for generic search and usage. In particular, we are witnessing colossal growth in digital image repositories that are difficult to navigate using free-text search mechanisms, which often return inaccurate matches because they rely, in principle, on statistical analysis of query keyword recurrence in the image annotation or surrounding text. In this paper we present a semantically-enabled image annotation and retrieval engine that is designed to satisfy the requirements of the commercial image collections market in terms of both accuracy and efficiency of the retrieval process. Our search engine relies on methodically structured ontologies for image annotation, thus allowing for more intelligent reasoning about the image content and subsequently obtaining a more accurate set of results and a richer set of alternatives matching the original query. We also show how our well-analysed and designed domain ontology contributes to the implicit expansion of user queries as well as the exploitation of lexical databases for explicit semantic-based query expansion.

Crossref

Nottingham Trent Institutional Repository (IRep)

Bradford Scholars

Multimedia Annotations for Practical Collaborative Reasoning

Author: Cebrián-de-la-Serna Manuel
Cebrián-Robles Violeta
Gallego-Arrufat María Jesús
Publication venue: University of Alicante
Publication date: 01/01/2021

University education requires students to be trained both at university and at external internship centres. Because of Covid-19, the availability of multimedia resources and examples of practical contexts has become vital. Multimedia annotation can help students reflect on the professional world, collaborating and interacting with colleagues online. This study aims to encourage collaborative practical thinking by using new video annotation technologies. 274 students participated in a task-design experiment focusing on the analysis of a technology-based, award-winning educational innovation project. Using a mixed research design, qualitative and quantitative data exported from the video annotation platform were collected and analysed. The results show differences in the quality and quantity of the answers: in the tasks with broad Folksonomy they are more numerous but more dispersed in their analysis, and vice versa. The quality of the answers given with narrow Folksonomy is also higher in both text and video modes. Producing multimedia annotations is a practical way to encourage students to practise reflective reasoning about the professional reality. Ministry of Science and Innovation, Spain (Award: EDU2013-41974-P).

Journal of New Approaches in Educational Research (Alicante University)

Repositorio Institucional de la Universidad de Alicante

Directory of Open Access Journals

Repositorio Institucional Universidad de Málaga

DIALNET

Social Tagging: Exploring the Image, the Tags, and the Game

Author: Butterworth R.
Goker A. S.
Konkova E.
MacFarlane A.
Publication venue: ERGON-VERLAG
Publication date: 01/01/2014

An increasing number of images are being uploaded, shared, and retrieved on the Web. These large image collections need to be properly stored, organized and easily retrieved. Tags have a key role in image retrieval, but it is difficult for those who upload the images to also assign quality tags for potential future retrieval by others. Relying on professional keyword assignment is not a practical option for large image collections due to resource constraints. Although a number of content-based image retrieval systems have been launched, they have not demonstrated sufficient utility on large-scale image sources on the web, and are usually used as a supplement to existing text-based image retrieval systems. An alternative to professional image indexing can be social tagging, with two major types being photo-sharing networks and image labeling games. Here we analyze these applications to evaluate their usefulness from the semantic point of view. We also investigate whether social tagging behaviour can be managed. The findings of the study show that social tagging can generate a sizeable number of tags that can be classified as interpretive for an image, and that tagging behaviour is manageable and adjustable depending on tagging guidelines.

CiteSeerX

City Research Online