3,553 research outputs found

CHORUS Deliverable 2.1: State of the Art on Multimedia Search Engines

Author: Boujemaa Nozha
Compañó Ramón
Dosch Christoph
Geurts Joost
Karlgren Jussi
King Paul
Kompatsiaris Yiannis
Köhler Joachim
Le Moine Jean-Yves
Ortgies Robert
Point Jean-Charles
Rotenberg Boris
Rudström Åsa
Sebe Nicu
Publication venue: Chorus Project Consortium
Publication date: 01/01/2007

Based on information provided by European projects and national initiatives related to multimedia search, as well as by domain experts who participated in the CHORUS think-tanks and workshops, this document reports on the state of the art in multimedia content search from a technical and a socio-economic perspective. The technical perspective includes an up-to-date view of content-based indexing and retrieval technologies, multimedia search in the context of mobile devices and peer-to-peer networks, and an overview of current evaluation and benchmarking initiatives that measure the performance of multimedia search engines. From a socio-economic perspective, we take stock of the impact and legal consequences of these technical advances and point out future directions of research.

RISE – Research Institutes of Sweden

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Swedish Institute of Computer Science Publications Database

Software institutes' Online Digital Archive

The IMMED Project: Wearable Video Monitoring of People with Age Dementia

Author: André-Obrecht Régine
Benois-Pineau Jenny
Dartigues Jean-François
Dovgalecs Vladislavs
El Khoury Elie
Gaëstel Yann
Joly Philippe
Karaman Svebor
Mégret Rémi
Pinquier Julien
Wannous Hazem
Publication venue: HAL CCSD
Publication date: 25/10/2010

In this paper, we describe a new application for multimedia indexing: a system that monitors the instrumental activities of daily living to assess the cognitive decline caused by dementia. The system consists of a wearable camera device designed to capture audio and video data of a patient's instrumental activities, combined with multimedia indexing techniques so that medical specialists can efficiently analyze observation shots that are several hours long.

Scientific Publications of the University of Toulouse II Le Mirail

HAL-Inserm

Evaluation Methodologies for Visual Information Retrieval and Annotation

Author: Nowak Stefanie
Publication date: 09/03/2012

Performance assessment plays a major role in research on Information Retrieval (IR) systems. Starting with the Cranfield experiments in the early 1960s, methodologies for system-based performance assessment emerged and established themselves, resulting in an active research field with a number of successful benchmarking activities. Evaluation methods were first applied to text analysis systems. With the rise of the digital age, procedures of text retrieval evaluation were often transferred to multimedia retrieval evaluation without questioning their direct applicability or adapting them to the changed conditions of multimedia analysis.

This thesis investigates the problem of system-based performance assessment of annotation approaches in generic image collections. It addresses three important parts of annotation evaluation: user requirements for the retrieval of annotated visual media, performance measures for multi-label evaluation, and visual test collections. Using the example of multi-label image annotation evaluation, I discuss which concepts to employ for indexing, how to obtain a reliable ground truth at moderate cost and effort, and which evaluation measures are appropriate. This is accompanied by a thorough analysis of related work on system-based performance assessment in Visual Information Retrieval (VIR). Traditional performance measures are classified into four dimensions and investigated according to their appropriateness for visual annotation evaluation.

One of the main ideas in this thesis questions the common assumption that the score prediction dimension in annotation evaluation is binary. Evaluation measures usually assign binary costs to correct and incorrect annotations, which contradicts the nature of image concepts: the predicted concepts and the set of true indexed concepts interrelate, and the co-occurrence of concepts determines their semantic relationship. This work shows how semantic similarities between visual concepts can be estimated automatically and used for a fine-grained evaluation scenario. The outcomes of this thesis are a user model for concept-based image retrieval, a fully assessed image annotation test collection, and a number of novel performance measures for image annotation evaluation.
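The contrast this abstract draws between binary scoring and semantically informed scoring of annotations can be illustrated with a minimal sketch. It is not the thesis's measure: the function names, the similarity table and its values are all hypothetical, and precision is used only as a convenient example of a measure that can be softened.

```python
from typing import Dict, FrozenSet, Set

# Hypothetical concept-similarity table; the values are invented for
# illustration and are not taken from the thesis.
SIMILARITY: Dict[FrozenSet[str], float] = {
    frozenset({"sea", "lake"}): 0.5,
    frozenset({"sea", "beach"}): 0.75,
}


def similarity(a: str, b: str) -> float:
    """Return a similarity in [0, 1]; identical concepts score 1.0."""
    if a == b:
        return 1.0
    return SIMILARITY.get(frozenset({a, b}), 0.0)


def binary_precision(predicted: Set[str], truth: Set[str]) -> float:
    """Classic precision: a predicted concept is either right or wrong."""
    if not predicted:
        return 0.0
    return len(predicted & truth) / len(predicted)


def soft_precision(predicted: Set[str], truth: Set[str]) -> float:
    """Credit each predicted concept with its best similarity to any true concept."""
    if not predicted or not truth:
        return 0.0
    credit = sum(max(similarity(p, t) for t in truth) for p in predicted)
    return credit / len(predicted)


predicted = {"sea", "sky"}
truth = {"lake", "sky"}
binary = binary_precision(predicted, truth)
soft = soft_precision(predicted, truth)
```

Under these assumed similarities, predicting "sea" for an image labelled "lake" counts as a complete miss in the binary measure but earns partial credit in the soft one.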

Fraunhofer-ePrints

Digitale Bibliothek Thüringen

Artificial Intelligence in the Creative Industries: A Review

Author: Anantrasirichai Nantheera
Bull David
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/07/2021

This paper reviews the current state of the art in Artificial Intelligence (AI) technologies and applications in the context of the creative industries. A brief background of AI, and specifically Machine Learning (ML) algorithms, is provided, including Convolutional Neural Networks (CNNs), Generative Adversarial Networks (GANs), Recurrent Neural Networks (RNNs) and Deep Reinforcement Learning (DRL). We categorise creative applications into five groups related to how AI technologies are used: i) content creation, ii) information analysis, iii) content enhancement and post production workflows, iv) information extraction and enhancement, and v) data compression. We critically examine the successes and limitations of this rapidly advancing technology in each of these areas. We further differentiate between the use of AI as a creative tool and its potential as a creator in its own right. We foresee that, in the near future, machine learning-based AI will be adopted widely as a tool or collaborative assistant for creativity. In contrast, we observe that the successes of machine learning in domains with fewer constraints, where AI is the 'creator', remain modest. The potential of AI (or its developers) to win awards for its original creations in competition with human creatives is also limited, based on contemporary technologies. We therefore conclude that, in the context of creative industries, maximum benefit from AI will be derived where its focus is human-centric, that is, where it is designed to augment, rather than replace, human creativity.

arXiv.org e-Print Archive

Explore Bristol Research

A Probabilistic Multimedia Retrieval Model and its Evaluation

Author: de Jong Franciska M.G.
de Vries A.J.
de Vries A.P.
Hiemstra Djoerd
Sayed A.H.
van Ballegooij A.
Westerveld T.H.W.
Publication venue: Hindawi Publishing
Publication date: 01/01/2003

We present a probabilistic model for the retrieval of multimodal documents. The model is based on Bayesian decision theory and combines models for text-based search with models for visual search. The textual model is based on the language modelling approach to text retrieval, and the visual information is modelled as a mixture of Gaussian densities. Both models have proved successful on various standard retrieval tasks. We evaluate the multimodal model on the search task of TREC's video track. We found that the disclosure of video material based on visual information only is still too difficult. Even with purely visual information needs, text-based retrieval still outperforms visual approaches. The probabilistic model is useful for text, visual, and multimedia retrieval. Unfortunately, simplifying assumptions that reduce its computational complexity degrade retrieval effectiveness. As to whether the model can effectively combine information from different modalities, we conclude that whenever both modalities yield reasonable scores, a combined run outperforms the individual runs.
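The combination of a language model for text with a Gaussian mixture for visual features can be sketched roughly as follows. This is an illustrative sketch, not the paper's implementation: the linear interpolation smoothing, the diagonal covariances, the assumption that the two modalities are independent (so their log-likelihoods add), and all names and parameter values are assumptions made here.

```python
import math
from collections import Counter
from typing import List, Sequence, Tuple

# One mixture component: (weight, means, variances), diagonal covariance assumed.
Component = Tuple[float, Sequence[float], Sequence[float]]


def text_log_likelihood(query: List[str], doc: List[str],
                        collection: List[str], lam: float = 0.5) -> float:
    """Log-probability of the query terms under a smoothed unigram document model."""
    doc_counts, coll_counts = Counter(doc), Counter(collection)
    total = 0.0
    for term in query:
        p_doc = doc_counts[term] / len(doc) if doc else 0.0
        p_coll = coll_counts[term] / len(collection) if collection else 0.0
        p = lam * p_doc + (1.0 - lam) * p_coll  # interpolate with collection model
        if p == 0.0:
            return float("-inf")
        total += math.log(p)
    return total


def gaussian_density(x: Sequence[float], means: Sequence[float],
                     variances: Sequence[float]) -> float:
    """Density of x under a Gaussian with diagonal covariance."""
    density = 1.0
    for xi, mu, var in zip(x, means, variances):
        density *= math.exp(-((xi - mu) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)
    return density


def visual_log_likelihood(samples: List[Sequence[float]],
                          mixture: List[Component]) -> float:
    """Log-probability of the query's feature vectors under a document's mixture."""
    total = 0.0
    for x in samples:
        p = sum(w * gaussian_density(x, m, v) for w, m, v in mixture)
        if p == 0.0:
            return float("-inf")
        total += math.log(p)
    return total


def combined_score(text_ll: float, visual_ll: float) -> float:
    """Assumes the modalities are independent, so log-likelihoods add."""
    return text_ll + visual_ll


# Toy data (hypothetical): two documents, each with text and a 1-D visual model.
doc_a, doc_b = ["boat", "sea", "sea"], ["car", "road"]
collection = doc_a + doc_b
mix_a: List[Component] = [(1.0, [0.0], [1.0])]
mix_b: List[Component] = [(1.0, [5.0], [1.0])]
query_terms, query_samples = ["sea"], [[0.1]]

score_a = combined_score(text_log_likelihood(query_terms, doc_a, collection),
                         visual_log_likelihood(query_samples, mix_a))
score_b = combined_score(text_log_likelihood(query_terms, doc_b, collection),
                         visual_log_likelihood(query_samples, mix_b))
```

Documents are ranked by the combined score; in this toy setup the document that matches both the query term and the query's visual sample ranks first.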

CiteSeerX

Springer - Publisher Connector

Directory of Open Access Journals

University of Twente Research Information

An architecture for semantic integration of data and medical images

Author: Millán Marta
Trujillo Maria
Valencia Daniel
Publication date: 12/05/2015

Abstract: In organisations providing health services, there are different data sources (e.g. clinical history, demographic data, DICOM files) of diverse nature, which are scattered across storage media and come from heterogeneous sources. Additionally, the DICOM format stores both patient information and medical images. These files are managed in PACS; however, PACS do not provide diagnostic support tools. This paper presents a data integration architecture oriented to enriching medical images with metadata extracted from a controlled vocabulary. The architecture was instantiated in a prototype that provides image annotation mechanisms (manual and automatic) and strategies for searching and retrieving data and images that differ from the traditional ones based on keywords or MPEG-7 descriptors. The annotation is based on a controlled vocabulary that is part of a taxonomy of medical concepts and terms.
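The idea of annotating images with terms from a controlled vocabulary that sits inside a taxonomy, and then searching by concept, can be sketched as below. This is a toy illustration only, not the architecture described in the paper: the taxonomy fragment, image identifiers and function names are all hypothetical.

```python
from typing import Dict, List, Set

# Hypothetical taxonomy fragment: each term maps to its broader term.
BROADER: Dict[str, str] = {
    "pneumonia": "lung disease",
    "lung disease": "disease",
    "fracture": "injury",
}
# The controlled vocabulary is every term that appears in the taxonomy.
VOCABULARY: Set[str] = set(BROADER) | set(BROADER.values())


def ancestors(term: str) -> List[str]:
    """Return all broader terms of `term`, nearest first."""
    result: List[str] = []
    while term in BROADER:
        term = BROADER[term]
        result.append(term)
    return result


def annotate(index: Dict[str, Set[str]], image_id: str, terms: List[str]) -> None:
    """Attach vocabulary terms to an image; reject terms outside the vocabulary."""
    for term in terms:
        if term not in VOCABULARY:
            raise ValueError(f"'{term}' is not in the controlled vocabulary")
        index.setdefault(image_id, set()).add(term)


def search(index: Dict[str, Set[str]], concept: str) -> List[str]:
    """Find images annotated with `concept` or with any narrower term."""
    return sorted(
        image_id
        for image_id, terms in index.items()
        if any(t == concept or concept in ancestors(t) for t in terms)
    )


index: Dict[str, Set[str]] = {}
annotate(index, "img-001", ["pneumonia"])
annotate(index, "img-002", ["fracture"])
hits = search(index, "lung disease")
```

Because the search walks the taxonomy, a query for a broad concept also returns images annotated with narrower terms, which a plain keyword match would miss.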

Biblioteca Digital de la Universidad del Valle

Cloud-Based Benchmarking of Medical Image Analysis

Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/04/2020

Medical imagin

Directory of Open Access Books (DOAB)