26 research outputs found
High Dynamic Range Imaging by Perceptual Logarithmic Exposure Merging
In this paper we emphasize a similarity between the Logarithmic-Type Image
Processing (LTIP) model and the Naka-Rushton model of the Human Visual System
(HVS). LTIP is a derivation of the Logarithmic Image Processing (LIP), which
further replaces the logarithmic function with a ratio of polynomial functions.
Based on this similarity, we show that it is possible to present an unifying
framework for the High Dynamic Range (HDR) imaging problem, namely that
performing exposure merging under the LTIP model is equivalent to standard
irradiance map fusion. The resulting HDR algorithm is shown to provide high
quality in both subjective and objective evaluations.Comment: 14 pages 8 figures. Accepted at AMCS journa
Pandora: Description of a Painting Database for Art Movement Recognition with Baselines and Perspectives
To facilitate computer analysis of visual art, in the form of paintings, we
introduce Pandora (Paintings Dataset for Recognizing the Art movement)
database, a collection of digitized paintings labelled with respect to the
artistic movement. Noting that the set of databases available as benchmarks for
evaluation is highly reduced and most existing ones are limited in variability
and number of images, we propose a novel large scale dataset of digital
paintings. The database consists of more than 7700 images from 12 art
movements. Each genre is illustrated by a number of images varying from 250 to
nearly 1000. We investigate how local and global features and classification
systems are able to recognize the art movement. Our experimental results
suggest that accurate recognition is achievable by a combination of various
categories.To facilitate computer analysis of visual art, in the form of
paintings, we introduce Pandora (Paintings Dataset for Recognizing the Art
movement) database, a collection of digitized paintings labelled with respect
to the artistic movement. Noting that the set of databases available as
benchmarks for evaluation is highly reduced and most existing ones are limited
in variability and number of images, we propose a novel large scale dataset of
digital paintings. The database consists of more than 7700 images from 12 art
movements. Each genre is illustrated by a number of images varying from 250 to
nearly 1000. We investigate how local and global features and classification
systems are able to recognize the art movement. Our experimental results
suggest that accurate recognition is achievable by a combination of various
categories.Comment: 11 pages, 1 figure, 6 table
Video genre categorization and representation using audio-visual information
International audienceWe propose an audio-visual approach to video genre classification using content descriptors that exploit audio, color, temporal, and contour information. Audio information is extracted at block-level, which has the advantage of capturing local temporal information. At the temporal structure level, we consider action content in relation to human perception. Color perception is quantified using statistics of color distribution, elementary hues, color properties, and relationships between colors. Further, we compute statistics of contour geometry and relationships. The main contribution of our work lies in harnessingn the descriptive power of the combination of these descriptors in genre classification. Validation was carried out on over 91 h of video footage encompassing 7 common video genres, yielding average precision and recall ratios of 87% to 100% and 77% to 100%, respectively, and an overall average correct classification of up to 97%. Also, experimental comparison as part of the MediaEval 2011 benchmarkingn campaign demonstrated the efficiency of the proposed audiovisual descriptors over other existing approaches. Finally, we discuss a 3-D video browsing platform that displays movies using efaturebased coordinates and thus regroups them according to genre
An audio-visual approach to web video categorization
International audienceIn this paper we address the issue of automatic video genre categorization of web media using an audio-visual approach. To this end, we propose content descriptors which exploit audio, temporal structure and color information. The potential of our descriptors is experimentally validated both from the perspective of a classification system and as an information retrieval approach. Validation is carried out on a real scenario, namely on more than 288 hours of video footage and 26 video genres specific to blip.tv media platform. Additionally, to reduce semantic gap, we propose a new relevance feedback technique which is based on hierarchical clustering. Experimental tests prove that retrieval performance can be significantly increased in this case, becoming comparable to the one obtained with high level semantic textual descriptors
A Fuzzy Color Credibility Approach To Color Image Filtering
This contribution proposes a fuzzy approach to color image filtering by the fuzzy modeling of the concept of color credibility. Based on the perceptual notion of color resemblance, the colors are modeled as fuzzy sets in the color space. The filtering principle is to select at the filters output the color that is the most credible with respect to the rest of colors within the filtering window. Although the approach does not make any assumption on the desired filter type, the result is similar to a vector median-type filter. 1. INTRODUCTION During the last decade much attention has been devoted to the study of the arising field of vector (or multichannel) image processing. The progresses of the multispectral sensing devices and the growing power of computers enabled and imposed the consideration of images characterized in each pixel by a vector (color images being the most common example) . Since the beginning, it was noticed that the "stack of scalars" approach to multichan..