Search CORE

9,950 research outputs found

Co-Regularized Deep Representations for Video Summarization

Author: Chandrasekhar Vijay
Goh Hanlin
Lin Jie
Morère Olivier
Veillard Antoine
Publication venue
Publication date: 30/01/2015
Field of study

Compact keyframe-based video summaries are a popular way of generating viewership on video sharing platforms. Yet, creating relevant and compelling summaries for arbitrarily long videos with a small number of keyframes is a challenging task. We propose a comprehensive keyframe-based summarization framework combining deep convolutional neural networks and restricted Boltzmann machines. An original co-regularization scheme is used to discover meaningful subject-scene associations. The resulting multimodal representations are then used to select highly-relevant keyframes. A comprehensive user study is conducted comparing our proposed method to a variety of schemes, including the summarization currently in use by one of the most popular video sharing websites. The results show that our method consistently outperforms the baseline schemes for any given amount of keyframes both in terms of attractiveness and informativeness. The lead is even more significant for smaller summaries.Comment: Video summarization, deep convolutional neural networks, co-regularized restricted Boltzmann machine

arXiv.org e-Print Archive

Crossref

Classification of Time-Series Images Using Deep Convolutional Neural Networks

Author: Abdel-Hamid
Armano
Bouvrie
Chen
Cui
Dalto
Deng
Eads
Graves
Hatami
Hatami
Krizhevsky
Krizhevsky
LeCun
LeCun
Lee
Nanopoulos
Rakthanmanon
Rodriguez
Senin
Simonyan
Souza
Souza
Wang
Wang
Wang
Xing
Yang
Zheng
Publication venue
Publication date: 07/10/2017
Field of study

Convolutional Neural Networks (CNN) has achieved a great success in image recognition task by automatically learning a hierarchical feature representation from raw data. While the majority of Time-Series Classification (TSC) literature is focused on 1D signals, this paper uses Recurrence Plots (RP) to transform time-series into 2D texture images and then take advantage of the deep CNN classifier. Image representation of time-series introduces different feature types that are not available for 1D signals, and therefore TSC can be treated as texture image recognition task. CNN model also allows learning different levels of representations together with a classifier, jointly and automatically. Therefore, using RP and CNN in a unified framework is expected to boost the recognition rate of TSC. Experimental results on the UCR time-series classification archive demonstrate competitive accuracy of the proposed approach, compared not only to the existing deep architectures, but also to the state-of-the art TSC algorithms.Comment: The 10th International Conference on Machine Vision (ICMV 2017

arXiv.org e-Print Archive

Crossref

Analyzing Visual Mappings of Traditional and Alternative Music Notation

Author: El-Assady Mennatallah
Häußler Johannes
Keim Daniel
Kraus Matthias
Miller Matthias
Publication venue
Publication date: 25/10/2018
Field of study

In this paper, we postulate that combining the domains of information visualization and music studies paves the ground for a more structured analysis of the design space of music notation, enabling the creation of alternative music notations that are tailored to different users and their tasks. Hence, we discuss the instantiation of a design and visualization pipeline for music notation that follows a structured approach, based on the fundamental concepts of information and data visualization. This enables practitioners and researchers of digital humanities and information visualization, alike, to conceptualize, create, and analyze novel music notation methods. Based on the analysis of relevant stakeholders and their usage of music notation as a mean of communication, we identify a set of relevant features typically encoded in different annotations and encodings, as used by interpreters, performers, and readers of music. We analyze the visual mappings of musical dimensions for varying notation methods to highlight gaps and frequent usages of encodings, visual channels, and Gestalt laws. This detailed analysis leads us to the conclusion that such an under-researched area in information visualization holds the potential for fundamental research. This paper discusses possible research opportunities, open challenges, and arguments that can be pursued in the process of analyzing, improving, or rethinking existing music notation systems and techniques.Comment: 5 pages including references, 3rd Workshop on Visualization for the Digital Humanities, Vis4DH, IEEE Vis 201

arXiv.org e-Print Archive

KOPS - The Institutional Repository of the University of Konstanz