Search CORE

64,208 research outputs found

Clear Visual Separation of Temporal Event Sequences

Author: Grønbæk Kaj
Mathisen Andreas
Publication venue
Publication date: 17/10/2017
Field of study

Extracting and visualizing informative insights from temporal event sequences becomes increasingly difficult when data volume and variety increase. Besides dealing with high event type cardinality and many distinct sequences, it can be difficult to tell whether it is appropriate to combine multiple events into one or utilize additional information about event attributes. Existing approaches often make use of frequent sequential patterns extracted from the dataset, however, these patterns are limited in terms of interpretability and utility. In addition, it is difficult to assess the role of absolute and relative time when using pattern mining techniques. In this paper, we present methods that addresses these challenges by automatically learning composite events which enables better aggregation of multiple event sequences. By leveraging event sequence outcomes, we present appropriate linked visualizations that allow domain experts to identify critical flows, to assess validity and to understand the role of time. Furthermore, we explore information gain and visual complexity metrics to identify the most relevant visual patterns. We compare composite event learning with two approaches for extracting event patterns using real world company event data from an ongoing project with the Danish Business Authority.Comment: In Proceedings of the 3rd IEEE Symposium on Visualization in Data Science (VDS), 201

arXiv.org e-Print Archive

Crossref

Automatic annotation of tennis games: An integration of audio, vision, and learning

Author: Fei Yan
Josef Kittler
David Windridge
William Christmas
Krystian Mikolajczyk
Stephen Cox
Qiang Huang
Kijak
Kolonias
Huang
Yu
Ekinci
Zhu
Yan
Christmas
Yan
Kijak
Coldefy
Zhu
Lai
Hartley
Kittler
Huang
Tsochantaridis
Joachims
Altun
Taskar
Ng
Publication venue: 'Elsevier BV'
Publication date: 01/01/1999
Field of study

Fully automatic annotation of tennis game using broadcast video is a task with a great potential but with enormous challenges. In this paper we describe our approach to this task, which integrates computer vision, machine listening, and machine learning. At the low level processing, we improve upon our previously proposed state-of-the-art tennis ball tracking algorithm and employ audio signal processing techniques to detect key events and construct features for classifying the events. At high level analysis, we model event classification as a sequence labelling problem, and investigate four machine learning techniques using simulated event sequences. Finally, we evaluate our proposed approach on three real world tennis games, and discuss the interplay between audio, vision and learning. To the best of our knowledge, our system is the only one that can annotate tennis game at such a detailed level

Crossref

Middlesex University Research Repository

Institutional Repository Universiteit Antwerpen

University of East Anglia digital repository

Surrey Research Insight

Tropmed Central Antwerp

The Sound Manifesto

Author: Bisnovatyi Ilia
O'Donnell Michael J.
Publication venue: 'SPIE-Intl Soc Optical Eng'
Publication date: 08/07/2000
Field of study

Computing practice today depends on visual output to drive almost all user interaction. Other senses, such as audition, may be totally neglected, or used tangentially, or used in highly restricted specialized ways. We have excellent audio rendering through D-A conversion, but we lack rich general facilities for modeling and manipulating sound comparable in quality and flexibility to graphics. We need co-ordinated research in several disciplines to improve the use of sound as an interactive information channel. Incremental and separate improvements in synthesis, analysis, speech processing, audiology, acoustics, music, etc. will not alone produce the radical progress that we seek in sonic practice. We also need to create a new central topic of study in digital audio research. The new topic will assimilate the contributions of different disciplines on a common foundation. The key central concept that we lack is sound as a general-purpose information channel. We must investigate the structure of this information channel, which is driven by the co-operative development of auditory perception and physical sound production. Particular audible encodings, such as speech and music, illuminate sonic information by example, but they are no more sufficient for a characterization than typography is sufficient for a characterization of visual information.Comment: To appear in the conference on Critical Technologies for the Future of Computing, part of SPIE's International Symposium on Optical Science and Technology, 30 July to 4 August 2000, San Diego, C

arXiv.org e-Print Archive

Crossref

Scene extraction in motion pictures

Author: Dorai Chitra
Truong Ba Tu
Venkatesh Svetha
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2003
Field of study

This paper addresses the challenge of bridging the semantic gap between the rich meaning users desire when they query to locate and browse media and the shallowness of media descriptions that can be computed in today\u27s content management systems. To facilitate high-level semantics-based content annotation and interpretation, we tackle the problem of automatic decomposition of motion pictures into meaningful story units, namely scenes. Since a scene is a complicated and subjective concept, we first propose guidelines from fill production to determine when a scene change occurs. We then investigate different rules and conventions followed as part of Fill Grammar that would guide and shape an algorithmic solution for determining a scene. Two different techniques using intershot analysis are proposed as solutions in this paper. In addition, we present different refinement mechanisms, such as film-punctuation detection founded on Film Grammar, to further improve the results. These refinement techniques demonstrate significant improvements in overall performance. Furthermore, we analyze errors in the context of film-production techniques, which offer useful insights into the limitations of our method

Deakin Research Online

A lightweight web video model with content and context descriptions for integration with linked data

Author: Breslin John G.
Choudhury Smitashree
Decker Stefan
Publication venue
Publication date: 01/01/2009
Field of study

The rapid increase of video data on the Web has warranted an urgent need for effective representation, management and retrieval of web videos. Recently, many studies have been carried out for ontological representation of videos, either using domain dependent or generic schemas such as MPEG-7, MPEG-4, and COMM. In spite of their extensive coverage and sound theoretical grounding, they are yet to be widely used by users. Two main possible reasons are the complexities involved and a lack of tool support. We propose a lightweight video content model for content-context description and integration. The uniqueness of the model is that it tries to model the emerging social context to describe and interpret the video. Our approach is grounded on exploiting easily extractable evolving contextual metadata and on the availability of existing data on the Web. This enables representational homogeneity and a firm basis for information integration among semantically-enabled data sources. The model uses many existing schemas to describe various ontology classes and shows the scope of interlinking with the Linked Data cloud

CiteSeerX

Open Research Online (The Open University)

Segregating Event Streams and Noise with a Markov Renewal Process Model

Author: Plumbley MD
Stowell D
Publication venue
Publication date: 01/08/2013
Field of study

DS and MP are supported by EPSRC Leadership Fellowship EP/G007144/1

Queen Mary Research Online

Surrey Research Insight

Recommended from our members

The critical events for motor-sensory temporal recalibration

Author: Arnold D. H.
Nancarrow K.
Yarrow K.
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2012
Field of study

Determining if we, or another agent, were responsible for a sensory event can require an accurate sense of timing. Our sense of appropriate timing relationships must, however, be malleable as there is a variable delay between the physical timing of an event and when sensory signals concerning that event are encoded in the brain. One dramatic demonstration of such malleability involves having people repeatedly press a button thereby causing a beep. If a delay is inserted between button presses and beeps, when it is subsequently taken away beeps can seem to precede the button presses that caused them. For this to occur it is important that people feel they were responsible for instigating the beeps. In terms of their timing, as yet it is not clear what combination of events is important for motor-sensory temporal recalibration. Here, by introducing ballistic reaches of short or longer extent before a button press, we varied the delay between the intention to act and the sensory consequence of that action. This manipulation failed to modulate recalibration magnitude. By contrast, introducing a similarly lengthened delay between button presses and consequent beeps eliminated recalibration. Thus it would seem that the critical timing relationship for motor-sensory temporal recalibration is between tactile signals relating to the completion of an action and the subsequent auditory percept

City Research Online

Crossref

Directory of Open Access Journals

Frontiers - Publisher Connector

PubMed Central

University of Queensland eSpace

Different roles of similarity and predictability in auditory stream segregation

Author: Bendixen Alexandra
Bőhm Tamás M.
Denham Susan L.
Mill Robert
Szalárdy Orsolya
Winkler István
Publication venue: 'Akademiai Kiado Zrt.'
Publication date: 19/06/2013
Field of study

Sound sources often emit trains of discrete sounds, such as a series of footsteps. Previously, two dif¬ferent principles have been suggested for how the human auditory system binds discrete sounds to¬gether into perceptual units. The feature similarity principle is based on linking sounds with similar characteristics over time. The predictability principle is based on linking sounds that follow each other in a predictable manner. The present study compared the effects of these two principles. Participants were presented with tone sequences and instructed to continuously indicate whether they perceived a single coherent sequence or two concurrent streams of sound. We investigated the inﬂuence of separate manipulations of similarity and predictability on these perceptual reports. Both grouping principles affected perception of the tone sequences, albeit with different characteristics. In particular, results suggest that whereas predictability is only analyzed for the currently perceived sound organization, feature similarity is also analyzed for alternative groupings of sound. Moreover, changing similarity or predictability within an ongoing sound sequence led to markedly different dynamic effects. Taken together, these results provide evidence for different roles of similarity and predictability in auditory scene analysis, suggesting that forming auditory stream representations and competition between alter¬natives rely on partly different processes

Crossref

Repository of the Academy's Library