Search CORE

2,327 research outputs found

Dynamic texture recognition using time-causal and time-recursive spatio-temporal receptive fields

Author: Jansson Ylva
Lindeberg Tony
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

This work presents a first evaluation of using spatio-temporal receptive fields from a recently proposed time-causal spatio-temporal scale-space framework as primitives for video analysis. We propose a new family of video descriptors based on regional statistics of spatio-temporal receptive field responses and evaluate this approach on the problem of dynamic texture recognition. Our approach generalises a previously used method, based on joint histograms of receptive field responses, from the spatial to the spatio-temporal domain and from object recognition to dynamic texture recognition. The time-recursive formulation enables computationally efficient time-causal recognition. The experimental evaluation demonstrates competitive performance compared to state-of-the-art. Especially, it is shown that binary versions of our dynamic texture descriptors achieve improved performance compared to a large range of similar methods using different primitives either handcrafted or learned from data. Further, our qualitative and quantitative investigation into parameter choices and the use of different sets of receptive fields highlights the robustness and flexibility of our approach. Together, these results support the descriptive power of this family of time-causal spatio-temporal receptive fields, validate our approach for dynamic texture recognition and point towards the possibility of designing a range of video analysis methods based on these new time-causal spatio-temporal primitives.Comment: 29 pages, 16 figure

arXiv.org e-Print Archive

Publikationer från KTH

Crossref

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Grounding Dynamic Spatial Relations for Embodied (Robot) Interaction

Author: Bhatt Mehul
Eppe Manfred
Spranger Michael
Suchan Jakob
Publication venue
Publication date: 25/04/2016
Field of study

This paper presents a computational model of the processing of dynamic spatial relations occurring in an embodied robotic interaction setup. A complete system is introduced that allows autonomous robots to produce and interpret dynamic spatial phrases (in English) given an environment of moving objects. The model unites two separate research strands: computational cognitive semantics and on commonsense spatial representation and reasoning. The model for the first time demonstrates an integration of these different strands.Comment: in: Pham, D.-N. and Park, S.-B., editors, PRICAI 2014: Trends in Artificial Intelligence, volume 8862 of Lecture Notes in Computer Science, pages 958-971. Springe

arXiv.org e-Print Archive

Digital.CSIC

Answer Set Programming Modulo `Space-Time'

Author: DA Randell
E Erdem
G Ligozat
J Bomanson
J Suchan
JO Wallgrün
M Aiello
M Bartholomew
M Bhatt
M Bhatt
M Gebser
P Muller
PA Wałęga
Publication venue
Publication date: 17/05/2018
Field of study

We present ASP Modulo `Space-Time', a declarative representational and computational framework to perform commonsense reasoning about regions with both spatial and temporal components. Supported are capabilities for mixed qualitative-quantitative reasoning, consistency checking, and inferring compositions of space-time relations; these capabilities combine and synergise for applications in a range of AI application areas where the processing and interpretation of spatio-temporal data is crucial. The framework and resulting system is the only general KR-based method for declaratively reasoning about the dynamics of `space-time' regions as first-class objects. We present an empirical evaluation (with scalability and robustness results), and include diverse application examples involving interpretation and control tasks

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive

Learning relational event models from video

Author: Bhatt M
Cohn AG
Dubba KSR
Dylla F
Hogg DC
Publication venue: 'AI Access Foundation'
Publication date: 15/05/2015
Field of study

Event models obtained automatically from video can be used in applications ranging from abnormal event detection to content based video retrieval. When multiple agents are involved in the events, characterizing events naturally suggests encoding interactions as relations. Learning event models from this kind of relational spatio-temporal data using relational learning techniques such as Inductive Logic Programming (ILP) hold promise, but have not been successfully applied to very large datasets which result from video data. In this paper, we present a novel framework REMIND (Relational Event Model INDuction) for supervised relational learning of event models from large video datasets using ILP. Efficiency is achieved through the learning from interpretations setting and using a typing system that exploits the type hierarchy of objects in a domain. The use of types also helps prevent over generalization. Furthermore, we also present a type-refining operator and prove that it is optimal. The learned models can be used for recognizing events from previously unseen videos. We also present an extension to the framework by integrating an abduction step that improves the learning performance when there is noise in the input data. The experimental results on several hours of video data from two challenging real world domains (an airport domain and a physical action verbs domain) suggest that the techniques are suitable to real world scenarios

CiteSeerX

Crossref

White Rose Research Online

The Meaning of Action:a review on action recognition and mapping

Author: Geib Christopher
Kragic Danica
Krüger Volker
Ude Ales
Publication venue
Publication date: 01/01/2007
Field of study

VBN

The Meaning of Action:a review on action recognition and mapping

Author: Geib Christopher
Kragic Danica
Krüger Volker
Ude Ales
Publication venue: Aalborg Universitet, Copenhagen Institute of Technology (CVMI)
Publication date: 01/01/2007
Field of study

In this paper, we analyze the different approaches taken to date within the computer vision, robotics and artificial intelligence communities for the representation, recognition, synthesis and understanding of action. We deal with action at different levels of complexity and provide the reader with the necessary related literature references. We put the literature references further into context and outline a possible interpretation of action by taking into account the different aspects of action recognition, action synthesis and task-level planning

Lund University Publications

VBN

Combining logic and probability in tracking and scene interpretation

Author: Bennett B.
Publication venue: Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, Germany
Publication date: 01/01/2008
Field of study

The paper gives a high-level overview of some ways in which logical representations and reasoning can be used in computer vision applications, such as tracking and scene interpretation. The combination of logical and statistical approaches is also considered

Dagstuhl Research Online Publication Server

White Rose Research Online