General Dynamic Scene Reconstruction from Multiple View Video
This paper introduces a general approach to dynamic scene reconstruction from
multiple moving cameras without prior knowledge or limiting constraints on the
scene structure, appearance, or illumination. Existing techniques for dynamic
scene reconstruction from multiple wide-baseline camera views primarily focus
on accurate reconstruction in controlled environments, where the cameras are
fixed and calibrated and the background is known. These approaches are not robust
for general dynamic scenes captured with sparse moving cameras. Previous
approaches for outdoor dynamic scene reconstruction assume prior knowledge of
the static background appearance and structure. The primary contributions of
this paper are twofold: an automatic method for initial coarse dynamic scene
segmentation and reconstruction without prior knowledge of background
appearance or structure; and a general robust approach for joint segmentation
refinement and dense reconstruction of dynamic scenes from multiple
wide-baseline static or moving cameras. Evaluation is performed on a variety of
indoor and outdoor scenes with cluttered backgrounds and multiple dynamic
non-rigid objects such as people. Comparison with state-of-the-art approaches
demonstrates improved accuracy in both multiple view segmentation and dense
reconstruction. The proposed approach also eliminates the requirement for prior
knowledge of scene structure and appearance.
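As a toy illustration of one ingredient such joint segmentation-and-reconstruction pipelines commonly rely on (not the paper's actual method), a hypothesised surface point can be tested for photo-consistency by comparing the colours it projects to in each camera view. The sample format and threshold below are assumptions for illustration only:

```python
def color_variance(samples):
    """Per-channel variance of colour samples, averaged over channels.

    `samples` is a list of (R, G, B) tuples, one per camera view,
    sampled at the projections of a single hypothesised 3D point.
    """
    n = len(samples)
    channels = list(zip(*samples))
    var = 0.0
    for ch in channels:
        mean = sum(ch) / n
        var += sum((v - mean) ** 2 for v in ch) / n
    return var / len(channels)

def photo_consistent(samples, threshold=100.0):
    """Keep a hypothesised surface point only if its projections into
    all views have similar colour (low variance across cameras).
    The threshold is a hypothetical tuning parameter."""
    return color_variance(samples) <= threshold
```

A point on a true surface yields similar colours in every view (`photo_consistent` returns True), while a point floating in free space projects onto unrelated background pixels and is rejected.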
Intelligent image cropping and scaling
This thesis was submitted for the degree of Doctor of Philosophy and awarded by Brunel University, 2011.
Nowadays, there exists a huge number of end devices with different screen properties for
watching television content, which is either broadcast or transmitted over the internet.
To allow best viewing conditions on each of these devices, different image formats have
to be provided by the broadcaster. Producing content for every single format is,
however, not feasible for the broadcaster, as it is much too laborious and costly.
The most obvious solution for providing multiple image formats is to produce one high-resolution format and derive formats of lower resolution from it. One possibility is simply to scale video images to the resolution of the target image format. Two significant drawbacks are the loss of image detail through downscaling and possibly unused image areas due to letter- or pillarboxing. A preferable solution is first to find the contextually most important region in the high-resolution format and then crop this area with the aspect ratio of the target image format. Defining
the contextually most important region manually is, however, very time-consuming, and applying it to live productions would be nearly impossible. Therefore, some approaches exist that define cropping areas automatically. To do so, they extract visual features, such as moving areas in a video, and define regions of interest
(ROIs) based on those. ROIs are finally used to define an enclosing cropping area. The
extraction of features is done without any knowledge about the type of content. Hence,
these approaches are not able to distinguish between features that might be important in
a given context and those that are not.
The work presented within this thesis tackles the problem of extracting visual features based on prior knowledge about the content. Such knowledge is fed into the system in the form of metadata that is available from TV production environments. Based on the
extracted features, ROIs are then defined and filtered depending on the analysed
content. As proof of concept, this application finally adapts SDTV (Standard Definition Television) sports productions automatically to image formats with lower resolution through intelligent cropping and scaling. If no content information is available, the system can still be applied to any type of content through a default mode. The presented approach is based on the principle of a plug-in system. Each plug-in
represents a method for analysing video content information, either on a low level by
extracting image features or on a higher level by processing extracted ROIs. The
combination of plug-ins is determined by the incoming descriptive production metadata
and hence can be adapted to each type of sport individually. The application has been comprehensively evaluated by comparing the results of the system against alternative cropping methods. This evaluation utilised videos that were manually cropped by a professional video editor, statically cropped videos, and simply scaled, non-cropped videos. In addition to purely subjective evaluations,
the gaze positions of subjects watching sports videos have been measured and compared
to the region-of-interest positions extracted by the system.
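The crop-then-scale step described above can be sketched as follows. This is a minimal illustration, not the thesis's implementation; the rectangle format (x, y, w, h) and the clamping policy are assumptions:

```python
def enclosing_crop(rois, frame_w, frame_h, target_ratio):
    """Compute a crop window that encloses all ROIs, expanded to the
    target aspect ratio (width / height) and clamped to the frame.

    `rois` is a list of (x, y, w, h) rectangles in frame coordinates.
    """
    # Bounding box of all ROIs.
    x0 = min(x for x, y, w, h in rois)
    y0 = min(y for x, y, w, h in rois)
    x1 = max(x + w for x, y, w, h in rois)
    y1 = max(y + h for x, y, w, h in rois)
    w, h = x1 - x0, y1 - y0
    # Grow the shorter side so the window matches the target ratio.
    if w / h < target_ratio:
        w = h * target_ratio
    else:
        h = w / target_ratio
    # If the window exceeds the frame, shrink it while keeping the ratio.
    if w > frame_w:
        w, h = frame_w, frame_w / target_ratio
    if h > frame_h:
        h, w = frame_h, frame_h * target_ratio
    # Centre the window on the ROI bounding box, then clamp to the frame.
    cx, cy = (x0 + x1) / 2, (y0 + y1) / 2
    x = min(max(cx - w / 2, 0), frame_w - w)
    y = min(max(cy - h / 2, 0), frame_h - h)
    return x, y, w, h

# Hypothetical example: two ROIs in a 720x576 SDTV frame, 4:3 target.
crop = enclosing_crop([(100, 100, 50, 50), (200, 150, 60, 40)], 720, 576, 4 / 3)
```

The resulting window would then be cut out and scaled down to the target resolution; downscaling only the crop preserves more detail in the important region than scaling the full frame.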
A Hybrid Model for Concurrent Interaction Recognition from Videos
Human behavior analysis plays an important role in understanding high-level human activities from surveillance videos. Human behavior has been identified using gestures, postures, actions, interactions and multiple activities of humans. This paper analyzes concurrent interactions that take place between multiple people. In order to capture the concurrency, a hybrid model has been designed combining a Layered Hidden Markov Model (LHMM) and a Coupled HMM (CHMM). The model has three layers, called the pose layer, the action layer and the interaction layer, in which the pose and action of a single person are defined in the layered model and the interactions of two or more persons are defined using the CHMM. This hybrid model reduces the number of training parameters while maintaining temporal correlations over the frames. The spatial and temporal information is extracted, and from the body-part attributes both simple human actions and concurrent actions/interactions are predicted. In addition, we evaluated the results on various datasets to analyze the concurrent interactions between people.
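The paper's full LHMM/CHMM formulation is not reproduced here; as a minimal sketch of the layered idea only (the CHMM coupling between persons is omitted), the decoded states of a pose-level HMM can serve as the observations of an action-level HMM. All states, symbols and probabilities below are invented for illustration:

```python
import math

def viterbi(obs, states, start_p, trans_p, emit_p):
    """Most likely hidden-state sequence (log-space Viterbi decoding)."""
    V = [{s: (math.log(start_p[s]) + math.log(emit_p[s][obs[0]]), [s])
          for s in states}]
    for o in obs[1:]:
        layer = {}
        for s in states:
            score, path = max(
                (V[-1][p][0] + math.log(trans_p[p][s]) + math.log(emit_p[s][o]),
                 V[-1][p][1] + [s])
                for p in states)
            layer[s] = (score, path)
        V.append(layer)
    return max(V[-1].values())[1]

# Lower layer: poses decoded from per-frame motion features.
pose_states = ["stand", "walk"]
pose_start = {"stand": 0.6, "walk": 0.4}
pose_trans = {"stand": {"stand": 0.7, "walk": 0.3},
              "walk": {"stand": 0.3, "walk": 0.7}}
pose_emit = {"stand": {"still": 0.9, "moving": 0.1},
             "walk": {"still": 0.2, "moving": 0.8}}

# Upper layer: actions whose observations are the decoded poses.
action_states = ["idle", "locomote"]
action_start = {"idle": 0.5, "locomote": 0.5}
action_trans = {"idle": {"idle": 0.8, "locomote": 0.2},
                "locomote": {"idle": 0.2, "locomote": 0.8}}
action_emit = {"idle": {"stand": 0.85, "walk": 0.15},
               "locomote": {"stand": 0.15, "walk": 0.85}}

features = ["still", "still", "moving", "moving"]
poses = viterbi(features, pose_states, pose_start, pose_trans, pose_emit)
actions = viterbi(poses, action_states, action_start, action_trans, action_emit)
```

Layering keeps each HMM small: each layer is trained on its own state space instead of one flat model over all pose-action combinations, which is the parameter reduction the abstract refers to.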
Embodied cognitive evolution and the cerebellum
Much attention has focused on the dramatic expansion of the forebrain, particularly the neocortex, as the neural substrate of cognitive evolution. However, though relatively small, the cerebellum contains about four times more neurons than the neocortex. I show that commonly used comparative measures such as neocortex ratio underestimate the contribution of the cerebellum to brain evolution. Once differences in the scaling of connectivity in neocortex and cerebellum are accounted for, a marked and general pattern of correlated evolution of the two structures is apparent. One deviation from this general pattern is a relative expansion of the cerebellum in apes and other extractive foragers. The confluence of these comparative patterns, studies of ape foraging skills and social learning, and recent evidence on the cognitive neuroscience of the cerebellum suggests an important role for the cerebellum in the evolution of the capacity for planning, execution and understanding of complex behavioural sequences, including tool use and language. There is no clear separation between sensory-motor and cognitive specializations underpinning such skills, undermining the notion of executive control as a distinct process. Instead, I argue that cognitive evolution is most effectively understood as the elaboration of specialized systems for embodied adaptive control
Robust subspace learning for static and dynamic affect and behaviour modelling
Machine analysis of human affect and behavior in naturalistic contexts has witnessed a growing attention in the last decade from various disciplines ranging from social and cognitive sciences to machine learning and computer vision. Endowing machines with the ability to seamlessly detect, analyze, model, predict as well as simulate and synthesize manifestations of internal emotional and behavioral states in real-world data is deemed essential for the deployment of next-generation, emotionally- and socially-competent human-centered interfaces. In this thesis, we are primarily motivated by the problem of modeling, recognizing and predicting spontaneous expressions of non-verbal human affect and behavior manifested through either low-level facial attributes in static images or high-level semantic events in image sequences. Both visual data and annotations of naturalistic affect and behavior naturally contain noisy measurements of unbounded magnitude at random locations, commonly referred to as "outliers". We present here machine learning methods that are robust to such gross, sparse noise. First, we deal with static analysis of face images, viewing the latter as a superposition of mutually-incoherent, low-complexity components corresponding to facial attributes, such as facial identity, expressions and activation of atomic facial muscle actions. We develop a robust, discriminant dictionary learning framework to extract these components from grossly corrupted training data and combine it with sparse representation to recognize the associated attributes. We demonstrate that our framework can jointly address interrelated classification tasks such as face and facial expression recognition. Inspired by the well-documented importance of the temporal aspect in perceiving affect and behavior, we direct the bulk of our research efforts into continuous-time modeling of dimensional affect and social behavior.
Having identified a gap in the literature, namely the lack of data containing annotations of social attitudes in continuous time and scale, we first curate a new audio-visual database of multi-party conversations from political debates annotated frame-by-frame in terms of real-valued conflict intensity and use it to conduct the first study on continuous-time conflict intensity estimation. Our experimental findings corroborate previous evidence indicating the inability of existing classifiers to capture the hidden temporal structures of affective and behavioral displays. We present here a novel dynamic behavior analysis framework which models temporal dynamics in an explicit way, based on the natural assumption that continuous-time annotations of smoothly-varying affect or behavior can be viewed as outputs of a low-complexity linear dynamical system when behavioral cues (features) act as system inputs. A novel robust structured rank minimization framework is proposed to estimate the system parameters in the presence of gross corruptions and partially missing data. Experiments on prediction of dimensional conflict and affect as well as multi-object tracking from detection validate the effectiveness of our predictive framework and demonstrate for the first time that complex human behavior and affect can be learned and predicted based on small training sets of person(s)-specific observations.
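The thesis's robust structured rank minimization is beyond a short sketch, but the underlying modelling assumption, that continuous annotations behave as outputs of a low-complexity linear system driven by feature inputs, can be illustrated with a much simpler, noise-free first-order ARX fit. All names and numbers here are hypothetical and the estimator is plain least squares, not the robust method of the thesis:

```python
def fit_arx(y, u):
    """Least-squares fit of y[t] = a*y[t-1] + b*u[t].

    Solves the 2x2 normal equations directly; y is the annotation
    trace (system output) and u a scalar feature trace (system input).
    """
    rows = [(y[t - 1], u[t], y[t]) for t in range(1, len(y))]
    s11 = sum(r[0] * r[0] for r in rows)
    s12 = sum(r[0] * r[1] for r in rows)
    s22 = sum(r[1] * r[1] for r in rows)
    t1 = sum(r[0] * r[2] for r in rows)
    t2 = sum(r[1] * r[2] for r in rows)
    det = s11 * s22 - s12 * s12
    a = (s22 * t1 - s12 * t2) / det
    b = (s11 * t2 - s12 * t1) / det
    return a, b

# Simulate a smooth annotation trace from known dynamics, then recover them.
a_true, b_true = 0.8, 0.5
u = [1, 0, 1, 1, 0, 1, 0, 0, 1, 1]      # hypothetical feature trace
y = [0.0]
for t in range(1, len(u)):
    y.append(a_true * y[-1] + b_true * u[t])
a_hat, b_hat = fit_arx(y, u)
```

In the noise-free case the parameters are recovered exactly; the point of the thesis's robust formulation is precisely that plain least squares like this breaks down under the gross corruptions and missing entries found in real annotations.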
Non-REM dreaming in relation to the cyclic alternating pattern: an exploratory study
Dreaming is yet to be studied in relation to sleep microstructure. By endeavouring to study mentation in relation to the finer neurophysiological processes underlying the rhythmicity of the sleep cycles, dream science stands to benefit from the wealth of knowledge of these processes. While relationships between dreaming and certain of these processes have been identified in the literature, a comprehensive study of dreaming in relation to all of the recognized components of sleep microstructure is completely lacking. With this in mind, the main aim of this study was to examine sleep microstructure in relation to dreaming and to determine whether there is any relationship between dream recall and the various types of phasic arousal phenomena during NREM sleep, as systematised within the global framework of the cyclic alternating pattern (CAP).