264 research outputs found

    Using the properties of Primate Motion Sensitive Neurons to extract camera motion and depth from brief 2-D Monocular Image Sequences

    Humans and most animals can run or fly through cluttered environments, navigating efficiently while avoiding obstacles in their way. Replicating this advanced skill in autonomous robotic vehicles currently requires a vast array of sensors coupled with computers that are bulky, heavy and power hungry. The human eye and brain have had millions of years to develop an efficient solution to the problem of visual navigation, and we believe it is the best system to reverse engineer. Our brain and visual system appear to solve the visual odometry problem very differently from most computer vision approaches. We show how a neural-based architecture can extract self-motion information and depth from monocular 2-D video sequences and highlight how this approach differs from standard CV techniques. We previously demonstrated how our system works during pure translation of a camera. Here, we extend this approach to the case of combined translation and rotation.
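
    The geometric decomposition underlying this extension can be illustrated with a short sketch: for a known (or estimated) camera rotation, the rotation-induced component of optical flow is independent of depth and can be subtracted, reducing the combined case to the pure-translation case. The function below is a hypothetical illustration using the standard perspective flow equations, not the neural architecture the abstract describes.

```python
import numpy as np

def remove_rotational_flow(flow, omega, xs, ys, f=1.0):
    """Subtract the rotation-induced component of an optical flow field.

    For a camera rotating with angular velocity omega = (wx, wy, wz), the
    rotational part of the flow at image point (x, y) is independent of
    scene depth (perspective projection, focal length f, coordinates
    centred on the principal point):
        u_rot = (x*y/f)*wx - (f + x**2/f)*wy + y*wz
        v_rot = (f + y**2/f)*wx - (x*y/f)*wy - x*wz
    Removing it leaves a purely translational field, in which flow
    magnitude again encodes depth.
    """
    wx, wy, wz = omega
    u_rot = (xs * ys / f) * wx - (f + xs**2 / f) * wy + ys * wz
    v_rot = (f + ys**2 / f) * wx - (xs * ys / f) * wy - xs * wz
    derotated = np.stack([flow[..., 0] - u_rot,
                          flow[..., 1] - v_rot], axis=-1)
    return derotated
```
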

    Testing a biologically-based system for extracting depth from brief monocular 2-D video sequences

    Knowledge of the 3-D layout in front of a moving robot or vehicle is essential for obstacle avoidance and navigation. Currently, the most common methods for acquiring that information rely on ‘active’ technologies which project light into the world (e.g., LIDAR). Some passive (non-emitting) systems use stereo cameras, but only a relatively small number of techniques attempt to solve the 3-D layout problem using the information from a single video camera. A single camera offers many advantages, such as lighter weight and fewer video streams to process. The visual motion occurring in brief monocular video sequences contains information about the movement of the camera and the structure of the scene. Extracting that information is difficult, however, because it relies on accurate estimates of the image motion velocities (optical flow) and knowledge of the camera motion, especially the heading direction. We have solved these two problems and can now obtain image flow and heading direction using mechanisms based on the properties of motion-sensitive neurones in the brain. This allows us to recover depth information from monocular video sequences, and here we report on a series of tests that assess the accuracy of this novel approach to 3-D depth recovery.
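
    The geometric relation being exploited can be sketched in a few lines. For a purely translating camera, image speed at a pixel grows with its distance from the heading point (the focus of expansion) and shrinks with the depth of the imaged point, so depth is recoverable up to the unknown speed scale. This is an illustrative reconstruction of the geometry, not the neural mechanism the abstract describes; the function name and array layout are assumptions.

```python
import numpy as np

def depth_from_translational_flow(flow, foe, speed=1.0, eps=1e-6):
    """Recover relative depth from the flow field of a translating camera.

    Under pure translation, the image speed at a pixel is proportional to
    the camera speed times the pixel's distance from the focus of
    expansion (the heading point), divided by the depth of the imaged
    point. Inverting this gives depth up to the unknown speed scale.
    `flow` is an (H, W, 2) array of image velocities; `foe` is the
    heading point in (x, y) pixel coordinates.
    """
    h, w, _ = flow.shape
    ys, xs = np.mgrid[0:h, 0:w]
    dist_from_foe = np.hypot(xs - foe[0], ys - foe[1])
    flow_mag = np.hypot(flow[..., 0], flow[..., 1])
    return speed * dist_from_foe / (flow_mag + eps)
```

    Note that depth is undefined at the focus of expansion itself, where both the distance and the flow magnitude vanish.
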

    Extraction of Surface-Related Features in a Recurrent Model of V1-V2 Interactions

    Humans can effortlessly segment surfaces and objects from two-dimensional (2D) images that are projections of the 3D world. The projection from 3D to 2D leads to partial occlusions of surfaces, depending on their position in depth and on viewpoint. One way for the human visual system to infer monocular depth cues could be to extract and interpret occlusions. It has been suggested that the perception of contour junctions, in particular T-junctions, may be used as a cue for occlusion of opaque surfaces. Furthermore, X-junctions could be used to signal occlusion of transparent surfaces. In this contribution, we propose a neural model that suggests how surface-related cues for occlusion can be extracted from a 2D luminance image. The approach is based on feedforward and feedback mechanisms found in visual cortical areas V1 and V2. In a first step, contours are completed over time by generating groupings of like-oriented contrasts. A few iterations of feedforward and feedback processing lead to a stable representation of completed contours and, at the same time, to a suppression of image noise. In a second step, contour junctions are localized and read out from the distributed representation of boundary groupings. Moreover, surface-related junctions are made explicit so that they can interact to generate surface segmentations in static images. In addition, we compare our extracted junction signals with a standard computer vision approach for junction detection to demonstrate that our approach outperforms simple feedforward computation-based approaches. A model is proposed that uses feedforward and feedback mechanisms to combine contextually relevant features in order to generate consistent boundary groupings of surfaces. Perceptually important junction configurations are robustly extracted from neural representations to signal cues for occlusion and transparency.
Unlike previous proposals, which treat localized junction configurations as 2D image features, we link them to mechanisms of apparent surface segregation. As a consequence, we demonstrate how junctions can change their perceptual representation depending on the scene context and the spatial configuration of boundary fragments.
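
    The junction categories mentioned above can be characterised purely geometrically: a T-junction has three contour arms, two of them collinear, while an X-junction has four arms forming two collinear pairs. The sketch below classifies a junction from the angles of its arms; it is a schematic geometric test for illustration, not the model's neural readout, and the tolerance value is an arbitrary choice.

```python
def classify_junction(arm_angles, tol=10.0):
    """Classify a contour junction by its arm configuration.

    arm_angles: directions (in degrees) of the contour arms leaving the
    junction point. Two arms are collinear if they differ by ~180 deg.
    Returns "T" for three arms with one collinear pair (occlusion cue for
    opaque surfaces), "X" for four arms with two collinear pairs
    (transparency cue), and "other" otherwise.
    """
    def collinear_pairs(angles):
        count = 0
        for i in range(len(angles)):
            for j in range(i + 1, len(angles)):
                diff = (angles[i] - angles[j]) % 360.0
                if abs(diff - 180.0) < tol:
                    count += 1
        return count

    n = len(arm_angles)
    if n == 3 and collinear_pairs(arm_angles) == 1:
        return "T"
    if n == 4 and collinear_pairs(arm_angles) == 2:
        return "X"
    return "other"
```
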

    A cyclopean perspective on mouse visual cortex


    Computational Modeling of Human Dorsal Pathway for Motion Processing

    Reliable motion estimation in videos is of crucial importance for background identification, object tracking, action recognition, event analysis, self-navigation, etc. Reconstructing the motion field in the 2D image plane is very challenging due to variations in image quality, scene geometry, lighting conditions and, most importantly, camera jittering. Traditional optical flow models assume consistent image brightness and a smooth motion field, assumptions that are violated by the unstable illumination and motion discontinuities common in real-world videos. To recognize observer (or camera) motion robustly in complex, realistic scenarios, we propose a biologically inspired motion estimation system that overcomes the issues posed by real-world videos. The bottom-up model is inspired by the infrastructure as well as the functionalities of the human dorsal pathway, and the hierarchical processing stream can be divided into three stages: 1) spatio-temporal processing for local motion, 2) recognition of global motion patterns (camera motion), and 3) preemptive estimation of object motion. To extract effective and meaningful motion features, we apply a series of steerable, spatio-temporal filters to detect local motion at different speeds and directions, in a way that is selective for motion velocity. The intermediate response maps are calibrated and combined to estimate dense motion fields in local regions, and then local motions along two orthogonal axes are aggregated to recognize planar, radial and circular patterns of global motion. We evaluate the model on an extensive, realistic video database collected by hand with a mobile device (iPad), in which the video content varies in scene geometry, lighting condition, view perspective and depth. We achieve high-quality results and demonstrate that this bottom-up model is capable of extracting high-level semantic knowledge about self-motion in realistic scenes.
Once the global motion is known, we segment objects from moving backgrounds by compensating for camera motion. For videos captured with non-stationary cameras, we consider global motion as a combination of camera motion (background) and object motion (foreground). To estimate foreground motion, we exploit the corollary discharge mechanism of biological systems and estimate motion preemptively. Since the background motion at each pixel is collectively introduced by camera movements, we apply spatio-temporal averaging to estimate the background motion at the pixel level, and the initial estimate of foreground motion is derived by comparing global motion and background motion at multiple spatial levels. The real frame signals are then compared with those derived by forward prediction, refining the estimates of object motion. This motion detection system is applied to detect objects against cluttered, moving backgrounds and proves efficient at locating independently moving, non-rigid regions. The core contribution of this thesis is a robust motion estimation system for complicated real-world videos, facing challenges posed by real sensor noise, complex natural scenes, variations in illumination and depth, and motion discontinuities. The overall system demonstrates biological plausibility and holds great potential for other applications, such as camera motion removal, heading estimation, obstacle avoidance, route planning, and vision-based navigational assistance.
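
    The compensation step can be caricatured in a few lines. If background pixels dominate the frame, a robust global estimate of the flow (here a per-component median, chosen purely for illustration) approximates the camera-induced background motion, and pixels whose flow deviates strongly from it are flagged as independently moving. This is a deliberately simplified stand-in for the multi-level, corollary-discharge scheme described above; the function name and threshold are assumptions.

```python
import numpy as np

def moving_object_mask(flow, thresh=1.0):
    """Flag independently moving pixels by compensating for camera motion.

    Background motion is approximated by the per-component median of the
    flow field (valid when background pixels dominate and camera motion
    is roughly uniform across the frame); pixels whose flow deviates
    from it by more than `thresh` are labelled foreground.
    `flow` is an (H, W, 2) array of per-pixel image velocities.
    """
    bg = np.median(flow.reshape(-1, 2), axis=0)
    residual = np.hypot(flow[..., 0] - bg[0], flow[..., 1] - bg[1])
    return residual > thresh
```
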

    Change blindness: eradication of gestalt strategies

    Arrays of eight texture-defined rectangles were used as stimuli in a one-shot change blindness (CB) task in which there was a 50% chance that one rectangle would change orientation between two successive presentations separated by an interval. CB was eliminated by cueing the target rectangle in the first stimulus, reduced by cueing in the interval, and unaffected by cueing in the second presentation. This supports the idea that a representation was formed that persisted through the interval before being 'overwritten' by the second presentation (Landman et al., 2003, Vision Research 43, 149–164). Another possibility is that participants used some kind of grouping or Gestalt strategy. To test this, we changed the spatial position of the rectangles in the second presentation by shifting them along imaginary spokes (by ±1 degree) emanating from the central fixation point. There was no significant difference in performance between this and the standard task [F(1,4)=2.565, p=0.185]. This may suggest two things: (i) Gestalt grouping is not used as a strategy in these tasks, and (ii) it gives further weight to the argument that objects may be stored in, and retrieved from, a pre-attentional store during this task.

    An Empirical Model of Area MT: Investigating the Link between Representation Properties and Function

    The middle temporal area (MT) is one of the visual areas of the primate brain where neurons have highly specialized representations of motion and binocular disparity. Other stimulus features, such as contrast, size, and pattern, can also modulate MT activity. Since MT has been studied intensively for decades, there is a rich literature on its response characteristics. Here, I present an empirical model that incorporates some of this literature into a statistical model of the population response. Specifically, the parameters of the model are drawn from distributions that I have estimated from data in the electrophysiology literature. The model accepts arbitrary stereo video as input and uses computer-vision methods to calculate dense flow, disparity, and contrast fields. Activity is then predicted using a combination of tuning functions that have previously been used to describe data in a variety of experiments. The empirical model approximates a number of MT phenomena more closely than other models, as well as reproducing three phenomena not addressed by past models. I present three applications of the model. First, I use it to examine the relationships between MT tuning features and behaviour in an ethologically relevant task. Second, I employ it to study the functional role of MT surrounds in motion-related tasks. Third, I use it to guide the internal activity of a deep convolutional network towards a more physiologically realistic representation.
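
    A typical building block of such an empirical model is a parametric tuning function. The von Mises (circular Gaussian) curve below is one common descriptive form for MT direction tuning; the parameter values here are illustrative placeholders, not the distributions fitted from the electrophysiology literature.

```python
import numpy as np

def direction_tuning(direction, pref_dir, kappa=2.0, r_max=50.0, baseline=2.0):
    """Von Mises direction tuning curve.

    Response peaks at baseline + r_max when the stimulus moves in the
    preferred direction and decays smoothly for other directions; kappa
    controls the tuning bandwidth (larger kappa = narrower tuning).
    Angles are in radians; the return value is a firing rate in spikes/s.
    """
    return baseline + r_max * np.exp(kappa * (np.cos(direction - pref_dir) - 1.0))
```

    In a population model, each simulated neuron would draw its own pref_dir, kappa and r_max from literature-derived distributions, and the direction term would be multiplied with analogous tuning functions for speed, disparity and contrast.
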

    Visual Cortex

    The neurosciences have experienced tremendous and wonderful progress in many areas, and the spectrum encompassing the neurosciences is expansive. Suffice it to mention a few classical fields: electrophysiology, genetics, physics, computer sciences and, more recently, social and marketing neurosciences. Of course, this large growth has resulted in the production of many books. Perhaps the visual system and the visual cortex were in the vanguard because most animals do not produce their own light and thus offer the invaluable advantage of allowing investigators to conduct experiments in full control of the stimulus. In addition, the fascinating evolution of scientific techniques, the immense productivity of recent research, and the ensuing literature make it virtually impossible to publish in a single volume all worthwhile work accomplished throughout the scientific world. The days when a single individual, like Diderot, could undertake the production of an encyclopedia are gone forever. Indeed, most approaches to studying the nervous system are valid, and neuroscientists produce an almost astronomical amount of interesting data accompanied by extremely worthy hypotheses, which in turn generate new ventures in the search for brain functions. Yet it is fully justified to make an encore and publish a book dedicated to the visual cortex and beyond. Many reasons validate a book assembling chapters written by active researchers. Each has the opportunity to bind together data and explore original ideas whose fate will not fall into the hands of uncompromising reviewers of traditional journals. This book focuses on the cerebral cortex, with a large emphasis on vision. Yet it offers the reader diverse approaches employed to investigate the brain, for instance computer simulation, cellular responses, or rivalry between various targets and goal-directed actions.
This volume thus covers a large spectrum of research, even though it is impossible to include all topics in the extremely diverse field of neurosciences.

    Bio-Inspired Computer Vision: Towards a Synergistic Approach of Artificial and Biological Vision

    To appear in CVIU. Studies in biological vision have always been a great source of inspiration for the design of computer vision algorithms. In the past, several successful methods were designed with varying degrees of correspondence with biological vision studies, ranging from purely functional inspiration to methods that utilise models primarily developed to explain biological observations. Even though it seems well recognised that computational models of biological vision can help in the design of computer vision algorithms, it is a non-trivial exercise for a computer vision researcher to mine relevant information from the biological vision literature, as very few studies in biology are organised at a task level. In this paper, we aim to bridge this gap by providing a computer-vision-task-centric presentation of models primarily originating in biological vision studies. Not only do we revisit some of the main features of biological vision and discuss the foundations of existing computational studies modelling biological vision, but we also consider three classical computer vision tasks from a biological perspective: image sensing, segmentation and optical flow. Using this task-centric approach, we discuss well-known biological functional principles and compare them with approaches taken by computer vision. Based on this comparative analysis of computer and biological vision, we present some recent models in biological vision and highlight a few models that we think are promising for future investigations in computer vision. To this end, this paper provides new insights and a starting point for investigators interested in the design of biology-based computer vision algorithms and paves the way for the much-needed interaction between the two communities, leading to the development of synergistic models of artificial and biological vision.