622 research outputs found

    Relationship between suspicious coincidence in natural images and contour-salience in oriented filter responses

    Salient contour detection is an important low-level visual process in the human visual system, and is significant for understanding higher visual and cognitive processes. Salience detection can be investigated by examining the visual cortical response to visual input. Visual response activity in the early stages of visual processing can be approximated by a sequence of convolutions of the input scene with difference-of-Gaussian (DoG) and oriented Gabor filters. The filtered responses are unusually high at prominent edge locations in the image, and are uniformly similar across different natural image inputs. Furthermore, such a response follows a power-law distribution. The aim of this thesis is to examine how these response properties could be applied to the problem of salience detection. First, I identify a method to find the best threshold on the response activity (orientation energy) for detecting salient contours: compare the response distribution to a Gaussian distribution of equal variance. Second, I justify this comparison by providing an explanation under the framework of Suspicious Coincidence proposed by Barlow [1]. A connection is provided between the perceived salience of contours and the neuronal goal of detecting suspiciousness, where salient contours are seen as affording suspicious coincidences by the visual system. Finally, the neural plausibility of such a salience detection mechanism is investigated, and its representational efficiency is shown, which could potentially explain why the human visual system can effortlessly detect salience.
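The thresholding idea in this abstract (compare the response distribution to a Gaussian of equal variance) can be sketched as follows. This is only one plausible reading, not the thesis's implementation: it assumes zero-mean signed filter responses, bins their magnitudes, and returns the lower edge of the first bin past the mid-range dip where the observed count again exceeds the count a Gaussian of equal variance would predict. The function names, the bin settings, and the synthetic Laplace responses standing in for real DoG/Gabor outputs are all hypothetical.

```python
import math


def laplace_quantile_samples(n):
    """Deterministic heavy-tailed stand-in for filter responses (assumption):
    evenly spaced quantiles of a unit-scale Laplace distribution."""
    samples = []
    for i in range(n):
        p = (i + 0.5) / n
        if p < 0.5:
            samples.append(math.log(2.0 * p))
        else:
            samples.append(-math.log(2.0 * (1.0 - p)))
    return samples


def gaussian_crossover_threshold(responses, bins_per_sigma=4, max_sigma=6.0):
    """Return the magnitude beyond which the response histogram exceeds a
    Gaussian of equal variance (zero mean assumed), or None if no such
    crossover is found."""
    n = len(responses)
    if n == 0:
        return None
    sigma = math.sqrt(sum(r * r for r in responses) / n)
    if sigma == 0.0:
        return None

    width = sigma / bins_per_sigma
    n_bins = int(max_sigma * bins_per_sigma)

    # Histogram of response magnitudes; values past max_sigma are ignored.
    counts = [0] * n_bins
    for r in responses:
        k = int(abs(r) / width)
        if k < n_bins:
            counts[k] += 1

    def expected(k):
        # Expected count of |X| in bin k when X ~ N(0, sigma^2).
        lo, hi = k * width, (k + 1) * width
        scale = sigma * math.sqrt(2.0)
        return n * (math.erf(hi / scale) - math.erf(lo / scale))

    # A heavy-tailed distribution sits above the Gaussian near zero, dips
    # below it at mid-range, and rises above it again in the tail. The
    # second crossing is taken as the threshold.
    seen_dip = False
    for k in range(n_bins):
        if counts[k] < expected(k):
            seen_dip = True
        elif seen_dip:
            return k * width
    return None


if __name__ == "__main__":
    responses = laplace_quantile_samples(20000)
    print(gaussian_crossover_threshold(responses))
```

In practice the `responses` list would hold the filter outputs for an image, and magnitudes above the returned value would be treated as candidate salient contour locations.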

    Performance, Development, and Analysis of Tactile vs. Visual Receptive Fields in Texture Tasks

    Texture segmentation is an effortless process in scene analysis, yet its neural mechanisms are not sufficiently understood. A common assumption in most current approaches is that texture segmentation is a vision problem. However, considering that texture is basically a surface property, this assumption can at times be misleading. One interesting possibility is that texture may be more intimately related to touch than to vision. Recent neurophysiological findings showed that receptive fields (RFs) for touch resemble those for vision, albeit with some subtle differences. To leverage this, here I propose three ways to investigate tactile receptive fields in the context of texture processing: (1) performance, (2) development, and (3) analysis. For performance, I tested how the distinct properties of tactile receptive fields affect texture segmentation performance, as compared to that of visual receptive fields. Preliminary results suggest that touch has an advantage over vision in texture segmentation. These results support the idea that texture is fundamentally a tactile (surface) property. The next question is what drives the two types of RFs, visual and tactile, to become different during cortical development. I investigated the possibility that tactile RFs and visual RFs emerge from the same cortical learning process, where the only difference is in the input type, natural-scene-like vs. texture-like. The main result is that models trained on natural scenes develop RFs resembling visual RFs, while those trained on texture develop RFs resembling tactile RFs. These results again suggest a tight link between texture and the tactile modality, from a developmental perspective. To investigate further the functional properties of these RFs in texture processing, the responses of tactile RFs and visual RFs were analyzed with manifold learning and with statistical approaches. The results showed that the touch-based manifold seems more suitable for texture processing and that desirable properties found in the visual RF response can carry over to the tactile domain. These results are expected to shed new light on the role of tactile perception of texture; help develop more powerful, biologically inspired texture segmentation algorithms; and further clarify the differences and similarities between touch and vision.

    Change blindness: eradication of gestalt strategies

    Arrays of eight texture-defined rectangles were used as stimuli in a one-shot change blindness (CB) task where there was a 50% chance that one rectangle would change orientation between two successive presentations separated by an interval. CB was eliminated by cueing the target rectangle in the first stimulus, reduced by cueing in the interval, and unaffected by cueing in the second presentation. This supports the idea that a representation was formed that persisted through the interval before being 'overwritten' by the second presentation (Landman et al, 2003 Vision Research 43 149–164). Another possibility is that participants used some kind of grouping or Gestalt strategy. To test this, we changed the spatial position of the rectangles in the second presentation by shifting them along imaginary spokes (by ±1 degree) emanating from the central fixation point. There was no significant difference in performance between this and the standard task [F(1,4)=2.565, p=0.185]. This may suggest two things: (i) Gestalt grouping is not used as a strategy in these tasks, and (ii) objects may be stored in, and retrieved from, a pre-attentional store during this task.
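As a quick arithmetic check on the reported statistic, the p-value for F(1,4)=2.565 can be recomputed without a statistics library. The sketch relies on two standard facts: an F statistic with one numerator degree of freedom equals the square of a t statistic with the same denominator degrees of freedom, and the t distribution with 4 degrees of freedom has a closed-form CDF. The function name is hypothetical, and the code only covers this (1, 4) case.

```python
import math


def p_value_f_1_4(f_stat):
    """Upper-tail p-value for an F statistic with (1, 4) degrees of freedom,
    computed as the two-sided p-value of t = sqrt(F) with 4 df."""
    t = math.sqrt(f_stat)
    # Closed-form CDF of Student's t with 4 degrees of freedom.
    cdf = 0.5 + t * (t * t + 6.0) / (2.0 * (t * t + 4.0) ** 1.5)
    return 2.0 * (1.0 - cdf)


if __name__ == "__main__":
    print(round(p_value_f_1_4(2.565), 3))
```

The result is consistent with the p = 0.185 quoted in the abstract.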

    What does the honeybee see? And how do we know?

    This book is the only account of what the bee, as an example of an insect, actually detects with its eyes. Bees detect some visual features such as edges and colours, but there is no sign that they reconstruct patterns or put together features to form objects. Bees detect motion but have no perception of what it is that moves, and certainly they do not recognize “things” by their shapes. Yet they clearly see well enough to fly and find food with a minute brain. Bee vision is therefore relevant to the construction of simple artificial visual systems, for example for mobile robots. The surprising conclusion is that bee vision is adapted to the recognition of places, not things. In this volume, Adrian Horridge also sets out the curious and contentious history of how bee vision came to be understood, with an account of a century of neglect of old experimental results, errors of interpretation, sharp disagreements, and failures of the scientific method. The design of the experiments and the methods of making inferences from observations are also critically examined, with the conclusion that scientists are often hesitant, imperfect and misleading, ignore the work of others, and fail to consider alternative explanations. The erratic path to understanding makes interesting reading for anyone with an analytical mind who thinks about the methods of science or the engineering of seeing machines

    Shape Representations Using Nested Descriptors

    The problem of shape representation is a core problem in computer vision. It can be argued that shape representation is the most central representational problem for computer vision, since unlike texture or color, shape alone can be used for perceptual tasks such as image matching, object detection and object categorization. This dissertation introduces a new shape representation called the nested descriptor. A nested descriptor represents shape both globally and locally by pooling salient scaled and oriented complex gradients in a large nested support set. We show that this nesting property introduces a nested correlation structure that enables a new local distance function called the nesting distance, which provides a provably robust similarity function for image matching. Furthermore, the nesting property suggests an elegant flower-like normalization strategy called a log-spiral difference. We show that this normalization enables a compact binary representation and is equivalent to a form of bottom-up saliency. This suggests that the nested descriptor's representational power is due to representing salient edges, which makes a fundamental connection between the saliency and local feature descriptor literature. In this dissertation, we introduce three examples of shape representation using nested descriptors: nested shape descriptors for imagery, nested motion descriptors for video and nested pooling for activities. We show evaluation results for these representations that demonstrate state-of-the-art performance for image matching, wide baseline stereo and activity recognition tasks.

    Understanding drawing: a cognitive account of observational process

    This thesis contributes to theorising observational drawing from a cognitive perspective. Our current understanding of drawing is developing rapidly through artistic and scientific enquiry. However, it remains fragmented because the frames of reference of those modes of enquiry do not coincide. Therefore, the foundations for a truly interdisciplinary understanding of observational drawing are still inceptive. This thesis seeks to add to those foundations by bridging artistic and scientific perspectives on observational process and the cognitive aptitudes underpinning it. The project is based on four case studies of experienced artists' drawing processes, with quantitative and qualitative data gathered: timing of eye and hand movements, and artists' verbal reports. The data sets are analysed with a generative approach, using behavioural and protocol analysis methods to yield comparative models that describe cognitive strategies for drawing. This forms a grounded framework that elucidates the cognitive activities and competences that observational process entails. Cognitive psychological theory is consulted to explain the observed behaviours, and the combined evidence is applied to understanding apparent discrepancies in existing accounts of drawing. In addition, the use of verbal reporting methods in drawing studies is evaluated. The study observes how the drawing process involves a segregation of activities that enables efficient use of limited and parametrically constrained cognitive resources. Differing drawing strategies are shown to share common key characteristics, including a staged use of selective visual attention and the capacity to temporarily postpone critical judgement in order to engage fully in periods of direct perception and action. The autonomy and regularity of those activities, demonstrated by the artists studied, indicate that drawing ability entails tacit self‐knowledge concerning the cognitive and perceptual capacities described in this thesis. This thesis presents drawing as a skill that involves strategic use of visual deconstruction, comparison, analogical transfer and repetitive cycles of construction, evaluation and revision. I argue that drawing skill acquisition and transfer can be facilitated by the elucidation of these processes. As such, this framework for describing and understanding drawing is offered to those who seek to understand, learn or teach observational practice, and to those who are taking a renewed interest in drawing as a tool for thought.

    She believes she is herself, which isn't complete madness: Becoming the Female Subject through Womanhood as Relation

    Senior Project submitted to The Division of Languages and Literature of Bard College