Search CORE

79 research outputs found

Objects and scenes classification with selective use of central and peripheral image content

Author: Alameer Ali
Degenaar Patrick
Nazarpour Kianoush
Publication venue: 'Elsevier BV'
Publication date: 01/01/2020
Field of study

University of Salford Institutional Repository

Edinburgh Research Explorer

Split fovea theory and the role of the two cerebral hemispheres in reading: a review of the evidence

Author: Brysbaert Marc
Ellis Andrew W
Publication venue: 'Elsevier BV'
Publication date: 01/01/2010
Field of study

Ghent University Academic Bibliography

Space-variant picture coding

Author: Popkin Timothy John
Publication venue: 'Queen Mary University of London'
Publication date: 01/01/2010
Field of study

PhDSpace-variant picture coding techniques exploit the strong spatial non-uniformity of the human visual system in order to increase coding efficiency in terms of perceived quality per bit. This thesis extends space-variant coding research in two directions. The first of these directions is in foveated coding. Past foveated coding research has been dominated by the single-viewer, gaze-contingent scenario. However, for research into the multi-viewer and probability-based scenarios, this thesis presents a missing piece: an algorithm for computing an additive multi-viewer sensitivity function based on an established eye resolution model, and, from this, a blur map that is optimal in the sense of discarding frequencies in least-noticeable- rst order. Furthermore, for the application of a blur map, a novel algorithm is presented for the efficient computation of high-accuracy smoothly space-variant Gaussian blurring, using a specialised filter bank which approximates perfect space-variant Gaussian blurring to arbitrarily high accuracy and at greatly reduced cost compared to the brute force approach of employing a separate low-pass filter at each image location. The second direction is that of artifi cially increasing the depth-of- field of an image, an idea borrowed from photography with the advantage of allowing an image to be reduced in bitrate while retaining or increasing overall aesthetic quality. Two synthetic depth of field algorithms are presented herein, with the desirable properties of aiming to mimic occlusion eff ects as occur in natural blurring, and of handling any number of blurring and occlusion levels with the same level of computational complexity. The merits of this coding approach have been investigated by subjective experiments to compare it with single-viewer foveated image coding. The results found the depth-based preblurring to generally be significantly preferable to the same level of foveation blurring

Queen Mary Research Online

OpenGrey Repository

A new type of eye movement model based on recurrent neural networks for simulating the gaze behavior of human reading.

Author: Ren Jinchang
Wang Xiaoming
Zhao Xinbo
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2019
Field of study

Traditional eye movement models are based on psychological assumptions and empirical data that are not able to simulate eye movement on previously unseen text data. To address this problem, a new type of eye movement model is presented and tested in this paper. In contrast to conventional psychology-based eye movement models, ours is based on a recurrent neural network (RNN) to generate a gaze point prediction sequence, by using the combination of convolutional neural networks (CNN), bidirectional long short-term memory networks (LSTM), and conditional random fields (CRF). The model uses the eye movement data of a reader reading some texts as training data to predict the eye movements of the same reader reading a previously unseen text. A theoretical analysis of the model is presented to show its excellent convergence performance. Experimental results are then presented to demonstrate that the proposed model can achieve similar prediction accuracy while requiring fewer features than current machine learning models

University of Strathclyde Institutional Repository

Open Access Institutional Repository at Robert Gordon University

Directory of Open Access Journals

The Contribution of the Magnocellular Visual Pathway to the Process of Visual Word Recognition

Author: Thomson Jacqueline
Publication venue
Publication date: 01/01/2007
Field of study

Previous research on visual word recognition has uncovered a variety of factors which influence how easily this process is achieved. Some factors are intrinsic to the word itself (e.g., length, frequency, regularity) and some are environmental factors (e.g., stimuli contrast or visual field position). Any proposed account of visual word recognition must consider not only the properties of the word itself, but also the properties of the visual system that processes the words. This thesis tested the hypothesis that the magnocellular visual pathway contributes to the processing of words and that this contribution is most evident when words are presented in parafoveal vision. Experiments 1 and 2 investigated the effect on the recognition of isolated words of limiting input to the visual system by occluding one eye. We looked at the effect of visual field presentation position and word length. Previous research using binocular viewing had shown a large length effect in the left visual field. We found that occluding the right eye reduced the left visual field length effect. Experiments 3, 4 and 5 looked at the impact of varying presentation position on competent readers and dyslexics. Numerous studies in sentence processing have shown that phonological information can be extracted during parafoveal preview. We asked whether dyslexics’ well attested phonological impairment will hinder their ability to extract phonological information in parafoveal vision. Experiments 3 and 4 demonstrated that only the dyslexic group showed an effect of word regularity. Experiment 5 used a rhyme-matching task to show that only dyslexic readers have a problem in extracting phonological information from word pairs presented to the right visual field. We relate this to magnocellular functioning. Experiments 6, 7 and 8 used isoluminant stimuli to directly test the consequences of inhibiting the magnocellular visual pathway on the recognition of words presented both foveally and parafoveally. The results of these experiments show that blocking the magnocellular pathway affects parafoveal areas of the visual field more than the foveal area and that words are affected by this whereas non-words are not. In conclusion, we demonstrated that the magnocellular pathway does contribute significantly to the recognition of words and that the parafoveal area of the retina is more heavily dependent on the magnocellular pathway compared to the foveal area of the retina. We go on to propose plans for future research looking at the role of the magnocellular pathway in parafoveal preview in sentence reading

Edinburgh Research Archive

Recommended from our members

Testing and training lifeguard visual search

Author: Laxton VA
Publication venue
Publication date: 01/09/2019
Field of study

Lifeguards play a crucial role in drowning prevention. However, current U.K. lifeguard qualifications are limited in training and assessing visual surveillance skills, and little is known about how lifeguards successfully detect drowning swimmers. To improve our understanding of lifeguard visual search skill, and explore the potential for improving this skill through training, this thesis had the following aims: (a) to identify whether visual skills for drowning detection improve with lifeguard experience, (b) to understand why such differences occur, and (c) design and valid a visual training intervention to improve drowning detection on the basis of these results. The first two studies investigated drowning-detection skills of participants with differing levels of lifeguard experience in a dynamic search task with simulated drownings. Lifeguards were found to detect drownings faster and more often than non-lifeguards. In three follow-up studies these results were replicated with more naturalistic stimuli. Video footage from an American wave pool was extracted, which showed genuine instances of swimmer distress. Results again demonstrated lifeguard superiority in detecting the drowning targets. Eye tracking measures, recorded on both the simulated and naturalistic clips, failed to reveal any differences between lifeguards and non-lifeguards, suggesting that superior drowning detection for lifeguards did not result from better scanning strategies per se. Following this, two cognitive mechanisms that may underlie drowning-detection skill were investigated. Lifeguard and non-lifeguard performance on Multiple Object Avoidance (MOA) and Functional Field of View (FFOV) tests was assessed. Although lifeguards had better MOA task performance compared to non-lifeguards, only the lifeguards’ accuracy at detecting the central target in the FFOV task predicted performance on a subsequent drowning detection task. It was concluded that superior drowning detection was a result of better classification recognition of drowning swimmers (which was the central task in the FFOV test). Based on these findings the final experiment explored the effectiveness of an intense classification training task to improve drowning detection. An intervention was designed that required participants to differentiate between videos of isolated drowning and non-drowning swimmers. Non-lifeguards trained in this intervention showed greater improvement on a subsequent drowning-detection task compared to untrained control participants, who completed an active-control task. The results of this thesis suggest that drowning-detection skill can be reliably assessed, and that foveal processing of drowning characteristics is key to lifeguards' superior performance. Isolating and training this key sub-skill improves drowning-detection performance and offers a method for training future lifeguards

Nottingham Trent Institutional Repository (IRep)

Negative and positive congruence effects in letters and shapes

Author: A. Brooks
A. D. Friederici
A. F. Kramer
A. F. Sanders
A. Huckauf
A. M. Treisman
A. Toet
B. A. Eriksen
B. Julesz
C. Leeuwen van
C. Leeuwen van
C. Leeuwen van
Cees Van Leeuwen
D. Bavelier
D. Besner
D. J. Field
D. J. Mewhort
E. Bedson
E. Fox
E. Freeman
E. Hazeltine
E. L. Bjork
E. T. Davis
G. Kanizsa
G. M. Reicher
H. Bouma
H. E. Egeth
H. Nakatani
H. Pashler
I. Kovacs
J. Baron
J. Duncan
J. Duncan
J. Duncan
J. E. Joseph
J. F. Stins
J. G. Rueckl
J. M. Wolfe
J. M. Wolfe
J. Miller
J. N. Rouder
J. R. Pomerantz
J. R. Stroop
K. A. Briand
K. Brendler
L. E. Krueger
L. E. Krueger
L. E. Krueger
L. Paquet
M. G. Eley
M. Grünau von
M. Hogeboom
M. Mason
N. J. Cepeda
N. Lavie
O. Zegarra-Moran
P. Bigsby
P. Gong
R. A. Rensink
R. H. Lubbe van der
R. Hagenaar
R. Kimchi
S. J. Leat
S. N. Greenberg
S. T. Chung
S. T. Orton
T. G. Ghirardelli
T. Lachmann
T. Lachmann
T. W. Deacon
Thomas Lachmann
U. Polat
V. Lollo Di
W. D. Ross
W. K. Estes
W. K. Estes
W. P. Banks
W. P. Banks
W. R. Garner
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Biologically inspired feature extraction for rotation and scale tolerant pattern analysis

Author: Vidacic Dragan
Publication venue: University of New Hampshire Scholars\u27 Repository
Publication date: 01/01/2009
Field of study

Biologically motivated information processing has been an important area of scientific research for decades. The central topic addressed in this dissertation is utilization of lateral inhibition and more generally, linear networks with recurrent connectivity along with complex-log conformal mapping in machine based implementations of information encoding, feature extraction and pattern recognition. The reasoning behind and method for spatially uniform implementation of inhibitory/excitatory network model in the framework of non-uniform log-polar transform is presented. For the space invariant connectivity model characterized by Topelitz-Block-Toeplitz matrix, the overall network response is obtained without matrix inverse operations providing the connection matrix generating function is bound by unity. It was shown that for the network with the inter-neuron connection function expandable in a Fourier series in polar angle, the overall network response is steerable. The decorrelating/whitening characteristics of networks with lateral inhibition are used in order to develop space invariant pre-whitening kernels specialized for specific category of input signals. These filters have extremely small memory footprint and are successfully utilized in order to improve performance of adaptive neural whitening algorithms. Finally, the method for feature extraction based on localized Independent Component Analysis (ICA) transform in log-polar domain and aided by previously developed pre-whitening filters is implemented. Since output codes produced by ICA are very sparse, a small number of non-zero coefficients was sufficient to encode input data and obtain reliable pattern recognition performance

UNH Scholars' Repository

Recommended from our members

An exploration into the contributing cognitive skills of lifeguard visual search

Author: Crundall D
Laxton V
Mackenzie AK
Publication venue: 'Wiley'
Publication date: 19/01/2022
Field of study

Lifeguard drowning detection in swimming pools and beach settings is influenced by experience. The current experiment explores the cognitive skills that might underlie this experience effect. Lifeguard and non-lifeguard performance in a domain-free multiple object avoidance (MOA) task and a partially domain-free functional field of view (FFOV) task was compared to performance on an occlusion-based drowning detection task. Lifeguards performed better than non-lifeguards on the MOA task and the FFOV central task (identifying whether an isolated swimmer was drowning). However, only performance in the central FFOV task was associated with performance in the occlusion-based drowning detection task, and this was the only part of the two tasks that was not domain-free. These results suggest lifeguard drowning detection is mainly driven through the learned ability to process behaviours of drowning swimmers quicker than non-lifeguards. Therefore, it may be possible to train novices’ ability to detect drowning swimmers through an exposure task

Nottingham Trent Institutional Repository (IRep)