52 research outputs found
Toward a further understanding of object feature binding: a cognitive neuroscience perspective.
The aim of this thesis is to lead to a further understanding of the neural mechanisms underlying object feature binding in the human brain. The focus is on information processing and integration in the visual system and visual shortterm memory. From a review of the literature it is clear that there are three major
competing binding theories, however, none of these individually solves the binding problem satisfactorily. Thus the aim of this research is to conduct behavioural experimentation into object feature binding, paying particular attention to visual short-term memory.
The behavioural experiment was designed and conducted using a within-subjects delayed responset ask comprising a battery of sixty-four composite objects each with three features and four dimensions in each of three conditions (spatial, temporal and spatio-temporal).Findings from the experiment,which focus on spatial and temporal aspects of object feature binding and feature proximity on
binding errors, support the spatial theories on object feature binding, in addition we propose that temporal theories and convergence, through hierarchical feature
analysis, are also involved. Because spatial properties have a dedicated processing neural stream, and temporal properties rely on limited capacity memory systems, memories for sequential information would likely be more
difficult to accuratelyr ecall. Our study supports other studies which suggest that both spatial and temporal coherence to differing degrees,may be involved in
object feature binding. Traditionally, these theories have purported to provide individual solutions, but this thesis proposes a novel unified theory of object feature binding in which hierarchical feature analysis, spatial attention and temporal synchrony each plays a role. It is further proposed that binding takes place in visual short-term memory through concerted and integrated information
processing in distributed cortical areas. A cognitive model detailing this integrated proposal is given. Next, the cognitive model is used to inform the design and suggested implementation of a computational model which would be
able to test the theory put forward in this thesis. In order to verify the model, future work is needed to implement the computational model.Thus it is argued
that this doctoral thesis provides valuable experimental evidence concerning spatio-temporal aspects of the binding problem and as such is an additional building block in the quest for a solution to the object feature binding problem
Feed forward neural networks and genetic algorithms for automated financial time series modelling
This thesis presents an automated system for financial time series modelling. Formal and applied
methods are investigated for combining feed-forward Neural Networks and Genetic Algorithms (GAs) into a
single adaptive/learning system for automated time series forecasting. Four important research contributions
arise from this investigation: i) novel forms of GAs are introduced which are designed to counter the
representational bias associated with the conventional Holland GA, ii) an experimental methodology for
validating neural network architecture design strategies is introduced, iii) a new method for network pruning
is introduced, and iv) an automated method for inferring network complexity for a given learning task is
devised. These methods provide a general-purpose applied methodology for developing neural network
applications and are tested in the construction of an automated system for financial time series modelling.
Traditional economic theory has held that financial price series are random. The lack of a priori models on
which to base a computational solution for financial modelling provides one of the hardest tests of adaptive
system technology. It is shown that the system developed in this thesis isolates a deterministic signal within
a Gilt Futures prices series, to a confidences level of over 99%, yielding a prediction accuracy of over 60%
on a single run of 1000 out-of-sample experiments.
An important research issue in the use of feed-forward neural networks is the problems associated
with parameterisation so as to ensure good generalisation. This thesis conducts a detailed examination of
this issue. A novel demonstration of a network's ability to act as a universal functional approximator for
finite data sets is given. This supplies an explicit formula for setting a network's architecture and weights in
order to map a finite data set to arbitrary precision. It is shown that a network's ability to generalise is
extremely sensitive to many parameter choices and that unless careful safeguards are included in the
experimental procedure over-fitting can occur. This thesis concentrates on developing automated techniques
so as to tackle these problems.
Techniques for using GAs to parameterise neural networks are examined. It is shown that the
relationship between the fitness function, the GA operators and the choice of encoding are all instrumental
in determining the likely success of the GA search. To address this issue a new style of GA is introduced
which uses multiple encodings in the course of a run. These are shown to out-perform the Holland GA on a
range of standard test functions. Despite this innovation it is argued that the direct use of GAs to neural
network parameterisation runs the risk of compounding the network sensitivity issue. Moreover, in the
absence of a precise formulation of generalisation a less direct use of GAs to network parameterisation is
examined. Specifically a technique, artficia1 network generation (ANG), is introduced in which a GA is
used to artificially generate test learning problems for neural networks that have known network solutions.
ANG provides a means for directly testing i) a neural net architecture, ii) a neural net training process, and
iii) a neural net validation procedure, against generalisation. ANG is used to provide statistical evidence in
favour of Occam's Razor as a neural network design principle. A new method for pruning and inferring
network complexity for a given learning problem is introduced. Network Regression Pruning (NRP) is a
network pruning method that attempts to derive an optimal network architecture by starting from what is
considered an overly large network. NRP differs radically from conventional pruning methods in that it
attempts to hold a trained network's mapping fixed as pruning proceeds. NRP is shown to be extremely
successful at isolating optimal network architectures on a range of test problems generated using ANG.
Finally, NRP and techniques validated using ANG are combined to implement an Automated Neural
network Time series Analysis System (ANTAS). ANTAS is applied to the gilt futures price series The Long
Gilt Futures Contract (LGFC)
Toward a further understanding of object feature binding : a cognitive neuroscience perspective
The aim of this thesis is to lead to a further understanding of the neural mechanisms underlying object feature binding in the human brain. The focus is on information processing and integration in the visual system and visual shortterm memory. From a review of the literature it is clear that there are three major competing binding theories, however, none of these individually solves the binding problem satisfactorily. Thus the aim of this research is to conduct behavioural experimentation into object feature binding, paying particular attention to visual short-term memory. The behavioural experiment was designed and conducted using a within-subjects delayed responset ask comprising a battery of sixty-four composite objects each with three features and four dimensions in each of three conditions (spatial, temporal and spatio-temporal).Findings from the experiment,which focus on spatial and temporal aspects of object feature binding and feature proximity on binding errors, support the spatial theories on object feature binding, in addition we propose that temporal theories and convergence, through hierarchical feature analysis, are also involved. Because spatial properties have a dedicated processing neural stream, and temporal properties rely on limited capacity memory systems, memories for sequential information would likely be more difficult to accuratelyr ecall. Our study supports other studies which suggest that both spatial and temporal coherence to differing degrees,may be involved in object feature binding. Traditionally, these theories have purported to provide individual solutions, but this thesis proposes a novel unified theory of object feature binding in which hierarchical feature analysis, spatial attention and temporal synchrony each plays a role. It is further proposed that binding takes place in visual short-term memory through concerted and integrated information processing in distributed cortical areas. A cognitive model detailing this integrated proposal is given. Next, the cognitive model is used to inform the design and suggested implementation of a computational model which would be able to test the theory put forward in this thesis. In order to verify the model, future work is needed to implement the computational model.Thus it is argued that this doctoral thesis provides valuable experimental evidence concerning spatio-temporal aspects of the binding problem and as such is an additional building block in the quest for a solution to the object feature binding problem.EThOS - Electronic Theses Online ServiceGBUnited Kingdo
Identification and control of dynamic systems using neural networks.
The aim of this thesis is to contribute in solving problems related to the on-line
identification and control of unknown dynamic systems using feedforward neural
networks. In this sense, this thesis presents new on-line learning algorithms for
feedforward neural networks based upon the theory of variable structure system
design, along with mathematical proofs regarding the convergence of solutions given
by the algorithms; the boundedness of these solutions; and robustness features of
the algorithms with respect to external perturbations affecting the neural networks'
signals.
In the thesis, the problems of on-line identification of the forward transfer
operator, and the inverse transfer operator of unknown dynamic systems are also
analysed, and neural networks-based identification schemes are proposed. These
identification schemes are tested by computer simulations on linear and nonlinear
unknown plants using both continuous-time and discrete-time versions of the proposed
learning algorithms.
The thesis reports about the direct inverse dynamics control problems using
neural networks, and contributes towards solving these problems by proposing a
direct inverse dynamics neural network-based control scheme with on-line learning
capabilities of the inverse dynamics of the plant, and the addition of a feedback
path that enables the resulting control scheme to exhibit robustness characteristics
with respect to external disturbances affecting the output of the system. Computer
simulation results on the performance of the mentioned control scheme in controlling
linear and nonlinear plants are also included.
The thesis also formulates a neural network-based internal model control scheme
with on-line estimation capabilities of the forward transfer operator and the inverse
transfer operator of unknown dynamic systems. The performance of this internal
model control scheme is tested by computer simulations using a stable open-loop
unknown plant with output signal corrupted by white noise.
Finally, the thesis proposes a neural network-based adaptive control scheme
where identification and control are simultaneously carried out
Development of cue integration with reward-mediated learning
This thesis will first introduce in more detail the Bayesian theory and its use in integrating multiple information sources. I will briefly talk about models and their relation to the dynamics of an environment, and how to combine multiple alternative models. Following that I will discuss the experimental findings on multisensory integration in humans and animals. I start with psychophysical results on various forms of tasks and setups, that show that the brain uses and combines information from multiple cues. Specifically, the discussion will focus on the finding that humans integrate this information in a way that is close to the theoretical optimal performance. Special emphasis will be put on results about the developmental aspects of cue integration, highlighting experiments that could show that children do not perform similar to the Bayesian predictions. This section also includes a short summary of experiments on how subjects handle multiple alternative environmental dynamics. I will also talk about neurobiological findings of cells receiving input from multiple receptors both in dedicated brain areas but also primary sensory areas. I will proceed with an overview of existing theories and computational models of multisensory integration. This will be followed by a discussion on reinforcement learning (RL). First I will talk about the original theory including the two different main approaches model-free and model-based reinforcement learning. The important variables will be introduced as well as different algorithmic implementations. Secondly, a short review on the mapping of those theories onto brain and behaviour will be given. I mention the most in uential papers that showed correlations between the activity in certain brain regions with RL variables, most prominently between dopaminergic neurons and temporal difference errors. I will try to motivate, why I think that this theory can help to explain the development of near-optimal cue integration in humans. The next main chapter will introduce our model that learns to solve the task of audio-visual orienting. Many of the results in this section have been published in [Weisswange et al. 2009b,Weisswange et al. 2011]. The model agent starts without any knowledge of the environment and acts based on predictions of rewards, which will be adapted according to the reward signaling the quality of the performed action. I will show that after training this model performs similarly to the prediction of a Bayesian observer. The model can also deal with more complex environments in which it has to deal with multiple possible underlying generating models (perform causal inference). In these experiments I use di#erent formulations of Bayesian observers for comparison with our model, and find that it is most similar to the fully optimal observer doing model averaging. Additional experiments using various alterations to the environment show the ability of the model to react to changes in the input statistics without explicitly representing probability distributions. I will close the chapter with a discussion on the benefits and shortcomings of the model. The thesis continues whith a report on an application of the learning algorithm introduced before to two real world cue integration tasks on a robotic head. For these tasks our system outperforms a commonly used approximation to Bayesian inference, reliability weighted averaging. The approximation is handy because of its computational simplicity, because it relies on certain assumptions that are usually controlled for in a laboratory setting, but these are often not true for real world data. This chapter is based on the paper [Karaoguz et al. 2011]. Our second modeling approach tries to address the neuronal substrates of the learning process for cue integration. I again use a reward based training scheme, but this time implemented as a modulation of synaptic plasticity mechanisms in a recurrent network of binary threshold neurons. I start the chapter with an additional introduction section to discuss recurrent networks and especially the various forms of neuronal plasticity that I will use in the model. The performance on a task similar to that of chapter 3 will be presented together with an analysis of the in uence of different plasticity mechanisms on it. Again benefits and shortcomings and the general potential of the method will be discussed. I will close the thesis with a general conclusion and some ideas about possible future work
Investigations into controllers for adaptive autonomous agents based on artificial neural networks.
This thesis reports the development and study of novel architectures for the simulation
of adaptive behaviour based on artificial neural networks. There are two distinct
levels of enquiry. At the primary level, the initial aim was to design and implement a
unified architecture integrating sensorimotor learning and overall control. This was
intended to overcome shortcomings of typical behaviour-based approaches in reactive
control settings. It was achieved in two stages. Initially, feedforward neural networks
were used at the sensorimotor level of a modular architecture and overall control was
provided by an algorithm. The algorithm was then replaced by a recurrent neural
network. For training, a form of reinforcement learning was used. This posed an
intriguing composite of the well-known action selection and credit assignment
problems. The solution was demonstrated in two sets of simulation studies involving
variants of each architecture. These studies also showed: firstly that the expected
advantages over the standard behaviour-based approach were realised, and secondly
that the new integrated architecture preserved these advantages, with the added value
of a unified control approach. The secondary level of enquiry addressed the more
foundational question of whether the choice of processing mechanism is critical if the
simulation of adaptive behaviour is to progress much beyond the reactive stage in
more than a trivial sense. It proceeded by way of a critique of the standard behaviourbased
approach to make a positive assessment of the potential for recurrent neural
networks to fill such a role. The findings were used to inform further investigations at
the primary level of enquiry. These were based on a framework for the simulation of
delayed response learning using supervised learning techniques. A further new
architecture, based on a second-order recurrent neural network, was designed for this
set of studies. It was then compared with existing architectures. Some interesting
results are presented to indicate the appropriateness of the design and the potential of
the approach, though limitations in the long run are not discounted
- …