Can the human mind learn to backward induce? A neural network answer.
This paper addresses the question of whether neural networks, a realistic cognitive model of human information processing, can learn to backward induce in a two-stage game with a unique subgame-perfect Nash equilibrium. The finding that the networks only learn a heuristic approximating the desired output, rather than backward inducing, is in accordance with the documented difficulty humans have in applying backward induction and their reliance on heuristics.

Keywords: behavioral game theory; neural networks; learning
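To make the task concrete, here is a minimal sketch of backward induction in a two-stage game. The payoff table and action labels are illustrative, not taken from the paper; the point is only the mechanical procedure the networks fail to learn: solve the second stage first, then let the first mover anticipate those replies.

```python
# game[a1][a2] = (payoff to player 1, payoff to player 2); values are illustrative.
game = {
    "L": {"l": (3, 1), "r": (0, 0)},
    "R": {"l": (1, 2), "r": (2, 1)},
}

def backward_induce(game):
    """Find the subgame-perfect equilibrium by solving the last stage first."""
    # Stage 2: for each first-stage action, player 2 best-responds.
    replies = {a1: max(sub, key=lambda a2: sub[a2][1])
               for a1, sub in game.items()}
    # Stage 1: player 1 anticipates those replies and maximises own payoff.
    a1 = max(game, key=lambda a: game[a][replies[a]][0])
    return a1, replies[a1]

print(backward_induce(game))  # ('L', 'l')
```

With these payoffs, player 2 replies "l" to either first move, so player 1 picks "L" for a payoff of 3; the unique subgame-perfect equilibrium is ('L', 'l').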
An Efficient Threshold-Driven Aggregate-Label Learning Algorithm for Multimodal Information Processing
The aggregate-label learning paradigm tackles the long-standing temporal credit assignment (TCA) problem in neuroscience and machine learning, enabling spiking neural networks to learn multimodal sensory clues with delayed feedback signals. However, the existing aggregate-label learning algorithms only work for single spiking neurons and suffer from low learning efficiency, which limits their real-world applicability. To address these limitations, we first propose an efficient threshold-driven plasticity algorithm for spiking neurons, namely ETDP. It enables spiking neurons to generate the desired number of spikes that match the magnitude of delayed feedback signals and to learn useful multimodal sensory clues embedded within spontaneous spiking activities. Furthermore, we extend the ETDP algorithm to support multi-layer spiking neural networks (SNNs), which significantly improves the applicability of aggregate-label learning algorithms. We also validate the multi-layer ETDP learning algorithm in a multimodal computation framework for audio-visual pattern recognition. Experimental results on both synthetic and realistic datasets show significant improvements in learning efficiency and model capacity over the existing aggregate-label learning algorithms. The algorithm therefore provides many opportunities for solving real-world multimodal pattern recognition tasks with spiking neural networks.
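The core idea of aggregate-label learning, matching a neuron's output spike count to a delayed scalar feedback signal, can be sketched with a leaky integrate-and-fire neuron and a crude count-error update. This is a generic illustration of the count-matching objective only; the actual ETDP update rule is threshold-driven and considerably more refined, and all constants below are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
T, n_in = 200, 30                        # time steps, input afferents
inputs = (rng.random((T, n_in)) < 0.05)  # Poisson-like input spike trains
w = rng.normal(0.0, 0.05, n_in)          # synaptic weights
target_count = 5                         # delayed aggregate feedback signal

def spike_count(w):
    """Simulate a leaky integrate-and-fire neuron; return its spike count."""
    v, count = 0.0, 0
    for t in range(T):
        v = 0.9 * v + inputs[t] @ w      # leaky integration of input current
        if v >= 1.0:                     # threshold crossing -> output spike
            count += 1
            v = 0.0                      # reset after the spike
    return count

for _ in range(500):                     # crude error-driven weight update
    err = target_count - spike_count(w)
    if err == 0:
        break
    # potentiate (err > 0) or depress (err < 0) active synapses
    w += 0.01 * err * inputs.mean(axis=0)
```

After training, the neuron emits (close to) the target number of spikes for this input pattern, which is the behaviour the delayed feedback demands.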
Feed-Forward Neural Networks Need Inductive Bias to Learn Equality Relations
Basic binary relations such as equality and inequality are fundamental to relational data structures. Neural networks should learn such relations and generalise to new unseen data. We show in this study, however, that this generalisation fails with standard feed-forward networks on binary vectors. Even when trained with maximal training data, standard networks do not reliably detect equality.
We introduce differential rectifier (DR) units that we add to the network in different configurations. The DR units create an inductive bias in the networks so that they do learn to generalise, even from small numbers of examples, and we have found no negative effect of their inclusion in the network. Given the fundamental nature of these relations, we hypothesize that feed-forward neural network learning benefits from inductive bias in other relations as well. Consequently, the further development of suitable inductive biases will be beneficial to many tasks in relational learning with neural networks.
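The flavour of such an inductive bias can be sketched as follows: instead of feeding a network the raw concatenation of two binary vectors (a, b), give it the rectified element-wise differences |a_i - b_i|. Equality then reduces to checking that every difference is zero, a single linear threshold that generalises to vectors never seen in training. The exact DR architecture in the paper may differ; this is only a minimal illustration of why the bias helps.

```python
import numpy as np

def dr_features(a, b):
    # Rectified element-wise differences between the two input halves.
    return np.abs(np.asarray(a) - np.asarray(b))

def equal(a, b):
    # A single linear threshold on the DR features decides equality.
    return bool(dr_features(a, b).sum() < 0.5)

print(equal([1, 0, 1, 1], [1, 0, 1, 1]))  # True
print(equal([1, 0, 1, 1], [1, 0, 0, 1]))  # False
```

A plain feed-forward network on the raw concatenation must instead memorise which of the exponentially many (a, b) pairs happen to match, which is exactly the generalisation failure the abstract reports.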
Optimizing thermodynamic trajectories using evolutionary and gradient-based reinforcement learning
Using a model heat engine, we show that neural network-based reinforcement learning can identify thermodynamic trajectories of maximal efficiency. We consider both gradient and gradient-free reinforcement learning. We use an evolutionary learning algorithm to evolve a population of neural networks, subject to a directive to maximize the efficiency of a trajectory composed of a set of elementary thermodynamic processes; the resulting networks learn to carry out the maximally-efficient Carnot, Stirling, or Otto cycles. When given an additional irreversible process, this evolutionary scheme learns a previously unknown thermodynamic cycle. Gradient-based reinforcement learning is able to learn the Stirling cycle, whereas an evolutionary approach achieves the optimal Carnot cycle. Our results show how the reinforcement learning strategies developed for game playing can be applied to solve physical problems conditioned upon path-extensive order parameters.
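The gradient-free scheme can be sketched as a simple mutate-and-select loop over a population of parameter vectors, driven by a scalar "efficiency" directive. The toy quadratic objective below stands in for the heat-engine simulation and the neural-network controllers in the paper; only the evolutionary loop itself is meant to be illustrative, and the population size, elite fraction, and mutation scale are assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

def efficiency(theta):
    # Stand-in objective with a unique maximum at theta = (1, 1, 1, 1).
    return -np.sum((theta - 1.0) ** 2)

pop = [rng.normal(size=4) for _ in range(32)]    # initial population
for gen in range(200):
    pop.sort(key=efficiency, reverse=True)
    elite = pop[:8]                              # selection: keep the fittest
    pop = elite + [e + 0.05 * rng.normal(size=4) # mutation: perturb elites
                   for e in elite for _ in range(3)]

best = max(pop, key=efficiency)
print(np.round(best, 2))
```

Because the elites are carried over unchanged, the best fitness is monotone non-decreasing, and after a few hundred generations the population clusters near the optimum; in the paper the same loop evolves network weights whose trajectories realise the Carnot cycle.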