8,436 research outputs found
Towards a learning-theoretic analysis of spike-timing dependent plasticity
This paper suggests a learning-theoretic perspective on how synaptic
plasticity benefits global brain functioning. We introduce a model, the
selectron, that (i) arises as the fast time constant limit of leaky
integrate-and-fire neurons equipped with spiking timing dependent plasticity
(STDP) and (ii) is amenable to theoretical analysis. We show that the selectron
encodes reward estimates into spikes and that an error bound on spikes is
controlled by a spiking margin and the sum of synaptic weights. Moreover, the
efficacy of spikes (their usefulness to other reward maximizing selectrons)
also depends on total synaptic strength. Finally, based on our analysis, we
propose a regularized version of STDP, and show the regularization improves the
robustness of neuronal learning when faced with multiple stimuli.Comment: To appear in Adv. Neural Inf. Proc. System
A Survey of Monte Carlo Tree Search Methods
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work
Finance Applications of Game Theory
Traditional finance theory based on the assumptions of symmetric information and perfect and competitive markets has provided many important insights. These include the Modigliani and Miller Theorems, the CAPM, the Efficient Markets Hypothesis and continuous time finance. However, many empirical phenomena are difficult to reconcile with this traditional framework. Game theoretic techniques have allowed insights into a number of these. Many puzzles remain. This paper argues that recent advances in game theory concerned with higher order beliefs, informational cascades and heterogeneous prior beliefs have the potential to provide insights into some of these remaining puzzles.
Re-reading Jevons's Principles of Science - Induction Redux
In this paper I try to substantiate the thesis that Jevons may have been too harsh on the vices of induction and generously optimistic about the virtues of deduction, as discussed, primarily, in his magnum opus, The Principles of Science [6]. With this aim in mind the paper attempts to suggest (modern), recursion theoretic, theoretical technologies that could reduce and, under conditions that I claim would be acceptable to Jevons, even eliminate the inductive indeterminacies that he had emphasised.Jevons, Inductiion, Inductive Inference, Bayes's Rule
- …