8,436 research outputs found

    Towards a learning-theoretic analysis of spike-timing dependent plasticity

    Full text link
    This paper suggests a learning-theoretic perspective on how synaptic plasticity benefits global brain functioning. We introduce a model, the selectron, that (i) arises as the fast time constant limit of leaky integrate-and-fire neurons equipped with spiking timing dependent plasticity (STDP) and (ii) is amenable to theoretical analysis. We show that the selectron encodes reward estimates into spikes and that an error bound on spikes is controlled by a spiking margin and the sum of synaptic weights. Moreover, the efficacy of spikes (their usefulness to other reward maximizing selectrons) also depends on total synaptic strength. Finally, based on our analysis, we propose a regularized version of STDP, and show the regularization improves the robustness of neuronal learning when faced with multiple stimuli.Comment: To appear in Adv. Neural Inf. Proc. System

    A Survey of Monte Carlo Tree Search Methods

    Get PDF
    Monte Carlo tree search (MCTS) is a recently proposed search method that combines the precision of tree search with the generality of random sampling. It has received considerable interest due to its spectacular success in the difficult problem of computer Go, but has also proved beneficial in a range of other domains. This paper is a survey of the literature to date, intended to provide a snapshot of the state of the art after the first five years of MCTS research. We outline the core algorithm's derivation, impart some structure on the many variations and enhancements that have been proposed, and summarize the results from the key game and nongame domains to which MCTS methods have been applied. A number of open research questions indicate that the field is ripe for future work

    Finance Applications of Game Theory

    Get PDF
    Traditional finance theory based on the assumptions of symmetric information and perfect and competitive markets has provided many important insights. These include the Modigliani and Miller Theorems, the CAPM, the Efficient Markets Hypothesis and continuous time finance. However, many empirical phenomena are difficult to reconcile with this traditional framework. Game theoretic techniques have allowed insights into a number of these. Many puzzles remain. This paper argues that recent advances in game theory concerned with higher order beliefs, informational cascades and heterogeneous prior beliefs have the potential to provide insights into some of these remaining puzzles.

    Re-reading Jevons's Principles of Science - Induction Redux

    Get PDF
    In this paper I try to substantiate the thesis that Jevons may have been too harsh on the vices of induction and generously optimistic about the virtues of deduction, as discussed, primarily, in his magnum opus, The Principles of Science [6]. With this aim in mind the paper attempts to suggest (modern), recursion theoretic, theoretical technologies that could reduce and, under conditions that I claim would be acceptable to Jevons, even eliminate the inductive indeterminacies that he had emphasised.Jevons, Inductiion, Inductive Inference, Bayes's Rule
    corecore