Search CORE

37,083 research outputs found

Assessing the Potential of Classical Q-learning in General Game Playing

Author: CB Browne
CJCH Watkins
CP Robert
D Silver
D Silver
H Wang
J Hu
J Méhat
M Genesereth
M Genesereth
M Świechowski
RS Sutton
V Mnih
Publication venue
Publication date: 14/10/2018
Field of study

\&

Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex)\footnote{source code: https://github.com/wh1992v/ggp-rl}, to allow comparison to Banerjee et al.. We find that Q-learning converges to a high win rate in GGP. For the

\epsilon

-greedy strategy, we propose a first enhancement, the dynamic

\epsilon

algorithm. In addition, inspired by (Gelly

\&

Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games.Comment: arXiv admin note: substantial text overlap with arXiv:1802.0594

arXiv.org e-Print Archive

Crossref

Leiden University Scholary Publications

Assessing the Potential of Classical Q-learning in General Game Playing

Author: CB Browne
CJCH Watkins
CP Robert
D Silver
D Silver
H Wang
J Hu
J Méhat
M Genesereth
M Genesereth
M Świechowski
RS Sutton
V Mnih
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 25/09/2019
Field of study

After the recent groundbreaking results of AlphaGo and AlphaZero, we have seen strong interests in deep reinforcement learning and artificial general intelligence (AGI) in game playing. However, deep learning is resource-intensive and the theory is not yet well developed. For small games, simple classical table-based Q-learning might still be the algorithm of choice. General Game Playing (GGP) provides a good testbed for reinforcement learning to research AGI. Q-learning is one of the canonical reinforcement learning methods, and has been used by (Banerjee & Stone, IJCAI 2007) in GGP. In this paper we implement Q-learning in GGP for three small-board games (Tic-Tac-Toe, Connect Four, Hex), to allow comparison to Banerjee et al. We find that Q-learning converges to a high win rate in GGP. For the ϵ" role="presentation" style="display: inline-table; line-height: normal; letter-spacing: normal; word-spacing: normal; overflow-wrap: normal; white-space: nowrap; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; border-width: 0px; border-style: initial; position: relative;">ϵ-greedy strategy, we propose a first enhancement, the dynamic ϵ" role="presentation" style="display: inline-table; line-height: normal; letter-spacing: normal; word-spacing: normal; overflow-wrap: normal; white-space: nowrap; float: none; direction: ltr; max-width: none; max-height: none; min-width: 0px; min-height: 0px; border-width: 0px; border-style: initial; position: relative;">ϵ algorithm. In addition, inspired by (Gelly & Silver, ICML 2007) we combine online search (Monte Carlo Search) to enhance offline learning, and propose QM-learning for GGP. Both enhancements improve the performance of classical Q-learning. In this work, GGP allows us to show, if augmented by appropriate enhancements, that classical table-based Q-learning can perform well in small games.Computer Systems, Imagery and Medi

Crossref

Leiden University Scholary Publications

Understanding the Cultural Value of 'In Harmony-Sistema England'

Author: Phillips Tom
Rimmer Mark
Street John
Publication venue
Publication date: 28/10/2014
Field of study

This research project on which this paper reports was designed to explore questions of cultural value in relation to the schools music project In Harmony-Sistema England. Our core research focus has been upon the ways in which children, their teachers and tutors, and their families understand the value of their participation in IHSE initiatives. The project engaged with three case studies of IHSE initiatives (based in Norwich, Telford and Newcastle) and qualitative data was gathered with primary school children, school staff, parents and IHSE musicians in all three cases

University of East Anglia digital repository

Unmasking Clever Hans Predictors and Assessing What Machines Really Learn

Author: Binder Alexander
Lapuschkin Sebastian
Montavon Grégoire
Müller Klaus-Robert
Samek Wojciech
Wäldchen Stephan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 26/02/2019
Field of study

Current learning machines have successfully solved hard application problems, reaching high accuracy and displaying seemingly "intelligent" behavior. Here we apply recent techniques for explaining decisions of state-of-the-art learning machines and analyze various tasks from computer vision and arcade games. This showcases a spectrum of problem-solving behaviors ranging from naive and short-sighted, to well-informed and strategic. We observe that standard performance evaluation metrics can be oblivious to distinguishing these diverse problem solving behaviors. Furthermore, we propose our semi-automated Spectral Relevance Analysis that provides a practically effective way of characterizing and validating the behavior of nonlinear learning machines. This helps to assess whether a learned model indeed delivers reliably for the problem that it was conceived for. Furthermore, our work intends to add a voice of caution to the ongoing excitement about machine intelligence and pledges to evaluate and judge some of these recent successes in a more nuanced manner.Comment: Accepted for publication in Nature Communication

arXiv.org e-Print Archive

Directory of Open Access Journals

Fraunhofer-ePrints

Internet and gaming addiction: a systematic literature review of neuroimaging studies

Author: Adalier
Bailey
Beck
Beutel
Birmaher
Brebner
Chen
Costa
Craven
Derogatis
Descartes
Dong
First
Ge
Goldstein
Grant
Han
Hoeft
Hou
Huang
Huettel
Ko
Koepp
Koob
Le Bihan
Lebcrubier
Lee
Lemmens
Lin
Lin
Littel
Liu
Luck
Niedermeyer
Pavlov
Prochaska
Repovš
Sheehan
Skinner
Tao
Tao
Thomas
Van Rooij
Volkow
Wang
Wilson
Yang
Yi
Young
Young
Yuan
Zhou
Publication venue: 'MDPI AG'
Publication date: 01/01/2012
Field of study

In the past decade, research has accumulated suggesting that excessive Internet use can lead to the development of a behavioral addiction. Internet addiction has been considered as a serious threat to mental health and the excessive use of the Internet has been linked to a variety of negative psychosocial consequences. The aim of this review is to identify all empirical studies to date that used neuroimaging techniques to shed light upon the emerging mental health problem of Internet and gaming addiction from a neuroscientific perspective. Neuroimaging studies offer an advantage over traditional survey and behavioral research because with this method, it is possible to distinguish particular brain areas that are involved in the development and maintenance of addiction. A systematic literature search was conducted, identifying 18 studies. These studies provide compelling evidence for the similarities between different types of addictions, notably substance-related addictions and Internet and gaming addiction, on a variety of levels. On the molecular level, Internet addiction is characterized by an overall reward deficiency that entails decreased dopaminergic activity. On the level of neural circuitry, Internet and gaming addiction led to neuroadaptation and structural changes that occur as a consequence of prolonged increased activity in brain areas associated with addiction. On a behavioral level, Internet and gaming addicts appear to be constricted with regards to their cognitive functioning in various domains. The paper shows that understanding the neuronal correlates associated with the development of Internet and gaming addiction will promote future research and will pave the way for the development of addiction treatment approaches

Multidisciplinary Digital Publishing Institute

CiteSeerX

Crossref

Nottingham Trent Institutional Repository (IRep)

Directory of Open Access Journals

PubMed Central

Evolutionary games on graphs

Author: Abramson
Ahmed
Aktipis
Albert
Alexander
Alonso-Sanz
Amaral
Antal
Antal
Arthur
Ashlock
Atman
Aumann
Axelrod
Axelrod
Axelrod
Axelrod
Axelrod
Axelrod
Axelrod
Bak
Bala
Ball
Barabási
Baumol
Ben-Naim
Ben-Naim
Benaim
Benzi
Berg
Berg
Bidaux
Biely
Binder
Binmore
Binmore
Binmore
Bishop
Blarer
Blume
Blume
Blume
Blume
Blume
Boccaletti
Boerlijst
Boerlijst
Bollobás
Bollobás
Bomze
Bomze
Bradley
Bramson
Brauchli
Bray
Broom
Brosig
Brower
Brown
Busse
Camerer
Cardy
Cardy
Challet
Challet
Chandler
Chiappin
Clifford
Colman
Conlisk
Coolen
Coricelli
Cressman
Cross
Czárán
Dawkins
Derényi
Dickman
Dickman
Dieckmann
Doebeli
Domany
Dornic
Dorogovtsev
Dorogovtsev
Douglass
Drossel
Drossel
Du
Dugatkin
Duran
Durrett
Durrett
Durrett
Dutta
Ebel
Eigen
Eisert
Ellner
Equíluz
Erdős
Fehr
Field
Fisch
Fisher
Forsythe
Fort
Foster
Frachebourg
Frachebourg
Frachebourg
Frean
Freidlin
Frick
Friedman
Fudenberg
Fudenberg
Fudenberg
Fuks
Föllmer
Gambarelli
Gammaitoni
Gao
Gao
Gardiner
Gardner
Gatenby
Geritz
Gibbons
Gilpin
Gintis
Glauber
Gould
Grassberger
Grassberger
Greenberg
Grim
Grim
Guan
Gutowitz
György Szabó
Györgyi
Gábor Fáth
Gómez-Gardeñez
Güth
Haken
Hamilton
Hamilton
Hardin
Hardin
Harris
Harsanyi
Hauert
Hauert
Hauert
Hauert
Hauert
Hauert
Hauk
He
Helbing
Helbing
Helbing
Hempel
Henrich
Hinrichsen
Hofbauer
Hofbauer
Hofbauer
Hofbauer
Holland
Holley
Holme
Huberman
Ifti
Ifti
Imhof
Jackson
Jansen
Janssen
Jensen
Johnson
Johnson
Joo
Jung
Kandori
Katz
Katz
Kawasaki
Kelly
Kermack
Kerr
Killingback
Killingback
Killingback
Kim
Kim
Kinzel
Kirchkamp
Kirkup
Kittel
Kobayashi
Kraines
Kraines
Krapivsky
Kreft
Kreps
Kuperman
Kuznetsov
Ledyard
Lee
Lee
Lee
Lewontin
Lieberman
Liggett
Lim
Lin
Lindgren
Lindgren
MacLean
Macy
Marro
Marsili
Martins
Masuda
Masuda
May
Maynard Smith
Maynard Smith
Maynard Smith
Meron
Metz
Meyer
Mie¸kisz
Mie¸kisz
Mie¸kisz
Milgram
Mobilia
Molander
Monderer
Moran
Mukherji
Mézard
Nakamaru
Nakamaru
Nash
Newman
Newman
Newman
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Nowak
Ohta
Ohtsuki
Ohtsuki
Pacheco
Pacheco
Page
Page
Palla
Panchanathan
Perc
Perc
Perc
Perc
Pettit
Pfeiffer
Pfeiffer
Pikovsky
Posch
Posch
Poundstone
Prager
Provata
Ralston
Rapoport
Rasmussen
Ravasz
Reichenbach
Reichenbach
Riolo
Robson
Roca
Russell
Saijo
Samuelson
Samuelson
Santos
Santos
Santos
Santos
Santos
Santos
Sato
Sato
Schlag
Schlag
Schmittmann
Schnakenberg
Schwarz
Schweitzer
Selten
Selten
Semmann
Shapley
Sigmund
Sigmund
Silvertown
Sinervo
Skyrms
Skyrms
Stanley
Sysi-Aho
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szabó
Szolnoki
Szolnoki
Szolnoki
Szolnoki
Szolnoki
Sánchez
Tainaka
Tainaka
Tainaka
Tainaka
Tainaka
Tainaka
Tainaka
Tang
Taylor
Taylor
Thaler
Thorndike
Tomassini
Tomochi
Toral
Traulsen
Traulsen
Traulsen
Traulsen
Traulsen
Traulsen
Traulsen
Traulsen
Traulsen
Traulsen
Trivers
Trivers
Turner
Vainstein
Vainstein
Vilenkin
von Neumann
von Neumann
Vukov
Wakano
Watt
Watts
Wedekind
Weibull
Weidlich
Wiener
Wild
Wilhelm
Winfree
Wolfram
Wolfram
Wolfram
Wolpert
Wormald
Wu
Wu
Wu
Young
Zeeman
Zimmermann
Zimmermann
Zimmermann
Zimmermann
Publication venue: 'Elsevier BV'
Publication date: 24/09/2007
Field of study

Game theory is one of the key paradigms behind many scientific disciplines from biology to behavioral sciences to economics. In its evolutionary form and especially when the interacting agents are linked in a specific social network the underlying solution concepts and methods are very similar to those applied in non-equilibrium statistical physics. This review gives a tutorial-type overview of the field for physicists. The first three sections introduce the necessary background in classical and evolutionary game theory from the basic definitions to the most important results. The fourth section surveys the topological complications implied by non-mean-field-type social network structures in general. The last three sections discuss in detail the dynamic behavior of three prominent classes of models: the Prisoner's Dilemma, the Rock-Scissors-Paper game, and Competing Associations. The major theme of the review is in what sense and how the graph structure of interactions can modify and enrich the picture of long term behavioral patterns emerging in evolutionary games.Comment: Review, final version, 133 pages, 65 figure

arXiv.org e-Print Archive

Crossref

Using a gamified monitoring app to change adolescents' snack intake : the development of the REWARD app and evaluation design

Author: B. Deforche
C. Braet
C. Lachat
J. Van Camp
J. Vangeel
K. Beullens
L. Goossens
L. Maes
L. Vervoort
N. De Cock
S. Eggermont
W. Van Lippevelde
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

Background: As the snacking pattern of European adolescents is of great concern, effective interventions are necessary. Till now health promotion efforts in children and adolescents have had only limited success in changing adolescents' eating patterns and anthropometrics. Therefore, the present study proposes an innovative approach to influence dietary behaviors in youth based on new insights on effective behavior change strategies and attractive intervention channels to engage adolescents. This article describes the rationale, the development, and evaluation design of the 'Snack Track School' app. The aim of the app is to improve the snacking patterns of Flemish 14- to 16-year olds. Methods: The development of the app was informed by the systematic, stepwise, iterative, and collaborative principles of the Intervention Mapping protocol. A four week mHealth intervention was developed based on the dual-system model with behavioral change strategies targeting both the reflective (i.e., active learning, advance organizers, mere exposure, goal-setting, monitoring, and feedback) and automatic processes (i.e., rewards and positive reinforcement). This intervention will be evaluated via a controlled pre-post design in Flemish schools among 1400 adolescents. Discussion: When this intervention including strategies focused on both the reflective and automatic pathway proves to be effective, it will offer a new scientifically-based vision, guidelines and practical tools for public health and health promotion (i.e., incorporation of learning theories in intervention programs)

Springer - Publisher Connector

Ghent University Academic Bibliography

Directory of Open Access Journals

PubMed Central

How agency models inspire large scale participatory planning and its evaluation

Author: Abrami G.
Barreteau O.
Ducrot R.
Ferrand N.
Hassenforder E.
Publication venue: HAL CCSD
Publication date: 01/01/2013
Field of study

International audienceWe describe how three models, for sustainable change, human agency in collective resource management, and socio-environmental systems, have been used to design a protocol and the tools for a large scale (1500 participants, 35 villages) multi-level participatory process held in Africa for Integrated Natural Resource Management, through the European Project Afromaison. The process especially combines a common action model to support proposals by stakeholders, an integration matrix to build coherent plans, a role playing game design process, and a method to combine planning and playing to engage into the plans. It has also inspired the design of the attached monitoring and evaluation process. We describe the process in two countries, Ethiopia and Uganda, present the theoretical bases of the evaluation framework using the ENCORE paradigm and the implemented methodology transferred to local evaluators. We introduce some results and propose comments on potential learning back to the modelling community

HAL-IRD

Agritrop

HAL-CIRAD