
    Building Machines That Learn and Think Like People

    Recent progress in artificial intelligence (AI) has renewed interest in building systems that learn and think like people. Many advances have come from using deep neural networks trained end-to-end in tasks such as object recognition, video games, and board games, achieving performance that equals or even beats humans in some respects. Despite their biological inspiration and performance achievements, these systems differ from human intelligence in crucial ways. We review progress in cognitive science suggesting that truly human-like learning and thinking machines will have to reach beyond current engineering trends in both what they learn, and how they learn it. Specifically, we argue that these machines should (a) build causal models of the world that support explanation and understanding, rather than merely solving pattern recognition problems; (b) ground learning in intuitive theories of physics and psychology, to support and enrich the knowledge that is learned; and (c) harness compositionality and learning-to-learn to rapidly acquire and generalize knowledge to new tasks and situations. We suggest concrete challenges and promising routes towards these goals that can combine the strengths of recent neural network advances with more structured cognitive models.
    Comment: In press at Behavioral and Brain Sciences. Open call for commentary proposals (until Nov. 22, 2016). https://www.cambridge.org/core/journals/behavioral-and-brain-sciences/information/calls-for-commentary/open-calls-for-commentar

    Design for a Darwinian Brain: Part 1. Philosophy and Neuroscience

    Physical symbol systems are needed for open-ended cognition. A good way to understand physical symbol systems is by comparing thought to chemistry: both have systematicity, productivity, and compositionality. The state of the art in cognitive architectures for open-ended cognition is critically assessed. I conclude that a cognitive architecture that evolves symbol structures in the brain is a promising candidate to explain open-ended cognition. Part 2 of the paper presents such a cognitive architecture.
    Comment: Darwinian Neurodynamics. Submitted as a two-part paper to Living Machines 2013, Natural History Museum, London.
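    To make the three properties concrete, here is a minimal sketch (illustrative only, not from the paper): the Atom/Compound types and the loves(john, mary) example are assumptions chosen purely for the demonstration, showing how a fixed set of symbolic parts recombines systematically and nests without bound.

```python
# Minimal sketch of symbol structures (hypothetical, not the paper's model).
from dataclasses import dataclass

@dataclass(frozen=True)
class Atom:
    name: str

@dataclass(frozen=True)
class Compound:
    head: Atom
    args: tuple  # nested Atoms and/or Compounds

john, mary, loves = Atom("john"), Atom("mary"), Atom("loves")

s1 = Compound(loves, (john, mary))  # loves(john, mary)
s2 = Compound(loves, (mary, john))  # systematicity: same parts recombined
s3 = Compound(loves, (john, s1))    # productivity: unbounded nesting

print(s1)
print(s2)
print(s3)
```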

    Approximate performability and dependability analysis using generalized stochastic Petri Nets

    Since current-day fault-tolerant and distributed computer and communication systems tend to be large and complex, their corresponding performability models will suffer from the same characteristics. Therefore, calculating performability measures from these models is a difficult and time-consuming task.

    To alleviate the largeness and complexity problem to some extent, we use generalized stochastic Petri nets to describe the models and to automatically generate the underlying Markov reward models. However, many models still cannot be solved with current numerical techniques, even though they are conveniently and often compactly described.

    In this paper we discuss two heuristic state-space truncation techniques that allow us to obtain very good approximations of the steady-state performability while assessing only a few percent of the states of the untruncated model. For a class of reversible models we derive explicit lower and upper bounds on the exact steady-state performability. For a much wider class of models, a truncation theorem exists that allows one to bound the error made in the truncation. We discuss this theorem in the context of approximate performability models and comment on its applicability. For all the proposed truncation techniques we present examples showing their usefulness.
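    As a rough illustration of the truncation idea (a hedged sketch, not the paper's technique), the following fragment approximates the steady-state reward of a birth-death Markov reward model by solving progressively larger truncations; the rates lam and mu, the truncation levels, and the population-count reward are all illustrative assumptions.

```python
# Sketch: steady-state reward of a truncated birth-death chain.
# For lam=0.7, mu=1.0 the truncated estimates converge to the
# untruncated M/M/1 mean population rho/(1-rho) = 7/3.
import numpy as np

def truncated_reward(lam, mu, K):
    """Expected steady-state reward of the chain truncated at K states."""
    Q = np.zeros((K, K))
    for i in range(K - 1):
        Q[i, i + 1] = lam          # arrival: state i -> i+1
        Q[i + 1, i] = mu           # service: state i+1 -> i
    np.fill_diagonal(Q, -Q.sum(axis=1))   # generator diagonal
    # Solve pi Q = 0 with sum(pi) = 1 as one overdetermined system.
    A = np.vstack([Q.T, np.ones(K)])
    b = np.zeros(K + 1)
    b[-1] = 1.0
    pi = np.linalg.lstsq(A, b, rcond=None)[0]
    reward = np.arange(K)          # reward rate = population in state i
    return pi @ reward

for K in (5, 10, 20, 40):
    print(K, truncated_reward(0.7, 1.0, K))
```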

    Evolutionary Algorithms for Reinforcement Learning

    There are two distinct approaches to solving reinforcement learning problems, namely, searching in value-function space and searching in policy space. Temporal-difference methods and evolutionary algorithms are well-known examples of these approaches. Kaelbling, Littman, and Moore recently provided an informative survey of temporal-difference methods. This article focuses on the application of evolutionary algorithms to the reinforcement learning problem, emphasizing alternative policy representations, credit-assignment methods, and problem-specific genetic operators. Strengths and weaknesses of the evolutionary approach to reinforcement learning are presented, along with a survey of representative applications.
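    A minimal sketch of policy-space search (a toy example under stated assumptions, not a method from the article): a population of linear policies is scored on a simple 1-D control task, the fittest are selected, and mutated copies form the next generation. The task dynamics, population size, and mutation scale are assumptions.

```python
# Sketch: evolutionary search directly in policy space.
import numpy as np

rng = np.random.default_rng(0)

def episode_return(w, steps=50):
    """Toy task: drive state x toward 0 with action a = w0*x + w1."""
    x, total = rng.normal(), 0.0
    for _ in range(steps):
        a = w[0] * x + w[1]
        x = x + 0.1 * a + 0.01 * rng.normal()   # simple noisy dynamics
        total += -(x ** 2)                       # reward: stay near 0
    return total

pop = rng.normal(size=(20, 2))                   # population of policies
for gen in range(30):
    fitness = np.array([episode_return(w) for w in pop])
    elite = pop[np.argsort(fitness)[-5:]]        # selection: keep top 5
    children = elite[rng.integers(5, size=15)]   # reproduce the elites
    pop = np.vstack([elite, children + 0.1 * rng.normal(size=(15, 2))])

print("best fitness:", max(episode_return(w) for w in pop))
```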

    Born to learn: The inspiration, progress, and future of evolved plastic artificial neural networks

    Biological plastic neural networks are systems of extraordinary computational capability shaped by evolution, development, and lifetime learning. The interplay of these elements leads to the emergence of adaptive behavior and intelligence. Inspired by such intricate natural phenomena, Evolved Plastic Artificial Neural Networks (EPANNs) use simulated evolution in silico to breed plastic neural networks with a large variety of dynamics, architectures, and plasticity rules: these artificial systems are composed of inputs, outputs, and plastic components that change in response to experiences in an environment. Such systems may autonomously discover novel adaptive algorithms and lead to hypotheses on the emergence of biological adaptation. EPANNs have seen considerable progress over the last two decades. Current scientific and technological advances in artificial neural networks are now setting the conditions for radically new approaches and results. In particular, the limitations of hand-designed networks could be overcome by more flexible and innovative solutions. This paper brings together a variety of inspiring ideas that define the field of EPANNs, reviews the main methods and results, and presents new opportunities and developments.
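    The core EPANN loop can be sketched as follows (illustrative assumptions throughout, not the paper's implementation): an outer evolutionary search optimizes the coefficients of a generalized Hebbian plasticity rule, delta_w = eta*(A*pre*post + B*pre + C*post + D), so that a one-weight network learns, within its lifetime, to copy its input. The task, population size, and mutation scale are assumptions.

```python
# Sketch: evolving a Hebbian plasticity rule rather than fixed weights.
import numpy as np

rng = np.random.default_rng(1)

def lifetime_fitness(genes, trials=30):
    """genes = (A, B, C, D, eta); the network must learn to copy its input."""
    A, B, C, D, eta = genes
    w = 0.0                                      # single plastic weight
    err = 0.0
    for _ in range(trials):
        pre = rng.choice([-1.0, 1.0])            # input signal
        post = np.tanh(w * pre)                  # network output
        err += (pre - post) ** 2                 # target: output = input
        w += eta * (A * pre * post + B * pre + C * post + D)
    return -err                                  # higher is better

pop = rng.normal(size=(30, 5))                   # population of rule genomes
for gen in range(40):
    fit = np.array([lifetime_fitness(g) for g in pop])
    elite = pop[np.argsort(fit)[-6:]]            # keep best 6 genomes
    children = elite[rng.integers(6, size=24)]
    pop = np.vstack([elite, children + 0.05 * rng.normal(size=(24, 5))])

print("best:", max(lifetime_fitness(g) for g in pop))
```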

    Sequential Decision-Making for Drug Design: Towards closed-loop drug design

    Drug design is a process of trial and error aimed at designing molecules with a desired response toward a biological target, with the ultimate goal of finding a new medication. It is estimated that up to 10^{60} molecules are of potential interest as drugs, making it difficult to find suitable molecules. A crucial part of drug design is deciding which molecules should be experimentally tested to determine their activity toward the biological target. To experimentally test the properties of a molecule, it first has to be successfully made, often requiring a sequence of reactions to obtain the desired product. Machine learning can be used to predict the outcome of a reaction, helping to find successful reactions, but it requires data for the reaction type of interest. This thesis presents work that investigates the use of active learning to acquire training data for reaching a given level of accuracy in predicting whether a reaction is successful. However, often only a limited number of molecules can be synthesized at a time. Another line of work in this thesis therefore investigates which designed molecules should be experimentally tested, given a budget of experiments, so as to sequentially acquire new knowledge. This is formulated as a multi-armed bandit problem, and we propose an algorithm to solve it. To suggest potential drug molecules to choose from, recent advances in machine learning have also enabled the use of generative models to design novel molecules with certain predicted properties. Previous work has formulated this as a reinforcement learning problem, with success in designing and optimizing molecules with drug-like properties. This thesis presents a systematic comparison of different reinforcement learning algorithms for string-based generation of drug molecules, including a study of different ways of learning from previous and current batches of samples during iterative generation.
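    As a hedged sketch of the bandit formulation (not the thesis's algorithm; the molecule activities and the noisy assay are simulated assumptions), a standard UCB1 rule allocates a fixed experimental budget between molecules that have looked active so far and molecules that remain uncertain.

```python
# Sketch: molecule selection under a budget as a multi-armed bandit (UCB1).
import numpy as np

rng = np.random.default_rng(2)
true_activity = rng.uniform(size=10)    # hidden activity per candidate molecule

counts = np.zeros(10)                   # experiments run per molecule
means = np.zeros(10)                    # observed mean assay outcome

for t in range(1, 101):                 # budget of 100 experiments
    if t <= 10:
        arm = t - 1                     # initialize: test each molecule once
    else:
        ucb = means + np.sqrt(2 * np.log(t) / counts)   # optimism bonus
        arm = int(np.argmax(ucb))
    reward = float(rng.random() < true_activity[arm])   # noisy binary assay
    counts[arm] += 1
    means[arm] += (reward - means[arm]) / counts[arm]   # running mean

print("most-tested molecule:", int(np.argmax(counts)),
      "| true best:", int(np.argmax(true_activity)))
```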