35 research outputs found

Exploiting Submodular Value Functions for Scaling Up Active Perception

Author: Oliehoek F.
Satsangi Y.
Spaan M.T.J.
Whiteson S.
Publication venue: Springer Science and Business Media LLC
Publication date: 01/02/2018

International Migration, Integration and Social Cohesion online publications

General-Sum Multi-Agent Continuous Inverse Optimal Control

Author: Gavrila D
Neumeyer C
Oliehoek F
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 18/02/2021

Modelling possible future outcomes of robot-human interactions is of importance in the intelligent vehicle and mobile robotics domains. Knowing the reward function that explains the observed behaviour of a human agent is advantageous for modelling the behaviour with Markov Decision Processes (MDPs). However, learning the rewards that determine the observed actions from data is complicated by interactions. We present a novel inverse reinforcement learning (IRL) algorithm that can infer the reward function in multi-agent interactive scenarios. In particular, the agents may be boundedly rational (i.e., sub-optimal), a characteristic that is typical of human decision making. Additionally, every agent optimizes its own reward function, which makes it possible to address non-cooperative setups. In contrast to other methods, the algorithm does not rely on reinforcement learning during inference of the parameters of the reward function. We demonstrate that our proposed method accurately infers the ground truth reward function in two-agent interactive experiments.

University of Liverpool Repository

Bayesian RL in factored POMDPs

Author: Amato C
Katt S
Oliehoek F
Publication date: 01/01/2019

Robust decision-making agents in any non-trivial system must reason over uncertainty of various types such as action outcomes, the agent's current state and the dynamics of the environment. The outcome and state uncertainty are elegantly captured by the Partially Observable Markov Decision Process (POMDP) framework [1], which enables reasoning in stochastic, partially observable environments. POMDP solution methods, however, typically assume complete access to the system dynamics, which unfortunately are often not available. When such a model is not available, model-based Bayesian Reinforcement Learning (BRL) methods explicitly maintain a posterior over the possible models of the environment, and use this knowledge to select actions that, theoretically, trade off exploration and exploitation optimally. However, few of the BRL methods are applicable to partially observable settings, and those that are have limited scaling properties. The Bayes-Adaptive POMDP (BA-POMDP) [4], for example, models the environment in a tabular fashion, which poses a bottleneck for scalability. Here, we describe previous work [3] that proposes a method to overcome this bottleneck by representing the dynamics with a Bayes Network, an approach that exploits structure in the form of independence between state and observation features.

University of Liverpool Repository

TU Delft Repository

Local Communication Protocols for Learning Complex Swarm Behaviors with Deep Reinforcement Learning

Author: A Martinoli
C Kube
C Moeslinger
F Arvin
FA Oliehoek
J Foerster
JK Gupta
L Bayındır
N Correll
P Basu
S Nouyan
V Mnih
Publication date: 01/01/2018

Swarm systems constitute a challenging problem for reinforcement learning (RL) as the algorithm needs to learn decentralized control policies that can cope with limited local sensing and communication abilities of the agents. While it is often difficult to directly define the behavior of the agents, simple communication protocols can be defined more easily using prior knowledge about the given task. In this paper, we propose a number of simple communication protocols that can be exploited by deep reinforcement learning to find decentralized control policies in a multi-robot swarm environment. The protocols are based on histograms that encode the local neighborhood relations of the agents and can also transmit task-specific information, such as the shortest distance and direction to a desired target. In our framework, we use an adaptation of Trust Region Policy Optimization to learn complex collaborative tasks, such as formation building and building a communication link. We evaluate our findings in a simulated 2D-physics environment, and compare the implications of different communication protocols. Comment: 13 pages, 4 figures, version 2, accepted at ANTS 201

arXiv.org e-Print Archive

TUbiblio

Crossref

Multiagent Sequential Decision Making (MSDM)

Author: F A Oliehoek
M T J
Publication date: 01/01/2008

CiteSeerX

A Research Agenda for Hybrid Intelligence: Augmenting Human Intellect With Collaborative, Adaptive, Responsible, and Explainable Artificial Intelligence

Author: Akata Z.
Balliet D.
de Rijke M.
Dignum F.
Dignum V.
Eiben G.
Fokkens A.
Grossi D.
Hindriks K.
Hoos H.
Hung H.
Jonker C.
Monz C.
Neerincx M.
Oliehoek F.
Prakken H.
Schlobach S.
van der Gaag L.
van Harmelen F.
van Hoof H.
van Riemsdijk B.
van Wynsberghe A.
Verbrugge R.
Verheij B.
Vossen P.
Welling M.
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/08/2020

International Migration, Integration and Social Cohesion online publications

Interspecific Germline Transmission of Cultured Primordial Germ Cells

Author: AW Blackler
C Liu
CH Liu
Daniel R. Lu
Ellen J. Collarini
EM McCarthy
F Pitel
G Reynaud
H Li
J Macdonald
Jeffrey Fesler
M-C Van De Lavoir
Marie-Cecile van de Lavoir
Ono T
Osman El-Maarri
PA Leighton
PA Oliehoek
Philip A. Leighton
Robert J. Etches
S Ishiguro
S Pardue
SJ Kang
T Saito
T. S. Thiyagasundaram
U Wernery
V Hamburger
William D. Harriman
Y Takeuchi
Publication venue: Public Library of Science
Publication date: 01/01/2012

In birds, the primordial germ cell (PGC) lineage separates from the soma within 24 h following fertilization. Here we show that the endogenous population of about 200 PGCs from a single chicken embryo can be expanded one million fold in culture. When cultured PGCs are injected into a xenogeneic embryo at an equivalent stage of development, they colonize the testis. At sexual maturity, these donor PGCs undergo spermatogenesis in the xenogeneic host and become functional sperm. Insemination of semen from the xenogeneic host into females from the donor species produces normal offspring from the donor species. In our model system, the donor species is chicken (Gallus domesticus) and the recipient species is guinea fowl (Numida meleagris), a member of a different avian family, suggesting that the mechanisms controlling proliferation of the germline are highly conserved within birds. From a pragmatic perspective, these data are the basis of a novel strategy to produce endangered species of birds using domesticated hosts that are both tractable and fecund.

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Development and application of genomic control methods for genome-wide association studies using non-additive models

Author: A Bittles
AL Price
B Devlin
Cornelia M. van Duijn
D Kobayashi
F Liu
G Zheng
H-E Wichmann
Harald Grallert
J Dupuis
J Liu
J Yu
Janina S. Ried
JK Pritchard
Konstantin Strauch
Lin Chen
M Kolz
P Gorroochurn
PA Oliehoek
PIW De Bakker
PJ McLaren
S-A Bacanu
T Dadd
T Yan
Tatiana I. Axenovich
W-M Chen
Y Zang
Yakov A. Tsepilov
YS Aulchenko
Yurii S. Aulchenko
Publication venue: Public Library of Science (PLoS)
Publication date: 01/01/2013

Genome-wide association studies (GWAS) comprise a powerful tool for mapping genes of complex traits. However, an inflation of the test statistic can occur because of population substructure or cryptic relatedness, which could cause spurious associations. If information on a large number of genetic markers is available, adjusting the analysis results by using the method of genomic control (GC) is possible. GC was originally proposed to correct the Cochran-Armitage additive trend test. For non-additive models, correction has been shown to depend on allele frequencies. Therefore, usage of GC is limited to situations where allele frequencies of null markers and candidate markers are matched. In this work, we extended the capabilities of the GC method for non-additive models, which allows us to use null markers with arbitrary allele frequencies for GC. Analytical expressions for the inflation of a test statistic describing its dependency on allele frequency and several population parameters were obtained for recessive, dominant, and over-dominant models of inheritance. We proposed a method to estimate these required population parameters. Furthermore, we suggested a GC method based on approximation of the correction coefficient by a polynomial of allele frequency and described procedures to correct the genotypic (two degrees of freedom) test for cases when the model of inheritance is unknown. Statistical properties of the described methods were investigated using simulated and real data. We demonstrated that all considered methods were effective in controlling type 1 error in the presence of genetic substructure. The proposed GC methods can be applied to statistical tests for GWAS with various models of inheritance. All methods developed and tested in this work were implemented in the R language as a part of the GenABEL package.

Crossref

Directory of Open Access Journals

PubMed Central

EUR Research Repository

Edinburgh Research Explorer

Erasmus University Digital Repository

PuSH

The Francis Crick Institute

Q-value Heuristics for Approximate Solutions of Dec-POMDPs

Author: Oliehoek F. A.
Vlassis Nikos
Publication date: 01/01/2007

Open Repository and Bibliography - Luxembourg

International Migration, Integration and Social Cohesion online publications