Search CORE

33 research outputs found

Graphical models for interactive POMDPs: representations and solutions

Author: B. Adam
C. Camerer
D. Dennett
D. Fudenberg
E. Fehr
J. MacQueen
J. Pineau
J.A. Tatman
J.C. Harsanyi
J.F. Mertens
J.M. Charnes
L. Kaelbling
P. Gmytrasiewicz
P.J. Gmytrasiewicz
Prashant Doshi
Qiongyu Chen
R. Smallwood
R.D. Shachter
R.J. Aumann
Yifeng Zeng
Publication venue
Publication date: 01/01/2008
Field of study

We develop new graphical representations for the problem of sequential decision making in partially observable multiagent environments, as formalized by interactive partially observable Markov decision processes (I-POMDPs). The graphical models called interactive inf uence diagrams (I-IDs) and their dynamic counterparts, interactive dynamic inf uence diagrams (I-DIDs), seek to explicitly model the structure that is often present in real-world problems by decomposing the situation into chance and decision variables, and the dependencies between the variables. I-DIDs generalize DIDs, which may be viewed as graphical representations of POMDPs, to multiagent settings in the same way that IPOMDPs generalize POMDPs. I-DIDs may be used to compute the policy of an agent given its belief as the agent acts and observes in a setting that is populated by other interacting agents. Using several examples, we show how I-IDs and I-DIDs may be applied and demonstrate their usefulness. We also show how the models may be solved using the standard algorithms that are applicable to DIDs. Solving I-DIDs exactly involves knowing the solutions of possible models of the other agents. The space of models grows exponentially with the number of time steps. We present a method of solving I-DIDs approximately by limiting the number of other agents’ candidate models at each time step to a constant. We do this by clustering models that are likely to be behaviorally equivalent and selecting a representative set from the clusters. We discuss the error bound of the approximation technique and demonstrate its empirical performance

CiteSeerX

Crossref

Teeside University's Research Repository

VBN

ScholarBank@NUS

Avaliação da eficiência da gestão dos serviços municipais de abastecimento de água e esgotamento sanitário utilizandoData Envelopment Analysis

Author: CARMO C.M.
CESCONETTO A
CHARNES A
CHARNES A
Dirceu Scaratti
FARIA F.P
FARRELL M. J
FÄRE R
Gidiane Scaratti
GONÇALVES A.C
GRIGOLIN R.
KIRIGIA J.M
LOBO M.S.C
LOUREIRO A.L.
MURRAY C.J.L
PINILLOS M
RETZLAFF-ROBERTS D
SCARATTI D
THANASSOULIS E
TUPPER H.C
William Michelon
YU Y
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Crossref

Scope for the Application of Mathematical Programming Techniques in the Synthesis and Planning of Sustainable Processes

Author: Birge J.R.
Charnes A.
Deb K.
Douglas J.M.
Ehrgott M.
El-Halwagi MM.
Floudas C.A.
Freeman H.
Graves S.C.
Linderoth J.
Pistikopoulos E.N.
Rudd D. F.
Ruszczyński A.
Tawarmalani M.
Uryasev S.
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Estudo da sustentabilidade agrícola em município amazônico com análise envoltória de dados

Author: Abay C.
Adler N.
Ali M.
Allen R.
Angulo Meza L.
Angulo Meza L.
Avkiran N.K.
Banker R.D.
Barreto R.C.S.
Batistella M.
Batistella M.
Batistella M.
Battese G.E.
Bosetti V.
Bravo-Ureta B.E.
Bravo-Ureta B.E.
Carpenter R.A.
Castilla R.E.F.
Charlwood J.D.
Charnes A.
Coelli T.J.
Cooper W.W.
Dale V.H.
De Koeijer T.J.
Edwards C.A.
Ehlers E.
Eliane Gonçalves Gomes
Fernandes L.A.O.
Färe R.
Gomes E.G.
Gomes E.G.
Gomes I.
Herendeen R.A.
João Alfredo de Carvalho Mangabeira
João Carlos Correia Baptista Soares de Mello
Lins M.P.
Lins M.P.E.
Lins M.P.E.
Lopes S.B.
López-Ridaura S.
Mangabeira J.A.C.
Marzall K.
Melgarejo L.
Miranda E.E.
Moran E.F.
Pacini C.
Pannell D.J.
Parra-Lopez C.
Pereira M.F.
Podinovski V.V.
Praneetvatakul S.
Pretty J.
Rasul G.
Rigby D.
Rios L.R
Rodríguez-Díaz J.A.
Sachs I.
Sachs I.
Sauer J.
Senra L.F.A.C.
Soares de Mello J.C.C.B.
Souza-Santos R.
Sydenstricker J.M.
Thanassoulis E.
Thiam A.
Thompson R.G.
Toresan L.
Veiga J.E.
Von Wirén-Lehr S.
Zhen L.
Publication venue: 'FapUNIFESP (SciELO)'
Publication date
Field of study

Crossref

Five-stage Procedure for the Evaluation of Simulation Models through Statistical Techniques

Author: Brunner D.T.
Charnes J.M.
Kleijnen J.P.C.
Morrice D.J.
Swain J.J.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/1996
Field of study

Regression Metamodels and Design of Experiments

Author: Brunner D.T.
Charnes J.M.
Kleijnen J.P.C.
Morrice D.J.
Swain J.J.
van Groenendaal W.J.H.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/1996
Field of study

Sample-Path Solution of Stochastic Variational Inequalities, with Applications to Option Pricing

Author: Brunner D.T.
Charnes J.M.
Gürkan G.
Morrice D.M.
Ozge A.Y.
Robinson S.M.
Swain J.J.
Publication venue: IEEE, Piscataway
Publication date: 01/01/1996
Field of study

Validation of trace-driven simulation models: Regression analysis revisited

Author: Bettonvil B.W.M.
Brunner D.T.
Charnes J.M.
Kleijnen J.P.C.
Morrice D.J.
Swain J.J.
van Groenendaal W.J.H.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/1996
Field of study

Estimating buffer overflows in three stages using cross-entropy

Author: Charnes J.M.
Chen C.H.
de Boer Pieter-Tjerk
Kroese Dirk
Rubinstein R.Y.
Snowdon J.L.
Yücesan E.
Publication venue: ACM-IEEE
Publication date: 01/01/2002
Field of study

In this paper we propose a fast adaptive importance sampling method for the efficient simulation of buffer overflow probabilities in queueing networks. The method comprises three stages. First we estimate the minimum cross-entropy tilting parameter for a small buffer level; next, we use this as a starting value for the estimation of the optimal tilting parameter for the actual (large) buffer level; finally, the tilting parameter just found is used to estimate the overflow probability of interest. We recognize three distinct properties of the method which together explain why the method works well; we conjecture that they hold for quite general queueing networks. Numerical results support this conjecture and demonstrate the high efficiency of the proposed algorithm

Crossref

University of Twente Research Information

University of Queensland eSpace

Response surface methodology revisited

Author: Angun M.E.
Charnes J.M.
Chen C.H.
den Hertog D.
Gürkan G.
Kleijnen J.P.C.
Snowdon J.L.
Yucesan E.
Publication venue: Kolegium Europy Wschodniej im. Jana Nowaka-Jezioranskiego
Publication date: 01/01/2002
Field of study