Search CORE

27,364 research outputs found

Human-Machine Collaborative Optimization via Apprenticeship Scheduling

Author: Golen Toni
Gombolay Matthew
Jensen Reed
Shah Julie
Shah Neel
Son Sung-Hyun
Stigile Jessica
Publication venue
Publication date: 10/05/2018
Field of study

Coordinating agents to complete a set of tasks with intercoupled temporal and resource constraints is computationally challenging, yet human domain experts can solve these difficult scheduling problems using paradigms learned through years of apprenticeship. A process for manually codifying this domain knowledge within a computational framework is necessary to scale beyond the ``single-expert, single-trainee" apprenticeship model. However, human domain experts often have difficulty describing their decision-making processes, causing the codification of this knowledge to become laborious. We propose a new approach for capturing domain-expert heuristics through a pairwise ranking formulation. Our approach is model-free and does not require enumerating or iterating through a large state space. We empirically demonstrate that this approach accurately learns multifaceted heuristics on a synthetic data set incorporating job-shop scheduling and vehicle routing problems, as well as on two real-world data sets consisting of demonstrations of experts solving a weapon-to-target assignment problem and a hospital resource allocation problem. We also demonstrate that policies learned from human scheduling demonstration via apprenticeship learning can substantially improve the efficiency of a branch-and-bound search for an optimal schedule. We employ this human-machine collaborative optimization technique on a variant of the weapon-to-target assignment problem. We demonstrate that this technique generates solutions substantially superior to those produced by human domain experts at a rate up to 9.5 times faster than an optimization approach and can be applied to optimally solve problems twice as complex as those solved by a human demonstrator.Comment: Portions of this paper were published in the Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI) in 2016 and in the Proceedings of Robotics: Science and Systems (RSS) in 2016. The paper consists of 50 pages with 11 figures and 4 table

arXiv.org e-Print Archive

DSpace@MIT

Coordinated replenishment systems with discount opportunities

Author: Duyn Schouten F.A. van der
Eijs M.J.G. van
Heuts R.M.J.
Publication venue
Publication date
Field of study

Inventory Control;Inventory Models;inkoop

Research Papers in Economics

Risk-sensitive Inverse Reinforcement Learning via Semi- and Non-Parametric Methods

Author: Lacotte Jonathan
Majumdar Anirudha
Pavone Marco
Singh Sumeet
Publication venue
Publication date: 01/01/2018
Field of study

The literature on Inverse Reinforcement Learning (IRL) typically assumes that humans take actions in order to minimize the expected value of a cost function, i.e., that humans are risk neutral. Yet, in practice, humans are often far from being risk neutral. To fill this gap, the objective of this paper is to devise a framework for risk-sensitive IRL in order to explicitly account for a human's risk sensitivity. To this end, we propose a flexible class of models based on coherent risk measures, which allow us to capture an entire spectrum of risk preferences from risk-neutral to worst-case. We propose efficient non-parametric algorithms based on linear programming and semi-parametric algorithms based on maximum likelihood for inferring a human's underlying risk measure and cost function for a rich class of static and dynamic decision-making settings. The resulting approach is demonstrated on a simulated driving game with ten human participants. Our method is able to infer and mimic a wide range of qualitatively different driving styles from highly risk-averse to risk-neutral in a data-efficient manner. Moreover, comparisons of the Risk-Sensitive (RS) IRL approach with a risk-neutral model show that the RS-IRL framework more accurately captures observed participant behavior both qualitatively and quantitatively, especially in scenarios where catastrophic outcomes such as collisions can occur.Comment: Submitted to International Journal of Robotics Research; Revision 1: (i) Clarified minor technical points; (ii) Revised proof for Theorem 3 to hold under weaker assumptions; (iii) Added additional figures and expanded discussions to improve readabilit

arXiv.org e-Print Archive

Princeton University Open Access Repository

Asymptotic Expansions for Stationary Distributions of Perturbed Semi-Markov Processes

Author: A. Hanen
A.F. Veinott Jr
A.N. Langville
B. Allen
B.D. Craven
B.L. Miller
B.N. Feinberg
C. Engström
C.D. Meyer
D. Blackwell
D.S. Kim
D.S. Silvestrov
D.S. Silvestrov
D.S. Silvestrov
D.S. Silvestrov
D.S. Silvestrov
E. Englund
E. Seneta
E. Seneta
F. Andersson
F. Chatelin
F. Delebecque
F. Grabski
G. Latouche
G. Yin
G.W. Stewart
H. Vantilborgh
H.A. Simon
I. Hanski
I. Marek
I. Marek
I.N. Kovalenko
I.N. Kovalenko
J. Janssen
J. Janssen
J.J. Hunter
J.J. Hunter
K. Avrachenkov
K.E. Avrachenkov
K.E. Avrachenkov
K.E. Avrachenkov
K.E. Avrachenkov
K.E. Avrachenkov
K.E. Avrachenkov
L. Takács
M. Abbad
M. Coderch
M. Gyllenberg
M. Gyllenberg
M. Gyllenberg
M. Gyllenberg
M. Haviv
M. Haviv
M. Haviv
M. Haviv
M. Haviv
M. Haviv
M. Petersson
M.V. Kartashov
P. Schweitzer
P.J. Courtois
P.J. Courtois
P.J. Schweitzer
P.J. Schweitzer
P.J. Schweitzer
P.J. Schweitzer
P.J. Schweitzer
P.V. Kokotović
R. Durrett
R. Hassin
R.G. Phillips
S.P. Meyn
U. Sumita
V.S. Koroliuk
V.S. Korolyuk
V.S. Korolyuk
V.S. Korolyuk
V.S. Korolyuk
V.V. Anisimov
W.K. Grassman
W.L. Cao
W.L. Smith
Y. Ni
Y. Ni
Z.A. Abadov
Publication venue: 'AIP Publishing'
Publication date: 12/03/2016
Field of study

New algorithms for computing of asymptotic expansions for stationary distributions of nonlinearly perturbed semi-Markov processes are presented. The algorithms are based on special techniques of sequential phase space reduction, which can be applied to processes with asymptotically coupled and uncoupled finite phase spaces.Comment: 83 page

arXiv.org e-Print Archive

Crossref

Improving statistical inference on pathogen densities estimated by quantitative molecular methods: malaria gametocytaemia as a case study

Author: A Gelman
André Lin Ouédraogo
B Hoadley
C Eisenhart
C Fraser
C Osborne
CE McCulloch
Cornelus Hermsen
D Lunn
DJ Lunn
DJ Spiegelhalter
DL Massart
EC Fieller
F Ferre
FA Graybill
G Marsaglia
GK Shulka
HG Niesters
HS Mubarak El
J Jiang
J Wain
JF Cohen
JH McCulloch
JM Ruijter
JN Miller
JO Beer De
JS Murray
JS Yuan
K Danzer
K Khairnar
KK Lai
LC Okell
LJ Naszódi
M Plummer
M Sivaganesan
M Woodhead
Martin Walker
María-Gloria Basáñez
NM Ferguson
P Schneider
P Schneider
P Vankeerberghen
PF Mens
R Wampfler
S Nadarajah
SA Bustin
SA Bustin
SA Fiscus
T Nolan
T Ponnudurai
TD Schmittgen
Teun Bousema
Thomas S Churcher
TS Churcher
World Health Organization
Y Karlen
YW Tang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

BACKGROUND: Quantitative molecular methods (QMMs) such as quantitative real-time polymerase chain reaction (q-PCR), reverse-transcriptase PCR (qRT-PCR) and quantitative nucleic acid sequence-based amplification (QT-NASBA) are increasingly used to estimate pathogen density in a variety of clinical and epidemiological contexts. These methods are often classified as semi-quantitative, yet estimates of reliability or sensitivity are seldom reported. Here, a statistical framework is developed for assessing the reliability (uncertainty) of pathogen densities estimated using QMMs and the associated diagnostic sensitivity. The method is illustrated with quantification of Plasmodium falciparum gametocytaemia by QT-NASBA. RESULTS: The reliability of pathogen (e.g. gametocyte) densities, and the accompanying diagnostic sensitivity, estimated by two contrasting statistical calibration techniques, are compared; a traditional method and a mixed model Bayesian approach. The latter accounts for statistical dependence of QMM assays run under identical laboratory protocols and permits structural modelling of experimental measurements, allowing precision to vary with pathogen density. Traditional calibration cannot account for inter-assay variability arising from imperfect QMMs and generates estimates of pathogen density that have poor reliability, are variable among assays and inaccurately reflect diagnostic sensitivity. The Bayesian mixed model approach assimilates information from replica QMM assays, improving reliability and inter-assay homogeneity, providing an accurate appraisal of quantitative and diagnostic performance. CONCLUSIONS: Bayesian mixed model statistical calibration supersedes traditional techniques in the context of QMM-derived estimates of pathogen density, offering the potential to improve substantially the depth and quality of clinical and epidemiological inference for a wide variety of pathogens

Crossref

LSHTM Research Online

Springer - Publisher Connector

PubMed Central

Spiral - Imperial College Digital Repository

Radboud Repository