Search CORE

23 research outputs found

Fall Prediction for New Sequences of Motions

Author: O Höhn
O Höhn
S Dalibard
S Kalyanakrishnan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 04/12/2015
Field of study

Abstract. Motions reinforce meanings in human-robot communication, when they are relevant and initiated at the right times. Given a task of using motions for an autonomous humanoid robot to communicate, different sequences of relevant motions are generated from the motion library. Each motion in the motion library is stable, but a sequence may cause the robot to be unstable and fall. We are interested in predicting if a sequence of motions will result in a fall, without executing the sequence on the robot. We contribute a novel algorithm, ProFeaSM, that uses only body angles collected during the execution of single motions and interpolations between pairs of motions, to predict whether a sequence will cause the robot to fall. We demonstrate the efficacy of ProFeaSM on the NAO humanoid robot in a real-time simulator, Webots, and on a real NAO and explore the trade-off between precision and recall

CiteSeerX

Crossref

Towards Rapid Multi-robot Learning from Demonstration at the RoboCup Competition

Author: A Merke
A Weitzenfeld
BD Argall
D Wilking
DC Bentivegna
GA Kaminka
I Noda
J Fountain
J Nakanishi
JC Zagal
K Sullivan
K Tuyls
M Hausknecht
M Oubbati
M Saggar
M Schwarz
P Stone
P Stone
S Kalyanakrishnan
S Luke
S Metzler
T Latzke
T Nakashima
U Visser
Y Takahashi
Y Takahashi
Ç Meriçli
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/12/2015
Field of study

Abstract. We describe our previous and current efforts towards achiev-ing an unusual personal RoboCup goal: to train a full team of robots directly through demonstration, on the field of play at the RoboCup venue, how to collaboratively play soccer, and then use this trained team in the competition itself. Using our method, HiTAB, we can train teams of collaborative agents via demonstration to perform nontrivial joint behaviors in the form of hierarchical finite-state automata. We discuss HiTAB, our previous efforts in using it in RoboCup 2011 and 2012, recent experimental work, and our current efforts for 2014, then suggest a new RoboCup Technical Challenge problem in learning from demonstration. Imagine that you are at an unfamiliar disaster site with a team of robots, and are faced with a previously unseen task for them to do. The robots have only rudimentary but useful utility behaviors implemented. You are not a programmer. Without coding them, you have only a few hours to get your robots doing useful collaborative work in this new environment. How would you do this

CiteSeerX

Crossref

Swarm robotics: a review from the swarm engineering perspective

Author: A. Campo
A. Campo
A. Campo
A. E. Turgut
A. E. Turgut
A. F. T. Winfield
A. F. T. Winfield
A. F. T. Winfield
A. F. T. Winfield
A. Galstyan
A. Giusti
A. Halász
A. Howard
A. Kolling
A. L. Christensen
A. L. Christensen
A. Martinoli
A. Martinoli
A. Naghsh
A. Okubo
A. Prorok
A. Rosenfeld
A. Scheidler
A. Stranieri
A. Turing
A. V. Getling
B. R. Donald
B. Shucker
B. Shucker
B. Varghese
B. Wang
C. A. C. Parker
C. Ampatzis
C. Ampatzis
C. Anderson
C. Dixon
C. M. Breder Jr.
C. Melhuish
C. Melhuish
C. Pinciroli
C. Pinciroli
C. R. Kube
C. W. Reynolds
D. E. Goldberg
D. G. Kendall
D. Grünbaum
D. Payton
E. Bahçeci
E. Bonabeau
E. Bonabeau
E. Ferrante
E. Ferrante
E. Tuci
E. Yang
E. Şahin
Eliseo Ferrante
F. Ducatelle
F. Ducatelle
F. Mondada
G. A. Di Caro
G. A. Kaminka
G. Baldassarre
G. Baldassarre
G. Baldassarre
G. Beni
G. Dudek
G. Francesca
G. Pini
G. Pini
G. Pini
G. Podevijn
G. Theraulaz
G. Theraulaz
H. Hamann
H. Hamann
H. Meinhardt
H. Çelikkanat
I. D. Couzin
J. Amé
J. Bachrach
J. Beal
J. H. Holland
J. H. Reif
J. K. Parrish
J. Kramer
J. L. Elman
J. Lee
J. McLurkin
J. Pugh
J. S. Langer
J. Wawerla
J. Werfel
J. Werfel
J. Werfel
J. Wessnitzer
J.-L. Deneubourg
K. Dantu
K. J. O’Hara
K. Lerman
K. Lerman
L. Bayindir
L. E. Parker
L. Iocchi
L. Li
L. P. Kaelbling
L. P. Kaelbling
L. Panait
M. A. Hsieh
M. A. Montes de Oca
M. Brambilla
M. Brambilla
M. Dorigo
M. Dorigo
M. Dorigo
M. Granovetter
M. J. B. Krieger
M. J. Matarić
M. J. Matarić
M. J. Matarić
M. Massink
M. Minsky
M. Riedmiller
M. Schwager
M. Waibel
Manuele Brambilla
Marco Dorigo
Mauro Birattari
N. Correll
N. Correll
N. Franks
N. Mathews
N. Mathews
O. Khatib
O. Soysal
O. Soysal
O. Soysal
P. Flocchini
P. Levi
P. M. Maxim
P. Stone
P.-P. Grassé
Q. Lindsey
R. A. Brooks
R. Abbott
R. Beckers
R. Brooks
R. D. Beer
R. Frigg
R. Groß
R. Groß
R. Groß
R. Jeanson
R. L. Stewart
R. O’Grady
R. O’Grady
R. O’Grady
R. S. Sutton
R. T. Vaughan
S. Berman
S. Berman
S. Berman
S. Camazine
S. Garnier
S. Garnier
S. Kalyanakrishnan
S. Kazadi
S. Konur
S. Nolfi
S. Nouyan
S. Nouyan
S. Yun
T. Balch
T. H. Labella
T. L. Fine
T. Schmickl
T. Stirling
V. Crespi
V. Gazi
V. Gazi
V. Gazi
V. Gazi
V. Gazi
V. Gazi
V. Sperati
V. Sperati
V. Trianni
V. Trianni
W. Agassounon
W. Liu
W. Liu
W. M. Spears
W. M. Spears
Y. Liu
Y. Liu
Y. U. Cao
Á. Gutiérrez
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Learning Agents

Author: KALYANAKRISHNAN S
Publication venue: IEEE COMPUTER SOC
Publication date: 01/01/2016
Field of study

Dspace at IIT Bombay

Skill Combination for Reinforcement Learning

Author: C. Watkins
M.L. Puterman
R. Sutton
S. Kalyanakrishnan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2007
Field of study

Crossref

On the classification of interactive user behaviour indices

Author: R. Kalyanakrishnan
S. V. Raghavan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date
Field of study

Crossref

Towards a Principled Solution to Simulated Robot Soccer

Author: M. Riedmiller
P. Stone
S. Kalyanakrishnan
S. Thrun
T. Gabel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2013
Field of study

Crossref

On Progress in RoboCup: The Simulation League Showcase

Author: E. Pagello
E. Pagello
I. Noda
O. Obst
S. Kalyanakrishnan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2011
Field of study

Crossref

PAC models in stochastic multi-objective multi-armed bandits

Author: Auer P.
Drugan M. M.
Kalyanakrishnan S.
Kaufmann E.
Szörényi B.
Zuluaga M.
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2017
Field of study

Many real-world applications, such as stock markets, energy consumption time series, and scheduling in noisy environments, are characterised by stochastic feedback. In this paper, the evolutionary multi-objective (EMO) techniques, like elitist selection strategies, and the probably approximatively correct (PAC) model are used to analyse the multi-armed bandits (MAB) paradigm that identifies the Pareto front from a finite set of arms with stochastic reward vectors. Each arm is associated with a confidence ball centred in the sampling's mean vector that decreases towards its true vector when the number of samples increases. The Pareto lower upper confidence bound algorithm samples the alternatives for which their confidence ball overlaps with the confidence regions of the Pareto optimal arms. Pareto racing deletes the arms classified with certainty as either suboptimal or Pareto optimal arms. The sample complexity estimates the number of samples required for an accurate approximation of the Pareto front using two different statistics, i.e. empirically determined means or quantiles. The analysed PAC models are empirically compared on realistic datasets with two and three objectives

Crossref

Repository TU/e

Pure OAI Repository