Search CORE

5,364 research outputs found

Advances in the Hierarchical Emergent Behaviors (HEB) approach to autonomous vehicles

Author: Milito Rodolfo
Nemirovsky Mario
Roca Damian
Valero Cortés Mateo
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2020
Field of study

Widespread deployment of autonomous vehicles (AVs) presents formidable challenges in terms on handling scalability and complexity, particularly regarding vehicular reaction in the face of unforeseen corner cases. Hierarchical Emergent Behaviors (HEB) is a scalable architecture based on the concepts of emergent behaviors and hierarchical decomposition. It relies on a few simple but powerful rules to govern local vehicular interactions. Rather than requiring prescriptive programming of every possible scenario, HEB’s approach relies on global behaviors induced by the application of these local, well-understood rules. Our first two papers on HEB focused on a primal set of rules applied at the first hierarchical level. On the path to systematize a solid design methodology, this paper proposes additional rules for the second level, studies through simulations the resultant richer set of emergent behaviors, and discusses the communica-tion mechanisms between the different levels.Peer ReviewedPostprint (author's final draft

UPCommons. Portal del coneixement obert de la UPC

Hi-Val: Iterative Learning of Hierarchical Value Functions for Policy Generation

Author: D Silver
D Silver
G Chowdhary
G Konidaris
J Hostetler
Levente Kocsis
M Jun
P Auer
RS Sutton
TG Dietterich
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

Task decomposition is effective in manifold applications where the global complexity of a problem makes planning and decision-making too demanding. This is true, for example, in high-dimensional robotics domains, where (1) unpredictabilities and modeling limitations typically prevent the manual specification of robust behaviors, and (2) learning an action policy is challenging due to the curse of dimensionality. In this work, we borrow the concept of Hierarchical Task Networks (HTNs) to decompose the learning procedure, and we exploit Upper Confidence Tree (UCT) search to introduce HOP, a novel iterative algorithm for hierarchical optimistic planning with learned value functions. To obtain better generalization and generate policies, HOP simultaneously learns and uses action values. These are used to formalize constraints within the search space and to reduce the dimensionality of the problem. We evaluate our algorithm both on a fetching task using a simulated 7-DOF KUKA light weight arm and, on a pick and delivery task with a Pioneer robot

Crossref

Archivio della ricerca- Università di Roma La Sapienza

Mobility Study for Named Data Networking in Wireless Access Networks

Author: Azgin Aytac
Ravindran Ravishankar
Wang Guoqiang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 20/06/2014
Field of study

Information centric networking (ICN) proposes to redesign the Internet by replacing its host-centric design with information-centric design. Communication among entities is established at the naming level, with the receiver side (referred to as the Consumer) acting as the driving force behind content delivery, by interacting with the network through Interest message transmissions. One of the proposed advantages for ICN is its support for mobility, by de-coupling applications from transport semantics. However, so far, little research has been conducted to understand the interaction between ICN and mobility of consuming and producing applications, in protocols purely based on information-centric principles, particularly in the case of NDN. In this paper, we present our findings on the mobility-based performance of Named Data Networking (NDN) in wireless access networks. Through simulations, we show that the current NDN architecture is not efficient in handling mobility and architectural enhancements needs to be done to fully support mobility of Consumers and Producers.Comment: to appear in IEEE ICC 201

arXiv.org e-Print Archive

CiteSeerX

Crossref

Robotic Wireless Sensor Networks

Author: A Balasubramanian
A Chattopadhyay
A Fida
A Gasparri
A Gasparri
A Ghaffarkhah
A Gonzalez-Ruiz
A Gonzalez-Ruiz
A Sanfeliu
A Tiderko
AM Hsieh
AM Ladd
B Mohar
C Dixon
C Lochert
CE Perkins
CR Lin
D Calkins
D Son
D Tardioli
DV Dimarogonas
DV Dimarogonas
E Aahin
E Prassler
EA Thompson
EW Dijkstra
F Knorn
FT Dagefu
G Holland
G Sun
G Tuna
H Choset
HG Nguyen
Hui Liu
I Guvenc
J Baber
J Cortes
J Fink
J Ny Le
J Penders
J Zhou
JB Petelin
JC Curlander
JG Dai
JK Erickson
JR Pinta De La
K Fall
K Kamei
K Konolige
K Savla
KA Qaraqe
L Oliveira
L Sabattini
L Tassiulas
LE Parker
M Fiedler
M Franceschelli
M Guo
M Malmirchegini
M Mauve
M Mauve
M Michael
M Michael
M Michael
M Michael
M Naghshvar
M Saumitra
M Schuresko
MA Batalin
MA Hsieh
MD Weiss
N Bezzo
N Boillot
N Hazon
N Xiong
NP Papanikolopoulos
O Tekdas
P Brass
P Wilke
P Yang
P Yang
P. Ibach
Pradipta Ghosh
Pradipta Ghosh
PX Liu
Q Dong
R Olfati-Saber
RC Arkin
RR Murphy
RR Murphy
S Depatla
S Gil
S Manfredi
S Thrun
S Wang
SJ Lee
SR Theodore
T Gustavi
V Milanés
Y Mostofi
Y Mostofi
Y Mostofi
Y Mostofi
Y Uchimura
Y Yan
Y Yan
Z Lin
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/09/2018
Field of study

In this chapter, we present a literature survey of an emerging, cutting-edge, and multi-disciplinary field of research at the intersection of Robotics and Wireless Sensor Networks (WSN) which we refer to as Robotic Wireless Sensor Networks (RWSN). We define a RWSN as an autonomous networked multi-robot system that aims to achieve certain sensing goals while meeting and maintaining certain communication performance requirements, through cooperative control, learning and adaptation. While both of the component areas, i.e., Robotics and WSN, are very well-known and well-explored, there exist a whole set of new opportunities and research directions at the intersection of these two fields which are relatively or even completely unexplored. One such example would be the use of a set of robotic routers to set up a temporary communication path between a sender and a receiver that uses the controlled mobility to the advantage of packet routing. We find that there exist only a limited number of articles to be directly categorized as RWSN related works whereas there exist a range of articles in the robotics and the WSN literature that are also relevant to this new field of research. To connect the dots, we first identify the core problems and research trends related to RWSN such as connectivity, localization, routing, and robust flow of information. Next, we classify the existing research on RWSN as well as the relevant state-of-the-arts from robotics and WSN community according to the problems and trends identified in the first step. Lastly, we analyze what is missing in the existing literature, and identify topics that require more research attention in the future

arXiv.org e-Print Archive

Crossref

Combining Planning and Deep Reinforcement Learning in Tactical Decision Making for Autonomous Driving

Author: Driggs-Campbell Katherine
Hoel Carl-Johan
Kochenderfer Mykel J.
Laine Leo
Wolff Krister
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2019
Field of study

Tactical decision making for autonomous driving is challenging due to the diversity of environments, the uncertainty in the sensor information, and the complex interaction with other road users. This paper introduces a general framework for tactical decision making, which combines the concepts of planning and learning, in the form of Monte Carlo tree search and deep reinforcement learning. The method is based on the AlphaGo Zero algorithm, which is extended to a domain with a continuous state space where self-play cannot be used. The framework is applied to two different highway driving cases in a simulated environment and it is shown to perform better than a commonly used baseline method. The strength of combining planning and learning is also illustrated by a comparison to using the Monte Carlo tree search or the neural network policy separately

arXiv.org e-Print Archive

Chalmers Research

Fourteenth Biennial Status Report: März 2017 - February 2019

Author
Publication venue: Max-Planck-Institut für Informatik
Publication date: 01/01/2019
Field of study

MPG.PuRe