Search CORE

384 research outputs found

Recommended from our members

Multi-agent deep reinforcement learning for Robocup Rescue Simulator

Author: Goyal Animesh
Publication venue
Publication date: 20/09/2021
Field of study

Recent development in the field of Artificial Intelligence have dealt with building a winning strategy for video games where agents learn how to finish their task successfully using Deep Reinforcement Learning (DRL). The first major breakthrough came when Mnih et al. [22] showed how a DRL algorithm, termed Deep Q-Networks (DQN), can be applied to a collection of Atari 2600 games to surpass the performance of all previous algorithms and achieve a level that is comparable to a professional player. Their trained model received only raw pixels and game score as inputs to learn successful policies for single agents and was able to outperform professionals across a set of 49 Atari games. After a few years, focus shifted on training multiple agents using DRL, often known as multi-agent deep reinforcement learning (MADRL), for real time strategy games. Brockman et al. [4] achieved superhuman performance in the game of DOTA 2 which involves multi-agent collaboration, spatial and temporal reasoning, adversarial planning, and opponent modeling. Using Proximal Policy Optimization (PPO) algorithm and a LSTM layer as the primary component of the neural network, their trained model was able to defeat the human champion team, Team OG by 2:0. Most recently, Vinyals et al. [38] showed how a MADRL model can achieve grandmaster level in the game of StarCraft II. In this work, we apply MADRL to RoboCup Rescue Simulator (RCRS), which is part of the annual RoboCup Competition. RCRS is an open-source virtual environment that evaluates how effective multiple agents like ambulance team, police officer and fire brigades are in rescuing civilians and extinguishing fire from a city where an earthquake just happened. RCRS is challenging, easy to use and customize multi-agent scenario. In order to create RCRS environment where deep reinforcement learning algorithms can be tested, RCRS-gym, an open-source OpenAI Gym environment was developed. In this report, we have focused on training multiple fire brigades to collaboratively accomplish their task of extinguishing fire in the city. Fire Brigades were trained using two DRL algorithms: DQN and PPO. The performance of the algorithms was then compared with a greedy approach on two different map setting, "Small" map and "Big" map, each having different number of fire brigades and buildings. The agents were able to successfully finish their task of extinguishing fire on both map setting thus proving that RCRS is a suitable environment for developing deep reinforcement learning agent in a strategic multiagent game scenario. DQN outperformed PPO in the "Small" map setting while PPO outperformed a variant to DQN, H-DQN in the "Big" map setting. However, both the algorithms were not able to significantly outperform the greedy approach in either setting which opens up a promising avenue for future researchOperations Research and Industrial Engineerin

Texas ScholarWorks

Deep Q-Learning With Q-Matrix Transfer Learning for Novel Fire Evacuation Environment

Author: Andersen Per-Arne
Goodwin Morten
Granmo Ole-Christoffer
Sharma Jivitesh
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 29/05/2019
Field of study

Author's accepted manuscript.© 2020 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.acceptedVersio

arXiv.org e-Print Archive

NORA - Norwegian Open Research Archives

Agder University Research Archive

Advances in Deep Learning Towards Fire Emergency Application : Novel Architectures, Techniques and Applications of Neural Networks

Author: Sharma Jivitesh
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2020
Field of study

Paper IV is not published yet.With respect to copyright paper IV and paper VI was excluded from the dissertation.Deep Learning has been successfully used in various applications, and recently, there has been an increasing interest in applying deep learning in emergency management. However, there are still many significant challenges that limit the use of deep learning in the latter application domain. In this thesis, we address some of these challenges and propose novel deep learning methods and architectures. The challenges we address fall in these three areas of emergency management: Detection of the emergency (fire), Analysis of the situation without human intervention and finally Evacuation Planning. In this thesis, we have used computer vision tasks of image classification and semantic segmentation, as well as sound recognition, for detection and analysis. For evacuation planning, we have used deep reinforcement learning.publishedVersio

Agder University Research Archive

A NATURALISTIC COMPUTATIONAL MODEL OF HUMAN BEHAVIOR IN NAVIGATION AND SEARCH TASKS

Author: Perelman Brandon S.
Publication venue: Digital Commons @ Michigan Tech
Publication date: 01/01/2015
Field of study

Planning, navigation, and search are fundamental human cognitive abilities central to spatial problem solving in search and rescue, law enforcement, and military operations. Despite a wealth of literature concerning naturalistic spatial problem solving in animals, literature on naturalistic spatial problem solving in humans is comparatively lacking and generally conducted by separate camps among which there is little crosstalk. Addressing this deficiency will allow us to predict spatial decision making in operational environments, and understand the factors leading to those decisions. The present dissertation is comprised of two related efforts, (1) a set of empirical research studies intended to identify characteristics of planning, execution, and memory in naturalistic spatial problem solving tasks, and (2) a computational modeling effort to develop a model of naturalistic spatial problem solving. The results of the behavioral studies indicate that problem space hierarchical representations are linear in shape, and that human solutions are produced according to multiple optimization criteria. The Mixed Criteria Model presented in this dissertation accounts for global and local human performance in a traditional and naturalistic Traveling Salesman Problem. The results of the empirical and modeling efforts hold implications for basic and applied science in domains such as problem solving, operations research, human-computer interaction, and artificial intelligence

Michigan Technological University

Logistics and decision chains in disaster management; Floods in Austria / Case simulation: Krems-Mautern an der Donau

Author: Larraya Eraso Francisco Javier
Publication venue
Publication date: 01/01/2015
Field of study

The main purpose of this final master thesis is to analyse the analysis of the chain of decisions concerning a disaster situation as well as its consequences. Furthermore, the logistical issues concerning resource distribution in case of a flooding event are investigated. A particular disaster event situated in Lower Austria is used, due to the fact that the final master thesis is developed in Austria. Concerning this main objective, the study begins at an international level, where cooperation and coordination of several organizations are studied, to focus afterwards on Austrian organization. A case study is conducted to compare practice and guidelines regarding the main decision chains. In addition to this, a hypothetical situation is generated in terms of deciding and finding out which the best logistical decision is as far as it concerns the satisfaction of the population. The main objective is to reduce the number of unsatisfied customers to the minimum and to guarantee the minimum stocks to the local distribution centres. This operation is restricted to a limited governmental budget. Therefore, it is necessary to explain all the cooperating and coordinating organizations involved in a disaster situation. The simulation and subsequent conclusions are carried out with the simulation software AnyLogic. The simulation problem has not only the restriction of the available budget bounded to the supplying chain, but also technical (resources, trucks) constraints. The main objective is to minimize the number of unsatisfied people in the area. The final master thesis is developed with the collaboration of the BOKU University (University of Natural Resources and Life Science) of Vienna, in particular with the Institute of Production and Logistics. In contrast to the first theoretical part, the simulation offers a very wide range of possibilities to model different scenarios. It is a very dynamic and interesting procedure, capable itself to give important results. In this final master thesis, four different scenarios are simulated, implying each of them different constraints. In each of the scenarios the total demand of each store is given as well as the number of total satisfied customers and units in stock. Furthermore, the results obtained are very useful to get quite an exact approach to the real situation of a flood. In addition to this, technical parameters like the needed stock to satisfy the demand or the optimal number of trucks to ship stock are established. All the results are compared within different graphics and charts, finding out the best solution in each simulated scenario. In summary, and due to the lack of studies around this topic, this document becomes very interesting and a useful tool to start within the study of natural hazards.Ingeniería IndustrialIndustria Ingeniaritz

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Academica-e

Agent-Driven Representations, Algorithms, and Metrics for Automated Organizational Design.

Author: Sleight Jason Lee
Publication venue
Publication date: 01/01/2015
Field of study

As cooperative multiagent systems (MASs) increase in interconnectivity, complexity, size, and longevity, coordinating the agents' reasoning and behaviors becomes increasingly difficult. One approach to address these issues is to use insights from human organizations to design structures within which the agents can more efficiently reason and interact. Generally speaking, an organization influences each agent such that, by following its respective influences, an agent can make globally-useful local decisions without having to explicitly reason about the complete joint coordination problem. For example, an organizational influence might constrain and/or inform which actions an agent performs. If these influences are well-constructed to be cohesive and correlated across the agents, then each agent is influenced into reasoning about and performing only the actions that are appropriate for its (organizationally-designated) portion of the joint coordination problem. In this dissertation, I develop an agent-driven approach to organizations, wherein the foundation for representing and reasoning about an organization stems from the needs of the agents in the MAS. I create an organizational specification language to express the possible ways in which an organization could influence the agents' decision making processes, and leverage details from those decision processes to establish quantitative, principled metrics for organizational performance based on the expected impact that an organization will have on the agents' reasoning and behaviors. Building upon my agent-driven organizational representations, I identify a strategy for automating the organizational design process~(ODP), wherein my ODP computes a quantitative description of organizational patterns and then searches through those possible patterns to identify an (approximately) optimal set of organizational influences for the MAS. Evaluating my ODP reveals that it can create organizations that both influence the MAS into effective patterns of joint policies and also streamline the agents' decision making in a coordinate manner. Finally, I use my agent-driven approach to identify characteristics of effective abstractions over organizational influences and a heuristic strategy for converging on a good abstraction.PhDComputer Science and EngineeringUniversity of Michigan, Horace H. Rackham School of Graduate Studieshttp://deepblue.lib.umich.edu/bitstream/2027.42/113616/1/jsleight_1.pd

Deep Blue Documents at the University of Michigan

Governance and Community Responses to Floods in Poor Peri-urban Areas:The case of Urban Disaster Risk Reduction and Climate Change Adaptation in Pikine, Senegal

Author: Schaer Caroline
Publication venue: UNEP DTU Partnership
Publication date: 01/01/2015
Field of study

Online Research Database In Technology

CBR and MBR techniques: review for an application in the emergencies domain

Author: Merida-Campos Carlos
Rollón Rico Emma
Publication venue
Publication date: 01/01/2003
Field of study

The purpose of this document is to provide an in-depth analysis of current reasoning engine practice and the integration strategies of Case Based Reasoning and Model Based Reasoning that will be used in the design and development of the RIMSAT system. RIMSAT (Remote Intelligent Management Support and Training) is a European Commission funded project designed to: a.. Provide an innovative, 'intelligent', knowledge based solution aimed at improving the quality of critical decisions b.. Enhance the competencies and responsiveness of individuals and organisations involved in highly complex, safety critical incidents - irrespective of their location. In other words, RIMSAT aims to design and implement a decision support system that using Case Base Reasoning as well as Model Base Reasoning technology is applied in the management of emergency situations. This document is part of a deliverable for RIMSAT project, and although it has been done in close contact with the requirements of the project, it provides an overview wide enough for providing a state of the art in integration strategies between CBR and MBR technologies.Postprint (published version

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

UPCommons. Portal del coneixement obert de la UPC