Search CORE

3,217 research outputs found

Portfolio-based Planning: State of the Art, Common Practice and Open Challenges

Author: Auer
Bäckström
Censor
Chen
Coles
Fern
Gerevini
Gerevini
Gomes
Helmert
Hoffmann
Howe
Huberman
Hutter
Korf
López
Nissim
Rice
Richter
Roberts
Xu
Publication venue: 'IOS Press'
Publication date: 18/09/2015
Field of study

In recent years the field of automated planning has significantly advanced and several powerful domain-independent planners have been developed. However, none of these systems clearly outperforms all the others in every known benchmark domain. This observation motivated the idea of configuring and exploiting a portfolio of planners to perform better than any individual planner: some recent planning systems based on this idea achieved significantly good results in experimental analysis and International Planning Competitions. Such results let us suppose that future challenges of the Automated Planning community will converge on designing different approaches for combining existing planning algorithms. This paper reviews existing techniques and provides an exhaustive guide to portfolio-based planning. In addition, the paper outlines open issues of existing approaches and highlights possible future evolution of these techniques

Crossref

University of Huddersfield Repository

Huddersfield Research Portal

A reusable iterative optimization software library to solve combinatorial problems with approximate reasoning

Author: Raggl Andreas
Slany Wolfgang
Publication venue
Publication date: 31/08/1998
Field of study

Real world combinatorial optimization problems such as scheduling are typically too complex to solve with exact methods. Additionally, the problems often have to observe vaguely specified constraints of different importance, the available data may be uncertain, and compromises between antagonistic criteria may be necessary. We present a combination of approximate reasoning based constraints and iterative optimization based heuristics that help to model and solve such problems in a framework of C++ software libraries called StarFLIP++. While initially developed to schedule continuous caster units in steel plants, we present in this paper results from reusing the library components in a shift scheduling system for the workforce of an industrial production plant.Comment: 33 pages, 9 figures; for a project overview see http://www.dbai.tuwien.ac.at/proj/StarFLIP

arXiv.org e-Print Archive

Elsevier - Publisher Connector

Classical Planning in Deep Latent Space

Author: Asai Masataro
Fukunaga Alex
Kajino Hiroshi
Muise Christian
Publication venue
Publication date: 30/06/2021
Field of study

Current domain-independent, classical planners require symbolic models of the problem domain and instance as input, resulting in a knowledge acquisition bottleneck. Meanwhile, although deep learning has achieved significant success in many fields, the knowledge is encoded in a subsymbolic representation which is incompatible with symbolic systems such as planners. We propose Latplan, an unsupervised architecture combining deep learning and classical planning. Given only an unlabeled set of image pairs showing a subset of transitions allowed in the environment (training inputs), Latplan learns a complete propositional PDDL action model of the environment. Later, when a pair of images representing the initial and the goal states (planning inputs) is given, Latplan finds a plan to the goal state in a symbolic latent space and returns a visualized plan execution. We evaluate Latplan using image-based versions of 6 planning domains: 8-puzzle, 15-Puzzle, Blocksworld, Sokoban and Two variations of LightsOut.Comment: Under review at Journal of Artificial Intelligence Research (JAIR

arXiv.org e-Print Archive

Planning through Automatic Portfolio Configuration: The PbP Approach

Author: Gerevini Alfonso Emilio
Saetti Alessandro
Vallati Mauro
Publication venue: 'AI Access Foundation'
Publication date: 01/01/2014
Field of study

In the field of domain-independent planning, several powerful planners implementing different techniques have been developed. However, no one of these systems outperforms all others in every known benchmark domain. In this work, we propose a multi-planner approach that automatically configures a portfolio of planning techniques for each given domain. The configuration process for a given domain uses a set of training instances to: (i) compute and analyze some alternative sets of macro-actions for each planner in the portfolio identifying a (possibly empty) useful set, (ii) select a cluster of planners, each one with the identified useful set of macro-actions, that is expected to perform best, and (iii) derive some additional information for configuring the execution scheduling of the selected planners at planning time. The resulting planning system, called PbP (Portfolio- based Planner), has two variants focusing on speed and plan quality. Different versions of PbP entered and won the learning track of the sixth and seventh International Planning Competitions. In this paper, we experimentally analyze PbP considering planning speed and plan quality in depth. We provide a collection of results that help to understand PbP�s behavior, and demonstrate the effectiveness of our approach to configuring a portfolio of planners with macro-actions

Crossref

Archivio istituzionale della ricerca - Università di Brescia

University of Huddersfield Repository

Huddersfield Research Portal

Deep Reinforcement Learning for Decentralized Multi-Robot Exploration With Macro Actions

Author: Bejarano Federico Pizarro
Nejat Goldie
Ren Richard
Tan Aaron Hao
Zhu Yuhan
Publication venue
Publication date: 26/02/2024
Field of study

Cooperative multi-robot teams need to be able to explore cluttered and unstructured environments while dealing with communication dropouts that prevent them from exchanging local information to maintain team coordination. Therefore, robots need to consider high-level teammate intentions during action selection. In this letter, we present the first Macro Action Decentralized Exploration Network (MADE-Net) using multi-agent deep reinforcement learning (DRL) to address the challenges of communication dropouts during multi-robot exploration in unseen, unstructured, and cluttered environments. Simulated robot team exploration experiments were conducted and compared against classical and DRL methods where MADE-Net outperformed all benchmark methods in terms of computation time, total travel distance, number of local interactions between robots, and exploration rate across various degrees of communication dropouts. A scalability study in 3D environments showed a decrease in exploration time with MADE-Net with increasing team and environment sizes. The experiments presented highlight the effectiveness and robustness of our method.Comment: 8 pages, 7 figure

arXiv.org e-Print Archive

Recent Conceptual Consequences of Loop Quantum Gravity. Part II: Holistic Aspects

Author: Zimmermann Rainer E.
Publication venue
Publication date: 01/07/2001
Field of study

Based on the foundational aspects which have been discussed as consequences of ongoing research on loop quantum gravity in the first part of this paper, the holistic aspects of the latter are discussed in this second part, aiming at a consistent and systematic approach to eventually model a hierarchically ordered architecture of the world which is encompassing all of what there actually is. The idea is to clarify the explicit relationship between physics and philosophy on the one hand, and philosophy and the sciences in general, on the other. It is shown that the ontological determination of worldliness is practically identical with its epistemological determination so that the (scientific) activity of modelling and representing the world can be visualized itself as a (worldly) mode of being.Comment: 20 page

arXiv.org e-Print Archive

PhilSci Archive

Recommended from our members

Towards Informed Exploration for Deep Reinforcement Learning

Author: Tang Haoran
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

In this thesis, we discuss various techniques for improving exploration for deep reinforcement learning. We begin with a brief review of reinforcement learning (RL) and the fundamental v.s. exploitation trade-off. Then we review how deep RL has improved upon classical and summarize six categories of the latest exploration methods for deep RL, in the order increasing usage of prior information. We then explore representative works in three categories discuss their strengths and weaknesses. The first category, represented by Soft Q-learning, uses regularization to encourage exploration. The second category, represented by count-based via hashing, maps states to hash codes for counting and assigns higher exploration to less-encountered states. The third category utilizes hierarchy and is represented by modular architecture for RL agents to play StarCraft II. Finally, we conclude that exploration by prior knowledge is a promising research direction and suggest topics of potentially impact

eScholarship - University of California