22 research outputs found

    Learning in Real-Time Search: A Unifying Framework

    Full text link
    Real-time search methods are suited for tasks in which the agent is interacting with an initially unknown environment in real time. In such simultaneous planning and learning problems, the agent has to select its actions in a limited amount of time, while sensing only a local part of the environment centered at the agents current location. Real-time heuristic search agents select actions using a limited lookahead search and evaluating the frontier states with a heuristic function. Over repeated experiences, they refine heuristic values of states to avoid infinite loops and to converge to better solutions. The wide spread of such settings in autonomous software and hardware agents has led to an explosion of real-time search algorithms over the last two decades. Not only is a potential user confronted with a hodgepodge of algorithms, but he also faces the choice of control parameters they use. In this paper we address both problems. The first contribution is an introduction of a simple three-parameter framework (named LRTS) which extracts the core ideas behind many existing algorithms. We then prove that LRTA*, epsilon-LRTA*, SLA*, and gamma-Trap algorithms are special cases of our framework. Thus, they are unified and extended with additional features. Second, we prove completeness and convergence of any algorithm covered by the LRTS framework. Third, we prove several upper-bounds relating the control parameters and solution quality. Finally, we analyze the influence of the three control parameters empirically in the realistic scalable domains of real-time navigation on initially unknown maps from a commercial role-playing game as well as routing in ad hoc sensor networks

    Envisionment-Based Scheduling Using Time Interval Petri Networks: Representation, Inference, and Learning

    No full text
    331 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2000.This interdisciplinary research has made the following five main contributions: (1) A Petri Nets based approach to decision-making scheduling through environment modeling is presented. Petri Nets formalism is known for its solid theoretical base, clear syntax and semantics, intuitive graphic representation, and native concurrency support. (2) The classical Petri Net model is extended in various ways to make it suitable for AI scheduling tasks. The main extensions concern explicit temporal reasoning, context, and operator support. The new formalism is hence called Time Interval Petri Nets (TIPNs). (3) TIPN properties and relation to other Al and Petri Nets formalisms are studied. We also present analysis methods facilitating verification and refinement of Petri Net models. (4) Two machine learning algorithms are developed to synthesize Petri Net models automatically or semi-automatically. One learning algorithm exploits the connection between Petri Nets and Horn clauses by using inductive logic programming methods (ILP) to learn Horn-clauses first and then convert them to TIPNs. The other algorithm employs a general-to-specific search in the space of Petri Net topologies starting with a given initial topology. (5) The framework is applied in the real-time decision-making domain of ship damage control for the tasks of automated problem-solving and intelligent tutoring (advising, critiquing, and scoring). In a large exercise involving approximately 500 simulated ship crisis scenarios, our decision-making expert system showed a 318% improvement over Navy officers by saving 89 more ships.U of I OnlyRestricted to the U of I community idenfinitely during batch ingest of legacy ETD

    Group colorings of complete bipartite graphs and bounds for Häggkvist numbers

    No full text

    Automated instructor assistant for ship damage control

    No full text
    The decision making task of ship damage control includes addressing problems such as fire spread, flooding, smoke, equipment failures, and personnel casualties. It is a challenging and highly stressful domain with a limited provision for real-life training. In response to this need, a multimedia interactive damage control simulator system, called DC-Train 2.0 was recently deployed at a Navy officer training school; it provides officers with an immersive environment for damage control training. This paper describes a component of the DC-Train 2.0 system that provides feedback to the user, called the automated instructor assistant. This assistant is based on a blackboardbased expert system called Minerva-DCA, which is capable of solving damage control scenarios at the “expert ” level. It

    Index permutations and classes of additive cellular automata rules with isomorphic STD

    No full text
    First we consider the question of identifying linear transformations that transform any additive CA rule into an additive CA rule with an isomorphic STD. A general condition is derived. Following on this, we consider a subclass of such transformations (index permutations). This allows, on one hand, a complete description and on the other hand, generalization of the results for the class of linear rules. Then the case of binary valued 1-dimensional additive cellular automata (d = 1, p = 2) and classes of isomorphisms of STDs that contain only a single rule (singletons) are considered. It is shown how singletons can be used to extend known systems of isomorphic STD classes. Finally we study how the baker transformation provides information about singletons and, by using this, present an algorithm that generates all singletons for one-dimensional additive CA of odd sizes
    corecore