86 research outputs found

    Decision-Theoretic Foundations for Causal Reasoning

    Full text link
    We present a definition of cause and effect in terms of decision-theoretic primitives and thereby provide a principled foundation for causal reasoning. Our definition departs from the traditional view of causation in that causal assertions may vary with the set of decisions available. We argue that this approach provides added clarity to the notion of cause. Also in this paper, we examine the encoding of causal relationships in directed acyclic graphs. We describe a special class of influence diagrams, those in canonical form, and show its relationship to Pearl's representation of cause and effect. Finally, we show how canonical form facilitates counterfactual reasoning.Comment: See http://www.jair.org/ for any accompanying file

    A dynamic transmission model for predicting trends in Helicobacter pylori and associated diseases in the United States.

    Get PDF
    To assess the benefits of intervention programs against Helicobacter pylori infection, we estimated the baseline curves of its incidence and prevalence. We developed a mathematical (compartmental) model of the intrinsic dynamics of H. pylori, which represents the natural history of infection and disease progression. Our model divided the population according to age, infection status, and clinical state. Case-patients were followed from birth to death. A proportion of the population acquired H. pylori infection and became ill with gastritis, duodenal ulcer, chronic atrophic gastritis, or gastric cancer. We simulated the change in transmissibility consistent with the incidence of gastric cancer and duodenal ulcer over time, as well as current H. pylori prevalence. In the United States, transmissibility of H. pylori has decreased to values so low that, should this trend continue, the organism will disappear from the population without targeted intervention; this process, however, will take more than a century

    Influence Diagrams With Memory States: Representation and Algorithms

    Get PDF
    Abstract. Influence diagrams (IDs) offer a powerful framework for decision making under uncertainty, but their applicability has been hindered by the exponential growth of runtime and memory usage—largely due to the no-forgetting assumption. We present a novel way to maintain a limited amount of memory to inform each decision and still obtain near-optimal policies. The approach is based on augmenting the graphical model with memory states that represent key aspects of previous observations—a method that has proved useful in POMDP solvers. We also derive an efficient EM-based message-passing algorithm to compute the policy. Experimental results show that this approach produces highquality approximate polices and offers better scalability than existing methods.

    Graphical models for interactive POMDPs: representations and solutions

    Get PDF
    We develop new graphical representations for the problem of sequential decision making in partially observable multiagent environments, as formalized by interactive partially observable Markov decision processes (I-POMDPs). The graphical models called interactive inf uence diagrams (I-IDs) and their dynamic counterparts, interactive dynamic inf uence diagrams (I-DIDs), seek to explicitly model the structure that is often present in real-world problems by decomposing the situation into chance and decision variables, and the dependencies between the variables. I-DIDs generalize DIDs, which may be viewed as graphical representations of POMDPs, to multiagent settings in the same way that IPOMDPs generalize POMDPs. I-DIDs may be used to compute the policy of an agent given its belief as the agent acts and observes in a setting that is populated by other interacting agents. Using several examples, we show how I-IDs and I-DIDs may be applied and demonstrate their usefulness. We also show how the models may be solved using the standard algorithms that are applicable to DIDs. Solving I-DIDs exactly involves knowing the solutions of possible models of the other agents. The space of models grows exponentially with the number of time steps. We present a method of solving I-DIDs approximately by limiting the number of other agents’ candidate models at each time step to a constant. We do this by clustering models that are likely to be behaviorally equivalent and selecting a representative set from the clusters. We discuss the error bound of the approximation technique and demonstrate its empirical performance

    Optimal control as a graphical model inference problem

    Get PDF
    We reformulate a class of non-linear stochastic optimal control problems introduced by Todorov (2007) as a Kullback-Leibler (KL) minimization problem. As a result, the optimal control computation reduces to an inference computation and approximate inference methods can be applied to efficiently compute approximate optimal controls. We show how this KL control theory contains the path integral control method as a special case. We provide an example of a block stacking task and a multi-agent cooperative game where we demonstrate how approximate inference can be successfully applied to instances that are too complex for exact computation. We discuss the relation of the KL control approach to other inference approaches to control.Comment: 26 pages, 12 Figures; Machine Learning Journal (2012

    Strategic assessment: Using influence diagrams to design distance learning courseware

    Get PDF
    [[abstract]]Web-based distance learning programs are widely available. A few distance education platform and standards were developed or proposed. The importance of distance learning courseware brought the attention to teachers, administrators, and system developers. Among current software systems, it is hard to realize strategic assessment of student learning performance. Since one of the drawbacks of distance education is the load that an instructor needs to spend in courseware design, as well as to analyze student performance based on course contents and test outcomes, it is worthy to investigate an automatic mechanism to help an instructor to produce effective courseware. Thus, distance learning program can proceed efficiently. In this paper, we develop a mechanism for the construction of course structure based on influence diagram. The mechanism can be implemented as a decision support system for the instructor to analyze the relation among course units and test units. The overall value of a courseware can be systematically analyzed.[[notice]]補正完畢[[journaltype]]國外[[booktype]]紙本[[countrycodes]]DE
    corecore