22 research outputs found

    The economics of global environmental risk

    Get PDF
    This chapter focusses on global environmental risks such as climate change, an issue that must be confronted as we move into the future. It proposes sound principles of risk management that make sense in today's society generally, going beyond their role of averting and hedging climate risks. This chapter is about these and related questions. In attempting to answer them, it deals with different aspects of the theory of risk-bearing. I explain current responses to global change, focusing on the new challenges: human-induced or endogenous risks, including potentially catastrophic risks, which are not adequately treated by traditional economic analysis. In summary, we are dealing with risks that have two major new characteristics: they are endogenous and potentially catastrophic. In addition, climate risks have three more conventional features: they are poorly understood, correlated and irreversible. In all cases, this chapter proposes ways to advance our understanding of the problems. This chapter proposes ways to evaluate decisions under endogenous and potentially catastrophic risks, and incorporates often neglected features of correlated, poorly understood and irreversible risks. The analysis proposed here opens new ways of thinking and at the same time poses new challenges. At the end I indicate new areas of research.risk; global environment; climate change; endogenous risk; catastrophic risk; risk management; mathematical modeling; endogenous uncertainty; policy

    A General Overview of Risk Theory and its Application to Agriculture

    Get PDF

    Data-efficient reinforcement learning with self-predictive representations

    Full text link
    L'efficacitĂ© des donnĂ©es reste un dĂ©fi majeur dans l'apprentissage par renforcement profond. Bien que les techniques modernes soient capables d'atteindre des performances Ă©levĂ©es dans des tĂąches extrĂȘmement complexes, y compris les jeux de stratĂ©gie comme le StarCraft, les Ă©checs, le shogi et le go, ainsi que dans des domaines visuels exigeants comme les jeux Atari, cela nĂ©cessite gĂ©nĂ©ralement d'Ă©normes quantitĂ©s de donnĂ©es interactives, limitant ainsi l'application pratique de l'apprentissage par renforcement. Dans ce mĂ©moire, nous proposons la SPR, une mĂ©thode inspirĂ©e des rĂ©centes avancĂ©es en apprentissage auto-supervisĂ© de reprĂ©sentations, conçue pour amĂ©liorer l'efficacitĂ© des donnĂ©es des agents d'apprentissage par renforcement profond. Nous Ă©valuons cette mĂ©thode sur l'environement d'apprentissage Atari, et nous montrons qu'elle amĂ©liore considĂ©rablement les performances des agents avec un surcroĂźt de calcul modĂ©rĂ©. Lorsqu'on lui accorde Ă  peu prĂšs le mĂȘme temps d'apprentissage qu'aux testeurs humains, un agent d'apprentissage par renforcement augmentĂ© de SPR atteint des performances surhumaines dans 7 des 26 jeux, une augmentation de 350% par rapport Ă  l'Ă©tat de l'art prĂ©cĂ©dent, tout en amĂ©liorant fortement les performances moyennes et mĂ©dianes. Nous Ă©valuons Ă©galement cette mĂ©thode sur un ensemble de tĂąches de contrĂŽle continu, montrant des amĂ©liorations substantielles par rapport aux mĂ©thodes prĂ©cĂ©dentes. Le chapitre 1 prĂ©sente les concepts nĂ©cessaires Ă  la comprĂ©hension du travail prĂ©sentĂ©, y compris des aperçus de l'apprentissage par renforcement profond et de l'apprentissage auto-supervisĂ© de reprĂ©sentations. Le chapitre 2 contient une description dĂ©taillĂ©e de nos contributions Ă  l'exploitation de l'apprentissage de reprĂ©sentation auto-supervisĂ© pour amĂ©liorer l'efficacitĂ© des donnĂ©es dans l'apprentissage par renforcement. Le chapitre 3 prĂ©sente quelques conclusions tirĂ©es de ces travaux, y compris des propositions pour les travaux futurs.Data efficiency remains a key challenge in deep reinforcement learning. Although modern techniques have been shown to be capable of attaining high performance in extremely complex tasks, including strategy games such as StarCraft, Chess, Shogi, and Go as well as in challenging visual domains such as Atari games, doing so generally requires enormous amounts of interactional data, limiting how broadly reinforcement learning can be applied. In this thesis, we propose SPR, a method drawing from recent advances in self-supervised representation learning designed to enhance the data efficiency of deep reinforcement learning agents. We evaluate this method on the Atari Learning Environment, and show that it dramatically improves performance with limited computational overhead. When given roughly the same amount of learning time as human testers, a reinforcement learning agent augmented with SPR achieves super-human performance on 7 out of 26 games, an increase of 350% over the previous state of the art, while also strongly improving mean and median performance. We also evaluate this method on a set of continuous control tasks, showing substantial improvements over previous methods. Chapter 1 introduces concepts necessary to understand the work presented, including overviews of Deep Reinforcement Learning and Self-Supervised Representation learning. Chapter 2 contains a detailed description of our contributions towards leveraging self-supervised representation learning to improve data-efficiency in reinforcement learning. Chapter 3 provides some conclusions drawn from this work, including a number of proposals for future work

    Robust Door Operation with the Toyota Human Support Robot. Robotic perception, manipulation and learning

    Get PDF
    Robots are progressively spreading to urban, social and assistive domains. Service robots operating in domestic environments typically face a variety of objects they have to deal with to fulfill their tasks. Some of these objects are articulated such as cabinet doors and drawers. The ability to deal with such objects is relevant, as for example navigate between rooms or assist humans in their mobility. The exploration of this task rises interesting questions in some of the main robotic threads such as perception, manipulation and learning. In this work a general framework to robustly operate different types of doors with a mobile manipulator robot is proposed. To push the state-of-the-art, a novel algorithm, that fuses a Convolutional Neural Network with point cloud processing for estimating the end-effector grasping pose in real-time for multiple handles simultaneously from single RGB-D images, is proposed. Also, a Bayesian framework that embodies the robot with the ability to learn the kinematic model of the door from observations of its motion, as well as from previous experiences or human demonstrations. Combining this probabilistic approach with state-of-the-art motion planninOutgoin

    Mass spectrometry data mining for cancer detection

    Get PDF
    Early detection of cancer is crucial for successful intervention strategies. Mass spectrometry-based high throughput proteomics is recognized as a major breakthrough in cancer detection. Many machine learning methods have been used to construct classifiers based on mass spectrometry data for discriminating between cancer stages, yet, the classifiers so constructed generally lack biological interpretability. To better assist clinical uses, a key step is to discover ”biomarker signature profiles”, i.e. combinations of a small number of protein biomarkers strongly discriminating between cancer states. This dissertation introduces two innovative algorithms to automatically search for a signature and to construct a high-performance signature-based classifier for cancer discrimination tasks based on mass spectrometry data, such as data acquired by MALDI or SELDI techniques. Our first algorithm assumes that homogeneous groups of mass spectra can be modeled by (unknown) Gibbs distributions to generate an optimal signature and an associated signature-based classifier by robust log-likelihood analysis; our second algorithm uses a stochastic optimization algorithm to search for two lists of biomarkers, and then constructs a signature-based classifier. To support these two algorithms theoretically, this dissertation also studies the empirical probability distributions of mass spectrometry data and implements the actual fitting of Markov random fields to these high-dimensional distributions. We have validated our two signature discovery algorithms on several mass spectrometry datasets related to ovarian cancer and to colorectal cancer patients groups. For these cancer discrimination tasks, our algorithms have yielded better classification performances than existing machine learning algorithms and in addition,have generated more interpretable explicit signatures.Mathematics, Department o

    High-level environment representations for mobile robots

    Get PDF
    In most robotic applications we are faced with the problem of building a digital representation of the environment that allows the robot to autonomously complete its tasks. This internal representation can be used by the robot to plan a motion trajectory for its mobile base and/or end-effector. For most man-made environments we do not have a digital representation or it is inaccurate. Thus, the robot must have the capability of building it autonomously. This is done by integrating into an internal data structure incoming sensor measurements. For this purpose, a common solution consists in solving the Simultaneous Localization and Mapping (SLAM) problem. The map obtained by solving a SLAM problem is called ``metric'' and it describes the geometric structure of the environment. A metric map is typically made up of low-level primitives (like points or voxels). This means that even though it represents the shape of the objects in the robot workspace it lacks the information of which object a surface belongs to. Having an object-level representation of the environment has the advantage of augmenting the set of possible tasks that a robot may accomplish. To this end, in this thesis we focus on two aspects. We propose a formalism to represent in a uniform manner 3D scenes consisting of different geometric primitives, including points, lines and planes. Consequently, we derive a local registration and a global optimization algorithm that can exploit this representation for robust estimation. Furthermore, we present a Semantic Mapping system capable of building an \textit{object-based} map that can be used for complex task planning and execution. Our system exploits effective reconstruction and recognition techniques that require no a-priori information about the environment and can be used under general conditions

    The economics of global environmental risk

    Get PDF
    This chapter focusses on global environmental risks such as climate change, an issue that must be confronted as we move into the future. It proposes sound principles of risk management that make sense in today's society generally, going beyond their role of averting and hedging climate risks. This chapter is about these and related questions. In attempting to answer them, it deals with different aspects of the theory of risk-bearing. I explain current responses to global change, focusing on the new challenges: human-induced or endogenous risks, including potentially catastrophic risks, which are not adequately treated by traditional economic analysis. In summary, we are dealing with risks that have two major new characteristics: they are endogenous and potentially catastrophic. In addition, climate risks have three more conventional features: they are poorly understood, correlated and irreversible. In all cases, this chapter proposes ways to advance our understanding of the problems. This chapter proposes ways to evaluate decisions under endogenous and potentially catastrophic risks, and incorporates often neglected features of correlated, poorly understood and irreversible risks. The analysis proposed here opens new ways of thinking and at the same time poses new challenges. At the end I indicate new areas of research
    corecore