72 research outputs found

    Self-Supervised Reinforcement Learning that Transfers using Random Features

    Full text link
    Model-free reinforcement learning algorithms have exhibited great potential in solving single-task sequential decision-making problems with high-dimensional observations and long horizons, but are known to be hard to generalize across tasks. Model-based RL, on the other hand, learns task-agnostic models of the world that naturally enables transfer across different reward functions, but struggles to scale to complex environments due to the compounding error. To get the best of both worlds, we propose a self-supervised reinforcement learning method that enables the transfer of behaviors across tasks with different rewards, while circumventing the challenges of model-based RL. In particular, we show self-supervised pre-training of model-free reinforcement learning with a number of random features as rewards allows implicit modeling of long-horizon environment dynamics. Then, planning techniques like model-predictive control using these implicit models enable fast adaptation to problems with new reward functions. Our method is self-supervised in that it can be trained on offline datasets without reward labels, but can then be quickly deployed on new tasks. We validate that our proposed method enables transfer across tasks on a variety of manipulation and locomotion domains in simulation, opening the door to generalist decision-making agents

    WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens

    Full text link
    World models play a crucial role in understanding and predicting the dynamics of the world, which is essential for video generation. However, existing world models are confined to specific scenarios such as gaming or driving, limiting their ability to capture the complexity of general world dynamic environments. Therefore, we introduce WorldDreamer, a pioneering world model to foster a comprehensive comprehension of general world physics and motions, which significantly enhances the capabilities of video generation. Drawing inspiration from the success of large language models, WorldDreamer frames world modeling as an unsupervised visual sequence modeling challenge. This is achieved by mapping visual inputs to discrete tokens and predicting the masked ones. During this process, we incorporate multi-modal prompts to facilitate interaction within the world model. Our experiments show that WorldDreamer excels in generating videos across different scenarios, including natural scenes and driving environments. WorldDreamer showcases versatility in executing tasks such as text-to-video conversion, image-tovideo synthesis, and video editing. These results underscore WorldDreamer's effectiveness in capturing dynamic elements within diverse general world environments.Comment: project page: https://world-dreamer.github.io

    Alternate erosion and deposition in the Yangtze Estuary and its future change

    Get PDF
    The morphological changing trend of the Yangtze Estuary, the largest estuary of Asia, has become a focus of research in recent years. Based on a long series of topographic data from 1950 to 2015, this paper studied the erosion-deposition pattern of the entire Yangtze Estuary. An alternation between erosion and deposition was found during the past 65 years, which was in correspondence to the alternation between flood and dry periods identified by multi-year average duration days of high-level water flow (defined as discharge ≥ 60,000 m3/s, namely, D≥60,000) from the Yangtze River Basin. A quantitative relationship was further developed between the erosional/depositional rate of the Yangtze Estuary and the interpreting variables of yearly water discharge, D≥60,000 and yearly river sediment load, with contributing rates of 1%, 59% and 40%, respectively. Mechanism behind the alternate erosion and deposition pattern was analyzed by examining residual water surface slope and the corresponding capacity of sediment transport in flood and dry periods. In flood periods, a larger discharge results in steeper slope of residual water level which permits a greater capacity of sediment transport. Therefore, more bed materials can be washed to the sea, leading to erosion of the estuary. In contrast, flatter slope of residual water level occurs in dry periods, and deposition dominates the estuarine area due to the decreased capacity of sediment transport and the increased backwater effect of flood-tide. Coastal dynamics and estuarine engineering projects alter the local morphological changes, but slightly affect the total erosional/depositional rate of the whole estuarine region. Heavy sedimentation within the Yangtze Estuary after the impoundment of the Three Gorges Dam can be attributed to the reduced occurrence frequency of flood years due to water regulation by the dam, and largely (at least 36%–52%) sourced from the sea. Deposition is still possible to occur in the Yangtze Estuary in the future, because the multi-year average D≥60,000 is unlikely to exceed the critical value of 14 days/yr which corresponds to the future equilibrium state of the Yangtze Estuary, under the water regulation of the large cascade dams in the upper Yangtze. Nevertheless, the mean depositional rate will not surpass the peak value of the past years, since the total sediment load entering the Yangtze Estuary has presented a decreasing trend

    Impact of water sediment diversion and afflux on erosion deposition in the Luoshan Hankou reach, 1 middle Yangtze River, China

    Get PDF
    It is not yet fully understood how water-sediment diversion and afflux along a mainstream reach of a river affect erosion-deposition in downstream reaches. This study focuses on the Luoshan-Hankou mainstream reach of the middle Yangtze River, China. The Luoshan-Hankou reach is vitally important for flood control, being located downstream of three diversion mouths and an afflux outlet along the Jingjiang reach. We establish empirical formulae for sediment transport rates at boundary cross-sections, and hence estimate the amount and proportion of erosion-deposition and its relative increase (termed erosion-deposition promotion) in the Luoshan-Hankou reach. We then propose critical net water supplies from Dongting Lake to Luoshan-Hankou reach based on maxima and equilibria of erosion-deposition and its promotion. It is found that net water supply partly drives erosion-deposition in the Luoshan-Hankou reach where maximal proportions of deposition and deposition-promotion may be approximated by 0.01c-37.67 and 0.01c-37.67 + c-1, in which c is a dimensionless parameter representing the erosion-deposition condition in Luoshan-Hankou reach for no water-sediment exchange. At Zhicheng hydrological station, the critical ratio of net water supply to overall water discharge is 0.418c-33.33-1, and critical net water supply ratios for equilibria of erosion-deposition and its promotion are −1 (or c-33.33-1) and 0 (or (0.06 + 3.257c54.61)-1-1). A chart based on net water supply and c is devised representing four types of erosion–deposition and its promotion for the Luoshan-Hankou reach. Historical data over the past 65 years demonstrate that erosion-deposition and its promotion in the reach are respectively governed by c and net water supply; there is a remarkable shift from alternate erosion-deposition to monotonic erosion whilst the erosion-deposition effect remains consistent. The foregoing are in agreement with observed data, and comparable with data for the Jingjiang reach (affected by the three water-sediment diversion mouths). Satisfactory flood-control conditions in the convergence zone between the Yangtze mainstream and Dongting Lake accompanied by increasing erosion in the Luoshan-Hankou reach are predicted for the future

    Traffic condition tracking and visualization in virtual city testbed

    Get PDF
    Thesis (M. Eng.)--Massachusetts Institute of Technology, Dept. of Electrical Engineering and Computer Science, 2011.Cataloged from PDF version of thesis.Includes bibliographical references (p. 31).Computer traffic simulation is a tool widely used to understand how humans behave under varying traffic conditions. The Virtual City Testbed is a traffic simulation framework built to closely model human behavior by allowing direct user interaction in the simulation. Using the testbed, human subjects can remotely control vehicles in the virtual environment. A virtual positioning system (VPS) is displayed with the testbed client. It tracks and visualizes traffic, disruptions, and tolls local to the user's position. As traffic conditions shift, the VPS dynamically updates to reflect the changes. Together, the testbed and VPS provide an environment for studying how traffic conditions affect decision making.by Boyuan Zhu.M.Eng

    Constrained tropical land temperature-precipitation sensitivity reveals decreasing evapotranspiration and faster vegetation greening in CMIP6 3 projections

    Get PDF
    AbstractOver the tropical land surface, accurate estimates of future changes in temperature, precipitation and evapotranspiration are crucial for ecological sustainability, but remain highly uncertain. Here we develop a series of emergent constraints (ECs) by using historical and future outputs from the Coupled Model Inter-comparison Project Phase 6 (CMIP6) Earth System Models under the four basic Shared Socio-economic Pathway scenarios (SSP126, SSP245, SSP370, and SSP585). Results show that the temperature sensitivity to precipitation during 2015–2100, which varies substantially in the original CMIP6 outputs, becomes systematically negative across SSPs after application of the EC, with absolute values between −1.10 °C mm−1 day and −3.52 °C mm−1 day, and with uncertainties reduced by 9.4% to 41.4%. The trend in tropical land-surface evapotranspiration, which was increasing by 0.292 mm yr−1 in the original CMIP6 model outputs, becomes significantly negative (−0.469 mm yr−1) after applying the constraint. Moreover, we find a significant increase of 58.7% in the leaf area index growth rate.</jats:p
    • …
    corecore