291 research outputs found

    Causal Reinforcement Learning: A Survey

    Full text link
    Reinforcement learning is an essential paradigm for solving sequential decision problems under uncertainty. Despite many remarkable achievements in recent decades, applying reinforcement learning methods in the real world remains challenging. One of the main obstacles is that reinforcement learning agents lack a fundamental understanding of the world and must therefore learn from scratch through numerous trial-and-error interactions. They may also face challenges in providing explanations for their decisions and generalizing the acquired knowledge. Causality, however, offers a notable advantage as it can formalize knowledge in a systematic manner and leverage invariance for effective knowledge transfer. This has led to the emergence of causal reinforcement learning, a subfield of reinforcement learning that seeks to enhance existing algorithms by incorporating causal relationships into the learning process. In this survey, we comprehensively review the literature on causal reinforcement learning. We first introduce the basic concepts of causality and reinforcement learning, and then explain how causality can address core challenges in non-causal reinforcement learning. We categorize and systematically review existing causal reinforcement learning approaches based on their target problems and methodologies. Finally, we outline open issues and future directions in this emerging field.Comment: 48 pages, 10 figure

    Deterioration Effect of Sandstone Tensile Strength and Its Mesoscopic Mechanism under Dry-wet Cycles

    Get PDF
    The rock mass in the hydro-fluctuating zone of the reservoir bank slope is under dry-wet cycles for a long time, which will cause the deterioration of rock mass and induce geological disasters. In this study, a series of dry-wet cycle tests on the argillaceous quartz sandstone in the Three Gorges Reservoir area was carried out. Then, after different dry-wet cycles, the sandstone specimens were used to conduct the Brazilian splitting, scanning electron microscope, and 3D laser scanning tests. Herein, we provided detailed physical and microscopy image data to analyze the deterioration effect of tensile strength and mesostructure deterioration process of sandstone. With the increase of dry-wet cycles, the tensile strength of sandstone initially decreases rapidly, and then the decline rate tends to slow down. The evolution laws of fractal dimension and porosity are also significantly consistent with the deterioration of tensile strength. Moreover, further mesostructural analysis has revealed the repeated “absorption and swelling-dehydration and contraction” of clay minerals. This results in the breakage of framework mineral quartz and the expansion and connectivity of internal cracks, which ultimately deteriorates sandstone’s tensile strength

    Training-Time-Friendly Network for Real-Time Object Detection

    Full text link
    Modern object detectors can rarely achieve short training time, fast inference speed, and high accuracy at the same time. To strike a balance among them, we propose the Training-Time-Friendly Network (TTFNet). In this work, we start with light-head, single-stage, and anchor-free designs, which enable fast inference speed. Then, we focus on shortening training time. We notice that encoding more training samples from annotated boxes plays a similar role as increasing batch size, which helps enlarge the learning rate and accelerate the training process. To this end, we introduce a novel approach using Gaussian kernels to encode training samples. Besides, we design the initiative sample weights for better information utilization. Experiments on MS COCO show that our TTFNet has great advantages in balancing training time, inference speed, and accuracy. It has reduced training time by more than seven times compared to previous real-time detectors while maintaining state-of-the-art performances. In addition, our super-fast version of TTFNet-18 and TTFNet-53 can outperform SSD300 and YOLOv3 by less than one-tenth of their training time, respectively. The code has been made available at \url{https://github.com/ZJULearning/ttfnet}.Comment: Accepted to AAAI2020 (8 pages, 3 figures

    Damage Characteristics of Argillaceous Quartz Sandstone Mesostructure under Different Wetting-drying Conditions

    Get PDF
    Extensive water–rock interaction in the Three Gorges Reservoir area of the Yangtze River leads to rock mass deterioration along the reservoir banks. However, mineral evolution behavior and its effect on the mesostructure deterioration of rocks under the wetting–drying cycle condition remain unknown. So, the wetting–drying cycle tests were conducted on peculiar argillaceous quartz sandstone in TGRA under neutral (pH = 7) and alkaline (pH = 10) water environments. Here, we provided detailed physical and microscopy images data to determine the control mechanism of mineral behavior on the evolution of sandstone’s mesostructure. Under the neutral condition, repeated “absorption and swelling–dehydration and contraction” of clay minerals leads to the repeated physical action of “squeezing–unloading” in the interior of a rock. This results in the initiation and gradual expansion of cracks in the framework mineral quartz, exhibiting failure mode from the interior to the exterior. In contrast, under the alkaline condition, the dissolution on the surface of quartz particles leads to the expansion and connection of pores, implying that the sandstone exhibits failure mode from the exterior to the interior. Moreover, the internal mechanical analysis indicates the minerals are at high pressure because of the expansion of clay minerals in the neutral solution. However, in an alkaline water environment, the extrusion pressure of framework mineral quartz decreases significantly and is not easily broken due to increased porosity. Thus, the evolution behavior of minerals in different water environments plays an important role in the damage of the rock

    CD24 Expression as a Marker for Predicting Clinical Outcome in Human Gliomas

    Get PDF
    CD24 is overexpressed in glioma cells in vitro and in vivo. However, the correlation of its expression with clinicopathological parameters of gliomas and its prognostic significance in this tumor remain largely unknown. To address this problem, 151 glioma specimens and 10 nonneoplastic brain tissues were collected. Quantitative real-time PCR, immunochemistry assay, and Western blot analysis were carried out to investigate the expression of CD24. As per the results, CD24 was overexpressed in gliomas. Its expression levels in glioma tissues with higher grade (P < 0.001) and lower KPS (P < 0.001) were significantly higher than those with lower grade and higher KPS, respectively. Cox multifactor analysis showed that CD24 (P = 0.02) was an independent prognosis factor for human glioma. Our data provides convincing evidence for the first time that the overexpression of CD24 at gene and protein levels is correlated with advanced clinicopathological parameters and poor prognosis in patients with glioma
    • …
    corecore