291 research outputs found
Causal Reinforcement Learning: A Survey
Reinforcement learning is an essential paradigm for solving sequential
decision problems under uncertainty. Despite many remarkable achievements in
recent decades, applying reinforcement learning methods in the real world
remains challenging. One of the main obstacles is that reinforcement learning
agents lack a fundamental understanding of the world and must therefore learn
from scratch through numerous trial-and-error interactions. They may also face
challenges in providing explanations for their decisions and generalizing the
acquired knowledge. Causality, however, offers a notable advantage as it can
formalize knowledge in a systematic manner and leverage invariance for
effective knowledge transfer. This has led to the emergence of causal
reinforcement learning, a subfield of reinforcement learning that seeks to
enhance existing algorithms by incorporating causal relationships into the
learning process. In this survey, we comprehensively review the literature on
causal reinforcement learning. We first introduce the basic concepts of
causality and reinforcement learning, and then explain how causality can
address core challenges in non-causal reinforcement learning. We categorize and
systematically review existing causal reinforcement learning approaches based
on their target problems and methodologies. Finally, we outline open issues and
future directions in this emerging field.Comment: 48 pages, 10 figure
Deterioration Effect of Sandstone Tensile Strength and Its Mesoscopic Mechanism under Dry-wet Cycles
The rock mass in the hydro-fluctuating zone of the reservoir bank slope is under dry-wet cycles for a long time, which will cause the deterioration of rock mass and induce geological disasters. In this study, a series of dry-wet cycle tests on the argillaceous quartz sandstone in the Three Gorges Reservoir area was carried out. Then, after different dry-wet cycles, the sandstone specimens were used to conduct the Brazilian splitting, scanning electron microscope, and 3D laser scanning tests. Herein, we provided detailed physical and microscopy image data to analyze the deterioration effect of tensile strength and mesostructure deterioration process of sandstone. With the increase of dry-wet cycles, the tensile strength of sandstone initially decreases rapidly, and then the decline rate tends to slow down. The evolution laws of fractal dimension and porosity are also significantly consistent with the deterioration of tensile strength. Moreover, further mesostructural analysis has revealed the repeated âabsorption and swelling-dehydration and contractionâ of clay minerals. This results in the breakage of framework mineral quartz and the expansion and connectivity of internal cracks, which ultimately deteriorates sandstoneâs tensile strength
Training-Time-Friendly Network for Real-Time Object Detection
Modern object detectors can rarely achieve short training time, fast
inference speed, and high accuracy at the same time. To strike a balance among
them, we propose the Training-Time-Friendly Network (TTFNet). In this work, we
start with light-head, single-stage, and anchor-free designs, which enable fast
inference speed. Then, we focus on shortening training time. We notice that
encoding more training samples from annotated boxes plays a similar role as
increasing batch size, which helps enlarge the learning rate and accelerate the
training process. To this end, we introduce a novel approach using Gaussian
kernels to encode training samples. Besides, we design the initiative sample
weights for better information utilization. Experiments on MS COCO show that
our TTFNet has great advantages in balancing training time, inference speed,
and accuracy. It has reduced training time by more than seven times compared to
previous real-time detectors while maintaining state-of-the-art performances.
In addition, our super-fast version of TTFNet-18 and TTFNet-53 can outperform
SSD300 and YOLOv3 by less than one-tenth of their training time, respectively.
The code has been made available at
\url{https://github.com/ZJULearning/ttfnet}.Comment: Accepted to AAAI2020 (8 pages, 3 figures
Damage Characteristics of Argillaceous Quartz Sandstone Mesostructure under Different Wetting-drying Conditions
Extensive waterârock interaction in the Three Gorges Reservoir area of the Yangtze River leads to rock mass deterioration along the reservoir banks. However, mineral evolution behavior and its effect on the mesostructure deterioration of rocks under the wettingâdrying cycle condition remain unknown. So, the wettingâdrying cycle tests were conducted on peculiar argillaceous quartz sandstone in TGRA under neutral (pH = 7) and alkaline (pH = 10) water environments. Here, we provided detailed physical and microscopy images data to determine the control mechanism of mineral behavior on the evolution of sandstoneâs mesostructure. Under the neutral condition, repeated âabsorption and swellingâdehydration and contractionâ of clay minerals leads to the repeated physical action of âsqueezingâunloadingâ in the interior of a rock. This results in the initiation and gradual expansion of cracks in the framework mineral quartz, exhibiting failure mode from the interior to the exterior. In contrast, under the alkaline condition, the dissolution on the surface of quartz particles leads to the expansion and connection of pores, implying that the sandstone exhibits failure mode from the exterior to the interior. Moreover, the internal mechanical analysis indicates the minerals are at high pressure because of the expansion of clay minerals in the neutral solution. However, in an alkaline water environment, the extrusion pressure of framework mineral quartz decreases significantly and is not easily broken due to increased porosity. Thus, the evolution behavior of minerals in different water environments plays an important role in the damage of the rock
CD24 Expression as a Marker for Predicting Clinical Outcome in Human Gliomas
CD24 is overexpressed in glioma cells in vitro and in vivo. However, the correlation of its expression with clinicopathological parameters of gliomas and its prognostic significance in this tumor remain largely unknown. To address this problem, 151 glioma specimens and 10 nonneoplastic brain tissues were collected. Quantitative real-time PCR, immunochemistry assay, and Western blot analysis were carried out to investigate the expression of CD24. As per the results, CD24 was overexpressed in gliomas. Its expression levels in glioma tissues with higher grade (P < 0.001) and lower KPS (P < 0.001) were significantly higher than those with lower grade and higher KPS, respectively. Cox multifactor analysis showed that CD24 (P = 0.02) was an independent prognosis factor for human glioma. Our data provides convincing evidence for the first time that the overexpression of CD24 at gene and protein levels is correlated with advanced clinicopathological parameters and poor prognosis in patients with glioma
- âŚ