42 research outputs found

    An Integrated Enhancement Solution for 24-hour Colorful Imaging

    Full text link
    The current industry practice for 24-hour outdoor imaging is to use a silicon camera supplemented with near-infrared (NIR) illumination. This will result in color images with poor contrast at daytime and absence of chrominance at nighttime. For this dilemma, all existing solutions try to capture RGB and NIR images separately. However, they need additional hardware support and suffer from various drawbacks, including short service life, high price, specific usage scenario, etc. In this paper, we propose a novel and integrated enhancement solution that produces clear color images, whether at abundant sunlight daytime or extremely low-light nighttime. Our key idea is to separate the VIS and NIR information from mixed signals, and enhance the VIS signal adaptively with the NIR signal as assistance. To this end, we build an optical system to collect a new VIS-NIR-MIX dataset and present a physically meaningful image processing algorithm based on CNN. Extensive experiments show outstanding results, which demonstrate the effectiveness of our solution.Comment: AAAI 2020 (Oral

    Unveiling the Power of Self-supervision for Multi-view Multi-human Association and Tracking

    Full text link
    Multi-view multi-human association and tracking (MvMHAT), is a new but important problem for multi-person scene video surveillance, aiming to track a group of people over time in each view, as well as to identify the same person across different views at the same time, which is different from previous MOT and multi-camera MOT tasks only considering the over-time human tracking. This way, the videos for MvMHAT require more complex annotations while containing more information for self learning. In this work, we tackle this problem with a self-supervised learning aware end-to-end network. Specifically, we propose to take advantage of the spatial-temporal self-consistency rationale by considering three properties of reflexivity, symmetry and transitivity. Besides the reflexivity property that naturally holds, we design the self-supervised learning losses based on the properties of symmetry and transitivity, for both appearance feature learning and assignment matrix optimization, to associate the multiple humans over time and across views. Furthermore, to promote the research on MvMHAT, we build two new large-scale benchmarks for the network training and testing of different algorithms. Extensive experiments on the proposed benchmarks verify the effectiveness of our method. We have released the benchmark and code to the public

    Pathological Evidence Exploration in Deep Retinal Image Diagnosis

    Full text link
    Though deep learning has shown successful performance in classifying the label and severity stage of certain disease, most of them give few evidence on how to make prediction. Here, we propose to exploit the interpretability of deep learning application in medical diagnosis. Inspired by Koch's Postulates, a well-known strategy in medical research to identify the property of pathogen, we define a pathological descriptor that can be extracted from the activated neurons of a diabetic retinopathy detector. To visualize the symptom and feature encoded in this descriptor, we propose a GAN based method to synthesize pathological retinal image given the descriptor and a binary vessel segmentation. Besides, with this descriptor, we can arbitrarily manipulate the position and quantity of lesions. As verified by a panel of 5 licensed ophthalmologists, our synthesized images carry the symptoms that are directly related to diabetic retinopathy diagnosis. The panel survey also shows that our generated images is both qualitatively and quantitatively superior to existing methods.Comment: to appear in AAAI (2019). The first two authors contributed equally to the paper. Corresponding Author: Feng L

    Research on the characteristics and influencing factors of the spatial correlation network of cultivated land utilization ecological efficiency in the upper reaches of the Yangtze River, China.

    No full text
    Researching the structural characteristics of the spatial correlation network of cultivated land utilization ecological efficiency is of great significance to China's food security and agricultural green and low-carbon development. Taking 47 cities (autonomous prefectures) in the upper reaches of the Yangtze River as the research object, the ecological efficiency of cultivated land utilization from 2010 to 2020 was measured based on the unexpected output model (Super SBM), and the spatial correlation matrix was constructed using the revised gravity model. The structural characteristics of the spatial correlation network were analyzed using the social network model (SNA), and finally, the factors affecting the spatial correlation network of cultivated land utilization ecological efficiency in the upper reaches of the Yangtze River were analyzed through the quadratic assignment procedure (QAP) model. The results show that: (1) the ecological efficiency of cultivated land utilization in the upper reaches of the Yangtze River has been increasing year by year, but the overall level is low, and there is a large gap among provinces. Sichuan Province has the highest average value of 0.605, and Yunnan Province has the lowest average value of 0.359. (2) The ecological efficiency of cultivated land utilization in the upper reaches of the Yangtze River has broken through the provincial boundaries and has formed an obvious spatial correlation network, but the overall density is low, and the network is still relatively loose, needing further development and improvement. Chengdu, Yibin, Luzhou, and other cities are located in the center of the network and have formed four cohesive subgroups. (3)The differences in the level of agricultural economic development, the rural per capita disposable income, the differences in agricultural mechanization intensity, the regional population differences, and spatial adjacency have an impact on the spatial network of ecological efficiency of cultivated land utilization in the upper reaches of the Yangtze River. The difference in the level of agricultural economic development, the rural per capita disposable income, and the differences in agricultural mechanization intensity are negatively correlated, while the regional population differences are positively correlated with spatial adjacency

    Method of Calculating the Compensation for Rectifying the Horizontal Displacement of Existing Tunnels by Grouting

    No full text
    Sleeve valve pipe grouting, an effective method for reinforcing soil layers, is often employed to correct the deformation of subway tunnels. In order to study the effect of grouting on rectifying the displacement of existing tunnels, this paper proposes a mechanical model of the volume expansion of sleeve valve pipe grouting taking into consideration the volume expansion of the grouted soil mass. A formula for the additional stress on the soil layer caused by grouting was derived based on the principle of the mirror method. In addition, a formula for the horizontal displacement of a tunnel caused by grouting was developed through a calculation model of shearing dislocation and rigid body rotation. The results of the calculation method proposed herein were in good agreement with actual engineering data. In summary, enlarging the grouting volume within a reasonable range can effectively enhance the grouting corrective effect. Further, with an increase in the grouting distance, the influence of grouting gradually lessens. At a constant grouting length, setting the bottom of the grouting section at the same depth as the lower end of the tunnel can maximize the grouting corrective effect

    An Improved Temporal Fusion Transformers Model for Predicting Supply Air Temperature in High-Speed Railway Carriages

    No full text
    A key element for reducing energy consumption and improving thermal comfort on high-speed rail is controlling air-conditioning temperature. Accurate prediction of air supply temperature is aimed at improving control effects. Existing studies of supply air temperature prediction models are interdisciplinary, involving heat transfer science and computer science, where the problem is defined as time-series prediction. However, the model is widely accepted as a complex model that is nonlinear and dynamic. That makes it difficult for existing statistical and deep learning methods, e.g., autoregressive integrated moving average model (ARIMA), convolutional neural network (CNN), and long short-term memory network (LSTM), to fully capture the interaction between these variables and provide accurate prediction results. Recent studies have shown the potential of the Transformer to increase the prediction capacity. This paper offers an improved temporal fusion transformers (TFT) prediction model for supply air temperature in high-speed train carriages to tackle these challenges, with two improvements: (i) Double-convolutional residual encoder structure based on dilated causal convolution; (ii) Spatio-temporal double-gated structure based on Gated Linear Units. Moreover, this study designs a loss function suitable for general long sequence time-series forecast tasks for temperature forecasting. Empirical simulations using a high-speed rail air-conditioning operation dataset at a specific location in China show that the temperature prediction of the two units using the improved TFT model improves the MAPE by 21.70% and 11.73%, respectively the original model. Furthermore, experiments demonstrate that the model effectively outperforms seven popular methods on time series computing tasks, and the attention of the prediction problem in the time dimension is analyzed
    corecore