31 research outputs found

    Multi-view human action recognition in meeting scenarios

    No full text
    Due to the continuous development in deep learning and computer vision, the recognition of human actions has become one of the most popular research topics. Various methods have been proposed to tackle this problem. This project implements a Multi-View Human Action Recognition System with focus on spatio-temporal localization of actions in a meeting scenario. Existing human action recognition systems tend to face the problem of human-to-human or human-to-object occlusion in some cases. This can greatly affect the recognition accuracy. Most of the existing multi-view action recognition systems also do not focus on the spatio-temporal localization of actions. However, the problem of occlusion in meeting scenarios is a frequent phenomenon. Once it occurs, it can persist for a long time. Hence, existing methods and datasets do not work well in this scenario. This project aims to address the above limitations. We first process a multi-view meeting dataset, AMI (Augmented Multi-party Interaction) meeting corpus. To make it can be used for multi-view action recognition. In addition, we use SlowFast Network as a backbone network for action recognition and use Torchreid (A library for deep learning person re-identification in PyTorch) for instance association after learning the features of the input from different camera viewpoints. And finally, the system uses the method of late fusion to fuse the information from the left and right viewpoints into the center viewpoint that has occlusion problem. This method will improve the system's ability to deal with the occlusion problem. The method proposed in this project can improve by up to nearly 10 percent of the mAP (Mean Average Precision) on AMI meeting corpus compared to single-view recognition approaches.Master of Science (Signal Processing

    Piecewise Average-Value Model of PWM Converters With Applications to Large-Signal Transient Simulations

    No full text

    Real-Time Detection of Cook Assistant Overalls Based on Embedded Reasoning

    No full text
    Currently, the target detection based on convolutional neural network plays an important role in image recognition, speech recognition and other fields. However, the current network model features a complex structure, a huge number of parameters and resources. These conditions make it difficult to apply in embedded devices with limited computational capabilities and extreme sensitivity to power consumption. In this regard, the application scenarios of deep learning are limited. This paper proposes a real-time detection scheme for cook assistant overalls based on the Hi3559A embedded processor. With YOLOv3 as the benchmark network, this scheme fully mobilizes the hardware acceleration resources through the network model optimization and the parallel processing technology of the processor, and improves the network reasoning speed, so that the embedded device can complete the task of real-time detection on the local device. The experimental results show that through the purposeful cropping, segmentation and in-depth optimization of the neural network according to the specific processor, the neural network can recognize the image accurately. In an application environment where the power consumption is only 5.5 W, the recognition speed of the neural network on the embedded end is increased to about 28 frames (the design requirement was to achieve a recognition speed of 25 frames or more), so that the optimized network can be effectively applied in the back kitchen overalls identification scene

    Resilience-Oriented Pre-Hurricane Resource Allocation in Distribution Systems Considering Electric Buses

    No full text

    A Fast Neighbor Discovery Algorithm in WSNs

    No full text
    With the quick development of Internet of Things (IoT), one of its important supporting technologies, i.e., wireless sensor networks (WSNs), gets much more attention. Neighbor discovery is an indispensable procedure in WSNs. The existing deterministic neighbor discovery algorithms in WSNs ensure that successful discovery can be obtained within a given period of time, but the average discovery delay is long. It is difficult to meet the need for rapid discovery in mobile low duty cycle environments. In addition, with the rapid development of IoT, the node densities of many WSNs greatly increase. In such scenarios, existing neighbor discovery methods fail to satisfy the requirement in terms of discovery latency under the condition of the same energy consumption. This paper proposes a group-based fast neighbor discovery algorithm (GBFA) to address the issues. By carrying neighbor information in beacon packet, the node knows in advance some potential neighbors. It selects more energy efficient potential neighbors and proactively makes nodes wake up to verify whether these potential neighbors are true neighbors, thereby speeding up neighbor discovery, improving energy utilization efficiency and decreasing network communication load. The evaluation results indicate that, compared with other methods, GBFA decreases the average discovery latency up to 10 . 58 % at the same energy budget

    A novel CGBoost deep learning algorithm for coseismic landslide susceptibility prediction

    No full text
    The accurate prediction of landslide susceptibility shortly after a violent earthquake is quite vital to the emergency rescue in the 72-h “golden window”. However, the limited quantity of interpreted landslides shortly after a massive earthquake makes landslide susceptibility prediction become a challenge. To address this gap, this work suggests an integrated method of Crossing Graph attention network and xgBoost (CGBoost). This method contains three branches, which extract the interrelations among pixels within a slope unit, the interrelations among various slope units, and the relevance between influencing factors and landslide probability, respectively, and obtain rich and discriminative features by an adaptive fusion mechanism. Thus, the difficulty of susceptibility modeling under a small number of coseismic landslides can be reduced. As a basic module of CGBoost, the proposed Crossing graph attention network (Crossgat) could characterize the spatial heterogeneity within and among slope units to reduce the false alarm in the susceptibility results. Moreover, the rainfall dynamic factors are utilized as prediction indices to improve the susceptibility performance, and the prediction index set is established by terrain, geology, human activity, environment, meteorology, and earthquake factors. CGBoost is applied to predict landslide susceptibility in the Gorkha meizoseismal area. 3.43% of coseismic landslides are randomly selected, of which 70% are used for training, and the others for testing. In the testing set, the values of Overall Accuracy, Precision, Recall, F1-score, and Kappa coefficient of CGBoost attain 0.9800, 0.9577, 0.9999, 0.9784, and 0.9598, respectively. Validated by all the coseismic landslides, CGBoost outperforms the current major landslide susceptibility assessment methods. The suggested CGBoost can be also applied to landslide susceptibility prediction in new earthquakes in the future

    An Innovative Approach for Improving the Accuracy of Digital Elevation Models for Cultivated Land

    No full text
    The redistribution of solar radiation, temperature, soil moisture and heat by topography affects the physical and chemical properties of the soil and the spatial distribution characteristics of crop growth. Analyses of the relationship between topography and these variables may help to improve the accuracy of digital elevation models (DEMs). The purpose of correcting Shuttle Radar Topography Mission (SRTM) data is to obtain high-precision DEM data in cultivated land. A typical black soil area was studied. A high-precision reference DEM was generated from an unmanned aerial vehicle (UAV) and extensive measured ground elevation data. The normalized differential vegetation index (NDVI), perpendicular drought index (PDI) extracted from SPOT-6 remote sensing images and potential solar radiation (PSR) extracted from SRTM. The interactions between topography and NDVI, PDI, and PSR were analyzed. The NDVI, PDI and PSR in June, July, August and September of 2016 and the SRTM were used as independent variables, and the UAV DEM was used as the dependent variable. Linear stepwise regression (LSR) and a back-propagation neural network (BPNN) were used to establish an elevation prediction model. The results indicated that (1) The correlation between topography and NDVI, PSR, PDI was significant at 0.01 level. The PDI and PSR improved the spatial resolution of SRTM data and reduce the vertical error. (2) The BPNN (R21 = 0.98, root mean square error, RMSE1 = 0.54) yielded a higher SRTM accuracy than did the studied linear model (RMSE1 = 1.00, R21 = 0.90). (3) A series of significant improvements in the SRTM were observed when assessed with the reference DEMs for two different areas, with RMSE reductions of 91% (from 14.95 m to 1.23 m) and 93% (from 15.6 m to 0.94 m). The proposed method improved the accuracy of existing DEMs and could provide support for accurate farmland management
    corecore