Search CORE

64 research outputs found

SINet: A Scale-insensitive Convolutional Neural Network for Fast Vehicle Detection

Author: Chen Hao
He Shengfeng
Heng Pheng-Ann
Hu Xiaowei
Qin Jing
Xiao Yongjie
Xu Xuemiao
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/05/2018
Field of study

Vision-based vehicle detection approaches achieve incredible success in recent years with the development of deep convolutional neural network (CNN). However, existing CNN based algorithms suffer from the problem that the convolutional features are scale-sensitive in object detection task but it is common that traffic images and videos contain vehicles with a large variance of scales. In this paper, we delve into the source of scale sensitivity, and reveal two key issues: 1) existing RoI pooling destroys the structure of small scale objects, 2) the large intra-class distance for a large variance of scales exceeds the representation capability of a single network. Based on these findings, we present a scale-insensitive convolutional neural network (SINet) for fast detecting vehicles with a large variance of scales. First, we present a context-aware RoI pooling to maintain the contextual information and original structure of small scale objects. Second, we present a multi-branch decision network to minimize the intra-class distance of features. These lightweight techniques bring zero extra time complexity but prominent detection accuracy improvement. The proposed techniques can be equipped with any deep network architectures and keep them trained end-to-end. Our SINet achieves state-of-the-art performance in terms of accuracy and speed (up to 37 FPS) on the KITTI benchmark and a new highway dataset, which contains a large variance of scales and extremely small objects.Comment: Accepted by IEEE Transactions on Intelligent Transportation Systems (T-ITS

arXiv.org e-Print Archive

Enhancing the Safety Object Detection Accuracy in Construction Site Using a Frequency Channel Attention Network Layer

Author: Kang Inuk
Publication venue
Publication date: 26/05/2023
Field of study

Currently, research on securing safety by unmanned systems is being actively conducted. Development is underway to reduce costs and secure worker safety by filling safety-related personnel’s blind spots and reducing their burden. For intelligent safety security, we propose artificial intelligence models that can detect, identify and distinguish major objects based on photographic information. In addition, Frequency Channel Attention Network (FcaNet), which supplements the existing Global Average Pooling (GAP) method, is used to improve the existing algorithm, and the accuracy is improved. For this purpose, 12,000 pieces of photographic data images are collected for 5 major equipment to be encountered in the actual construction environment. The detection and identification performance of the model is maximized by using the FcaNet layer for learning through the existing Faster-RCNN, Libra-RCNN, and Double-Heads model. As a result, the accuracy of the test dataset is improved by 6%, 0.4%, and 0.4%, respectively. And, through using random initialization and improved batch normalization, the shortcomings of limited data are reduced, and the effect of pretraining is obtained without. This results in an improvement of more than 20% in each model, and the revised model shows 0.5% higher than the existing one. It is hoped that these results will be reflected in the work environment intelligence project to further reduce the burden on manpower and improve efficiency

Hybrid mobile computing for connected autonomous vehicles

Author: Wei Jian
Publication venue
Publication date
Field of study

With increasing urbanization and the number of cars on road, there are many global issues on modern transport systems, Autonomous driving and connected vehicles are the most promising technologies to tackle these issues. The so-called integrated technology connected autonomous vehicles (CAV) can provide a wide range of safety applications for safer, greener and more efficient intelligent transport systems (ITS). As computing is an extreme component for CAV systems,various mobile computing models including mobile local computing, mobile edge computing and mobile cloud computing are proposed. However it is believed that none of these models fits all CAV applications, which have highly diverse quality of service (QoS) requirements such as communication delay, data rate, accuracy, reliability and/or computing latency.In this thesis, we are motivated to propose a hybrid mobile computing model with objective of overcoming limitations of individual models and maximizing the performances for CAV applications.In proposed hybrid mobile computing model three basic computing models and/or their combinations are chosen and applied to different CAV applications, which include mobile local computing, mobile edge computing and mobile cloud computing. Different computing models and their combinations are selected according to the QoS requirements of the CAV applications.Following the idea, we first investigate the job offloading and allocation of computing and communication resources at the local hosts and external computing centers with QoS aware and resource awareness. Distributed admission control and resource allocation algorithms are proposed including two baseline non-cooperative algorithms and a matching theory based cooperative algorithm. Experiment results demonstrate the feasibility of the hybrid mobile computing model and show large improvement on the service quality and capacity over existing individual computing models. The matching algorithm also largely outperforms the baseline non-cooperative algorithms.In addition, two specific use cases of the hybrid mobile computing for CAV applications are investigated: object detection with mobile local computing where only local computing resources are used, and movie recommendation with mobile cloud computing where remote cloud resources are used. For object detection, we focus on the challenges of detecting vehicles, pedestrians and cyclists in driving environment and propose three methods to an existing CNN based object detector. Large detection performance improvement is obtained over the KITTI benchmark test dataset. For movie recommendation we propose two recommendation models based on a general framework of integrating machine learning and collaborative filtering approach.The experiment results on Netix movie dataset show that our models are very effective for cold start items recommendatio