3,826 research outputs found

    Automatic Detection of Road Cracks using EfficientNet with Residual U-Net-based Segmentation and YOLOv5-based Detection

    Get PDF
    The main factor affecting road performance is pavement damage. One of the difficulties in maintaining roads is pavement cracking. Credible and reliable inspection of heritage structural health relies heavily on crack detection on road surfaces. To achieve intelligent operation and maintenance, intelligent crack detection is essential to traffic safety. The detection of road pavement cracks using computer vision has gained popularity in recent years. Recent technological breakthroughs in general deep learning algorithms have resulted in improved results in the discipline of crack detection. In this paper, two techniques for object identification and segmentation are proposed. The EfficientNet with residual U-Net technique is suggested for segmentation, while the YOLO v5 algorithm is offered for crack detection. To correctly separate the pavement cracks, a crack segmentation network is used. Road crack identification and segmentation accuracy were enhanced by optimising the model's hyperparameters and increasing the feature extraction structure. The suggested algorithm's performance is compared to state-of-the-art algorithms. The suggested work achieves 99.35% accuracy

    Investigating the Application of Deep Convolutional Neural Networks in Semi-supervised Video Object Segmentation

    Get PDF
    This thesis investigates the different approaches to video object segmentation and the current state-of-the-art in the discipline, focusing on the different deep learning techniques used to solve the problem. The primary contribution of the thesis is the investigation of usefulness of Exponential Linear Units as activation functions for deep convolutional neural architectures trained to perform object semi-supervised segmentation in videos. Mask R-CNN was chosen as the base convolutional neural architecture, with the view of extending the image segmentation algorithm to videos. Two models were created, one with Rectified Linear Units and the other with Exponential Linear Units as the respective activation functions. The models were instantiated and fine-tuned on the first frame of each sequence on the test dataset before predicting segmentations. This was done to focus on the principal object in the video for segmentation. Mean Jaccard index was the metric chosen to evaluate the performance of the models. No significant difference was found between the performance of the two models on the test dataset. A qualitative analysis of the performance of the model with ReLU activation functions was conducted with the view of understanding its strengths and weaknesses. The thesis concludes with an overview and a discussion on limitations and recommendations for future work that can be done to extend on the work presented in this thesis
    • …
    corecore