Machine learning for improved detection and segmentation of building boundary

Algarni, Salem Khodair

Machine learning for improved detection and segmentation of building boundary

Authors: Salem Khodair Algarni
Publication date
Publisher

Abstract

The first step in rescuing and mitigating the losses from natural or man-made disasters is to assess damaged assets, including services, utilities and infrastructure, such as buildings. However, manual visual analysis of the affected buildings can be time consuming and labour intensive. Automatic detection of buildings, on the other hand, has the potential to overcome the limitations of conventional approaches. This thesis reviews the existing methods for the automated detection of objects using multi-source geospatial data and presents two novel post processing techniques. Effective building segmentation and recognition techniques are also investigated. Artificial intelligence techniques have been used to identify building boundaries in automated building-detection applications. Compared with other neural network models, the convolutional neural network (CNN) architectures based on supervised and unsupervised approaches provide better results by looking at the image details as spatial information of the entity in the frame. This research incorporates the improved semantic detection ability of Region-based Convolutional Neural Network (Mask R-CNN) and the segmentation refining capability of the conditional random field (CRF)s. Mask R-CNN uses a pre-trained network to recognise the boundary boxes around buildings. It also provides contour key points around buildings that are masked in satellite images. This thesis proposes two novel post-processing techniques that operate by modifying and detecting the building’s relative orientation properties and combining the key points predicted by the two head neural networks to modify the predicted contour with the help of the proposed novel snap algorithms. The results show significant improvements in the accuracy of boundary detection compared with the state-ofthe-art techniques of 2.5%, 4.6% and 1% for F1-Score, Intersection over Union also known as Jacard coefficient (IoU), and overall pixel accuracy, respectively. CNNs have proven to be powerful tools for a wide range of image processing tasks where they can be used to automatically learn mid-level and high-level concepts from raw data, such as images. Finally, the results highlight the potential of further approaches to these applications, such as the planning of infrastructure

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Supporting member

Online Research @ Cardiff

oai:https://orca.cardiff.ac.uk...

Last time updated on 21/10/2022