162 research outputs found

    DET: Data Enhancement Technique for Aerial Images

    Get PDF
    Deep learning and computer vision are two thriving research areas within machine learning. In recent years, as the available computing power has grown, it has led to the possibility of combining the approaches, achieving state-of-the-art results. An area of research that has greatly benefited from this development is building detection. Although the algorithms produce satisfactory results, there are still many limitations. One significant problem is the quality and edge sharpness of the segmentation masks, which are not up to the standard required by the mapping industry. The predicted mask boundaries need to be sharper and more precise to have practical use in map production. This thesis introduces a novel Data Enhancement Technique (DET) to improve the boundary quality of segmentation masks. DET has two approaches, Seg-DET, which uses a segmentation network, and Edge-DET, which uses an edge-detection network. Both techniques highlight buildings, creating a better input foundation for a secondary segmentation model. Additionally, we introduce ABL(RMI), a new compounding loss consisting of Region Mutual Information Loss (RMI), Lovasz-Softmax Loss (Lovasz), and Active Boundary Loss (ABL). The combination of loss functions in ABL(RMI) is optimized to enhance and improve mask boundaries. This thesis empirically shows that DET can successfully improve segmentation boundaries, but the practical results suggest that further refinement is needed. Additionally, the results show improvements when using the new compounding loss ABL(RMI) compared to its predecessor, ABL(CE) which substitutes RMI with Cross-Entropy loss(CE)

    Effective Image Tampering Localization via Semantic Segmentation Network

    Full text link
    With the widespread use of powerful image editing tools, image tampering becomes easy and realistic. Existing image forensic methods still face challenges of low accuracy and robustness. Note that the tampered regions are typically semantic objects, in this letter we propose an effective image tampering localization scheme based on deep semantic segmentation network. ConvNeXt network is used as an encoder to learn better feature representation. The multi-scale features are then fused by Upernet decoder for achieving better locating capability. Combined loss and effective data augmentation are adopted to ensure effective model training. Extensive experimental results confirm that localization performance of our proposed scheme outperforms other state-of-the-art ones
    • …
    corecore