Search CORE

9 research outputs found

Improving Object Localization with Fitness NMS and Bounded IoU Loss

Author: Petersson Lars
Tychsen-Smith Lachlan
Publication venue
Publication date: 12/03/2018
Field of study

We demonstrate that many detection methods are designed to identify only a sufficently accurate bounding box, rather than the best available one. To address this issue we propose a simple and fast modification to the existing methods called Fitness NMS. This method is tested with the DeNet model and obtains a significantly improved MAP at greater localization accuracies without a loss in evaluation rate, and can be used in conjunction with Soft NMS for additional improvements. Next we derive a novel bounding box regression loss based on a set of IoU upper bounds that better matches the goal of IoU maximization while still providing good convergence properties. Following these novelties we investigate RoI clustering schemes for improving evaluation rates for the DeNet wide model variants and provide an analysis of localization performance at various input image dimensions. We obtain a MAP of 33.6%@79Hz and 41.8%@5Hz for MSCOCO and a Titan X (Maxwell). Source code available from: https://github.com/lachlants/denetComment: CVPR2018 Main Conference (Poster

arXiv.org e-Print Archive

Crossref

Creating robust high-throughput traffic sign detectors using centre-surround HOG statistics

Author: Andersson Lars
Overett Gary
Petersson Lars
Pettersson Niklas
Tychsen-Smith Lachlan
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 10/12/2015
Field of study

In this paper, we detail a system for creating object detectors which meet the extreme demands of real-world traffic sign detection applications such as GPS map making and real-time in-car traffic sign detection. The resulting detectors are designed to detect and locate multiple traffic sign types in high-definition video (high throughput) from several cameras captured along thousands of kilometers of road with minimal false-positives and detection rates in excess of 99%. This allows for the accurate detection and location of traffic signs in geo-tagged video datasets of entire national road networks in reasonable time using only moderate computing infrastructure. A key to the success of the methods described in this paper is the use of extremely efficient classifier features. In this paper, we identify two obstacles to achieving the desired performance for all target traffic sign types, feature memory bandwidth requirements and feature discriminance. We introduce our use of centre-surround histogram of oriented gradient (HOG) statistics which greatly reduce the per-feature memory bandwidth requirements. Subsequently we extend our use of centre-surround HOG statistics to the color domain, raising the discriminant power of the final classifiers for more challenging sign types

The Australian National University

A Review of Hydrodynamic and Machine Learning Approaches for Flood Inundation Modeling

Author: Ahmedt-Aristizabal David
Armin Mohammed Ali
Karim Fazlul
Petersson Lars
Tychsen-Smith Lachlan
Publication venue: 'MDPI AG'
Publication date: 01/02/2023
Field of study

Machine learning (also called data-driven) methods have become popular in modeling flood inundations across river basins. Among data-driven methods, traditional machine learning (ML) approaches are widely used to model flood events, and recently deep learning (DL) approaches have gained more attention across the world. In this paper, we reviewed recently published literature on ML and DL applications for flood modeling for various hydrologic and catchment characteristics. Our extensive literature review shows that DL models produce better accuracy compared to traditional approaches. Unlike physically based models, ML/DL models suffer from the lack of using expert knowledge in modeling flood events. Apart from challenges in implementing a uniform modeling approach across river basins, the lack of benchmark data to evaluate model performance is a limiting factor for developing efficient ML/DL models for flood inundation modeling.</p

Queensland University of Technology ePrints Archive

Continuous Human Action Recognition for Human-machine Interaction: A Review

Author: Ahmedt-Aristizabal David
Denman Simon
Fookes Clinton
Gammulle Harshala
Petersson Lars
Tychsen-Smith Lachlan
Publication venue: Association for Computing Machinery (ACM)
Publication date: 26/02/2022
Field of study

With advances in data-driven machine learning research, a wide variety of prediction models have been proposed to capture spatio-temporal features for the analysis of video streams. Recognising actions and detecting action transitions within an input video are challenging but necessary tasks for applications that require real-time human-machine interaction. By reviewing a large body of recent related work in the literature, we thoroughly analyse, explain, and compare action segmentation methods and provide details on the feature extraction and learning strategies that are used on most state-of-the-art methods. We cover the impact of the performance of object detection and tracking techniques on human action segmentation methodologies. We investigate the application of such models to real-world scenarios and discuss several limitations and key research directions towards improving interpretability, generalisation, optimisation, and deployment.</p

arXiv.org e-Print Archive

Queensland University of Technology ePrints Archive

Monitoring of Pigmented Skin Lesions Using 3D Whole Body Imaging

Author: Ahmedt-Aristizabal David
Li Shenghong
Nguyen Chuong
Pathikulangara Joseph
Petersson Lars
Stacey Ashley
Tychsen-Smith Lachlan
Wang Dadong
Publication venue: Elsevier Ireland Ltd.
Publication date: 01/04/2023
Field of study

Background and objectives: Advanced artificial intelligence and machine learning have great potential to redefine how skin lesions are detected, mapped, tracked and documented. Here, we propose a 3D whole-body imaging system known as 3DSkin-mapper to enable automated detection, evaluation and mapping of skin lesions. Methods: A modular camera rig arranged in a cylindrical configuration was designed to automatically capture images of the entire skin surface of a subject synchronously from multiple angles. Based on the images, we developed algorithms for 3D model reconstruction, data processing and skin lesion detection and tracking based on deep convolutional neural networks. We also introduced a customised, user-friendly, and adaptable interface that enables individuals to interactively visualise, manipulate, and annotate the images. The interface includes built-in features such as mapping 2D skin lesions onto the corresponding 3D model. Results: The proposed system is developed for skin lesion screening, the focus of this paper is to introduce the system instead of clinical study. Using synthetic and real images we demonstrate the effectiveness of the proposed system by providing multiple views of a target skin lesion, enabling further 3D geometry analysis and longitudinal tracking. Skin lesions are identified as outliers which deserve more attention from a skin cancer physician. Our detector leverages expert annotated labels to learn representations of skin lesions, while capturing the effects of anatomical variability. It takes only a few seconds to capture the entire skin surface, and about half an hour to process and analyse the images. Conclusions: Our experiments show that the proposed system allows fast and easy whole body 3D imaging. It can be used by dermatological clinics to conduct skin screening, detect and track skin lesions over time, identify suspicious lesions, and document pigmented lesions. The system can potentially save clinicians time and effort significantly. The 3D imaging and analysis has the potential to change the paradigm of whole body photography with many applications in skin diseases, including inflammatory and pigmentary disorders. With reduced time requirements for recording and documenting high-quality skin information, doctors could spend more time providing better-quality treatment based on more detailed and accurate information.</p

Queensland University of Technology ePrints Archive

Creating robust high-throughput traffic sign detectors using centre-surround HOG statistics

Author: C. Bishop
C. Burges
Gary Overett
J. Friedman
Lachlan Tychsen-Smith
Lars Andersson
Lars Petersson
Niklas Pettersson
P. Viola
R.E. Schapire
R.O. Duda
S. Paisitkriangkrai
W.A. Wulf
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

In-situ data curation : A key to actionable AI at the edge

Author: Ahmedt-Aristizabal David
Babcock Russel
Crosswell Joey
Do Brendan
Kusy Brano
Li Yang
Liu Jiajun
Malpani Megha
Marchant Ross
Oerlemans Ard
Oorloff Jeremy
Saha Aninda
Steven Andy
Tychsen-Smith Lachlan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 14/10/2022
Field of study

Machine learning (ML) algorithms have shown great potential in edge-computing environments, however, the literature mainly focuses on model inference only. We investigate how ML can be operationalized and how in-situ curation can improve the quality of edge applications, in the context of ML-assisted environmental surveys. We show that camera-enabled ML systems deployed on edge devices can enable scientists to perform real-time monitoring of species of interest or characterization of natural habitats. However, the benefit of this new technology is only as good as the quality and accuracy of the edge ML model inferences. In this demonstration, we show that with small additional time investment, domain scientists can manually curate ML model outputs and thus obtain highly reliable scientific insights, leading to more effective and scalable environmental surveys. </p

Queensland University of Technology ePrints Archive

A real-time edge-AI system for reef surveys

Author: Ahmedt-Aristizabal David
Babcock Russ
Crosswell Joey
Do Brendan
Kusy Brano
Li Yang
Liu Jiajun
Malpani Megha
Marchant Ross
Merz Torsten
Moghadam Peyman
Oerlemans Ard
Oorloff Jeremy
Steven Andy
Tychsen-Smith Lachlan
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 14/10/2022
Field of study

Crown-of-Thorn Starfish (COTS) outbreaks are a major cause of coral loss on the Great Barrier Reef (GBR) and substantial surveillance and control programs are ongoing to manage COTS populations to ecologically sustainable levels. In this paper, we present a comprehensive real-time machine learning-based underwater data collection and curation system on edge devices for COTS monitoring. In particular, we leverage the power of deep learning-based object detection techniques, and propose a resource-efficient COTS detector that performs detection inferences on the edge device to assist marine experts with COTS identification during the data collection phase. The preliminary results show that several strategies for improving computational efficiency (e.g., batch-wise processing, frame skipping, model input size) can be combined to run the proposed detection model on edge hardware with low resource consumption and low information loss. </p

Queensland University of Technology ePrints Archive