17 research outputs found

    Training Deep Learning Models via Synthetic Data: Application in Unmanned Aerial Vehicles

    Get PDF
    This paper describes preliminary work in the recent promising approach of generating synthetic training data for facilitating the learning procedure of deep learning (DL) models, with a focus on aerial photos produced by unmanned aerial vehicles (UAV). The general concept and methodology are described, and preliminary results are presented, based on a classification problem of fire identification in forests as well as a counting problem of estimating number of houses in urban areas. The proposed technique constitutes a new possibility for the DL community, especially related to UAV-based imagery analysis, with much potential, promising results, and unexplored ground for further research.Comment: Workshop on Deep-learning based computer vision for UAV in conjunction with CAIP 2019, Salerno, italy, September 201

    Correction of arbitrary geometric artefacts in historical documents

    No full text
    The research presented in this thesis addresses the problem of correction of arbitrary geometric artefacts in historical documents. Geometric distortions in historical documents may be introduced at any time during the life cycle of a document, from when it was first printed to the time it is digitised by an imaging device. Such distortions appear as arbitrary warping, folds and page curl, and have detrimental effects to recognition (OCR) and readability (e.g. for print-on-demand). This thesis also critically examines the state of the art methods and identifies opportunities for significant improvement.Firstly, the present work focuses on the main issues in text line segmentation and proposes a method which is robust in the presence of various geometric distortions, other artefacts in historical documents, and dense and complex layout. Secondly, a precise base line detection method based on geometric features of the parametric model of the segmented line is presented. In other words, the proposed base line detection method not only takes into consideration unexpected geometric distortions, which are common in historical document images— but it also identifies certain main components of the text line, such as ascenders, descenders, and certain decorative marks, and makes intelligent distinctions between such native (but potentially misleading) components of the line and other global and local distortions of the whole page.Such precise derivation of the baselines (and in certain instances the top lines) will serve as building blocks for a major correction stage, namely the de-warping procedure. At its starting point, the proposed de-warping method takes into account both global and local characteristics of the text image and models the smooth deformations between text lines; by taking advantage of the proposed line segmentation and baseline detection stages, it can cope with a variety of distortions, such as page curl, arbitrary warping and fold, in a reliable, robust, and flexible manner

    Restoration of arbitrarily warped historical document images using flow lines

    No full text
    Historical documents frequently suffer from arbitrary geometric distortions (warping and folds) due to storage conditions, use and to, some extent, the printing process of the time. In addition, page curl can be prominent due to the scanning technique used. Such distortions adversely affect OCR and print-on-demand quality. Previous approaches to geometric restoration either focus only on the correction of page curl or require supplementary informatio n obtained by additional scanning hardware ` not practical for existing scans. This paper presents a new approach to detect and restore arbitrary warping and folds, in addition to page curl. Warped text lines and the smooth deformation between them are precisely modelled as primary and secondary flow lines that are then restored to their original linear shape. Preliminary, but representative, experimental results, in comparison to a leading page curl removal method and an industry-standard commercial system, demonstrate the effectiveness of the proposed metho

    Deep Learning with Data Augmentation for Fruit Counting

    No full text
    Counting the number of fruits in an image is important for orchard management, but is complex due to different challenging problems such as overlapping fruits and the difficulty to create large labeled datasets. In this paper, we propose the use of a data-augmentation technique that creates novel images by adding a number of manually cropped fruits to original images. This helps to increase the size of a dataset with new images containing more fruits and guarantees correct label information. Furthermore, two different approaches for fruit counting are compared: a holistic regression-based approach, and a detection-based approach. The regression-based approach has the advantage that it only needs as target value the number of fruits in an image compared to the detection-based approach where bounding boxes need to be specified. We combine both approaches with different deep convolutional neural network architectures and object-detection methods. We also introduce a new dataset of 1500 images named the Five-Tropical-Fruits dataset and perform experiments to evaluate the usefulness of augmenting the dataset for the different fruit-counting approaches. The results show that the regression-based approaches profit a lot from the data-augmentation method, whereas the detection-based approaches are not aided by data augmentation. Although one detection-based approach finally still works best, this comes with the cost of much more labeling effort.</p

    Performance improvements of a sweet pepper harvesting robot in protected cropping environments

    No full text
    Using robots to harvest sweet peppers in protected cropping environments has remained unsolved despite considerable effort by the research community over several decades. In this paper, we present the robotic harvester, Harvey, designed for sweet peppers in protected cropping environments that achieved a 76.5% success rate on 68 fruit (within a modified scenario) which improves upon our prior work which achieved 58% on 24 fruit and related sweet pepper harvesting work which achieved 33% on 39 fruit (for their best tool in a modified scenario). This improvement was primarily achieved through the introduction of a novel peduncle segmentation system using an efficient deep convolutional neural network, in conjunction with three-dimensional postfiltering to detect the critical cutting location. We benchmark the peduncle segmentation against prior art demonstrating an improvement in performance with a (Formula presented.) score of 0.564 compared to 0.302. The robotic harvester uses a perception pipeline to detect a target sweet pepper and an appropriate grasp and cutting pose used to determine the trajectory of a multimodal harvesting tool to grasp the sweet pepper and cut it from the plant. A novel decoupling mechanism enables the gripping and cutting operations to be performed independently. We perform an in-depth analysis of the full robotic harvesting system to highlight bottlenecks and failure points that future work could address.</p
    corecore