Search CORE

649 research outputs found

Industrial Segment Anything -- a Case Study in Aircraft Manufacturing, Intralogistics, Maintenance, Repair, and Overhaul

Author: Gierecker Johann
Gomse Martin
Holst Dirk
Koch Julian
Kähler Falko
Moenck Keno
Prünte Philipp
Sahrhage Arne
Schmedemann Ole
Schoepflin Daniel
Schüppstuhl Thorsten
Wendt Arne
Publication venue
Publication date: 24/07/2023
Field of study

Deploying deep learning-based applications in specialized domains like the aircraft production industry typically suffers from the training data availability problem. Only a few datasets represent non-everyday objects, situations, and tasks. Recent advantages in research around Vision Foundation Models (VFM) opened a new area of tasks and models with high generalization capabilities in non-semantic and semantic predictions. As recently demonstrated by the Segment Anything Project, exploiting VFM's zero-shot capabilities is a promising direction in tackling the boundaries spanned by data, context, and sensor variety. Although, investigating its application within specific domains is subject to ongoing research. This paper contributes here by surveying applications of the SAM in aircraft production-specific use cases. We include manufacturing, intralogistics, as well as maintenance, repair, and overhaul processes, also representing a variety of other neighboring industrial domains. Besides presenting the various use cases, we further discuss the injection of domain knowledge

arXiv.org e-Print Archive

Deep Neural Networks for Visual Bridge Inspections and Defect Visualisation in Civil Engineering

Author: Bush Julia
Corradi Tadeo
Ninic Jelena
Thermou Georgia
Publication venue: Universitätsverlag der Technischen Universität Berlin
Publication date: 03/01/2022
Field of study

University of Birmingham Research Portal

A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends

Author: Ansari Mohsen
Ejlali Alireza
Fazli MohammadAmin
Henkel Jörg
Shafique Muhammad
Younesi Abolfazl
Publication venue
Publication date: 28/02/2024
Field of study

In today's digital age, Convolutional Neural Networks (CNNs), a subset of Deep Learning (DL), are widely used for various computer vision tasks such as image classification, object detection, and image segmentation. There are numerous types of CNNs designed to meet specific needs and requirements, including 1D, 2D, and 3D CNNs, as well as dilated, grouped, attention, depthwise convolutions, and NAS, among others. Each type of CNN has its unique structure and characteristics, making it suitable for specific tasks. It's crucial to gain a thorough understanding and perform a comparative analysis of these different CNN types to understand their strengths and weaknesses. Furthermore, studying the performance, limitations, and practical applications of each type of CNN can aid in the development of new and improved architectures in the future. We also dive into the platforms and frameworks that researchers utilize for their research or development from various perspectives. Additionally, we explore the main research fields of CNN like 6D vision, generative models, and meta-learning. This survey paper provides a comprehensive examination and comparison of various CNN architectures, highlighting their architectural differences and emphasizing their respective advantages, disadvantages, applications, challenges, and future trends

arXiv.org e-Print Archive

EG-ICE 2021 Workshop on Intelligent Computing in Engineering

Author
Publication venue
Publication date: 03/01/2022
Field of study

The 28th EG-ICE International Workshop 2021 brings together international experts working at the interface between advanced computing and modern engineering challenges. Many engineering tasks require open-world resolutions to support multi-actor collaboration, coping with approximate models, providing effective engineer-computer interaction, search in multi-dimensional solution spaces, accommodating uncertainty, including specialist domain knowledge, performing sensor-data interpretation and dealing with incomplete knowledge. While results from computer science provide much initial support for resolution, adaptation is unavoidable and most importantly, feedback from addressing engineering challenges drives fundamental computer-science research. Competence and knowledge transfer goes both ways

Directory of Open Access Books (DOAB)

Robust Mobile Visual Recognition System: From Bag of Visual Words to Deep Learning

Author: Li Dawei
Publication venue: Lehigh Preserve
Publication date
Field of study

With billions of images captured by mobile users everyday, automatically recognizing contents in such images has become a particularly important feature for various mobile apps, including augmented reality, product search, visual-based authentication etc. Traditionally, a client-server architecture is adopted such that the mobile client sends captured images/video frames to a cloud server, which runs a set of task-specific computer vision algorithms and sends back the recognition results. However, such scheme may cause problems related to user privacy, network stability/availability and device energy.In this dissertation, we investigate the problem of building a robust mobile visual recognition system that achieves high accuracy, low latency, low energy cost and privacy protection. Generally, we study two broad types of recognition methods: the bag of visual words (BOVW) based retrieval methods, which search the nearest neighbor image to a query image, and the state-of-the-art deep learning based methods, which recognize a given image using a trained deep neural network. The challenges of deploying BOVW based retrieval methods include: size of indexed image database, query latency, feature extraction efficiency and re-ranking performance. To address such challenges, we first proposed EMOD which enables efficient on-device image retrieval on a downloaded context-dependent partial image database. The efficiency is achieved by analyzing the BOVW processing pipeline and optimizing each module with algorithmic improvement.Recent deep learning based recognition approaches have been shown to greatly exceed the performance of traditional approaches. We identify several challenges of applying deep learning based recognition methods on mobile scenarios, namely energy efficiency and privacy protection for real-time visual processing, and mobile visual domain biases. Thus, we proposed two techniques to address them, (i) efficiently splitting the workload across heterogeneous computing resources, i.e., mobile devices and the cloud using our Moca framework, and (ii) using mobile visual domain adaptation as proposed in our collaborative edge-mediated platform DeepCham. Our extensive experiments on large-scale benchmark datasets and off-the-shelf mobile devices show our solutions provide better results than the state-of-the-art solutions

Lehigh University: Lehigh Preserve

System Abstractions for Scalable Application Development at the Edge

Author: Hu Bo
Publication venue: EliScholar – A Digital Platform for Scholarly Publishing at Yale
Publication date: 01/04/2022
Field of study

Recent years have witnessed an explosive growth of Internet of Things (IoT) devices, which collect or generate huge amounts of data. Given diverse device capabilities and application requirements, data processing takes place across a range of settings, from on-device to a nearby edge server/cloud and remote cloud. Consequently, edge-cloud coordination has been studied extensively from the perspectives of job placement, scheduling and joint optimization. Typical approaches focus on performance optimization for individual applications. This often requires domain knowledge of the applications, but also leads to application-specific solutions. Application development and deployment over diverse scenarios thus incur repetitive manual efforts. There are two overarching challenges to provide system-level support for application development at the edge. First, there is inherent heterogeneity at the device hardware level. The execution settings may range from a small cluster as an edge cloud to on-device inference on embedded devices, differing in hardware capability and programming environments. Further, application performance requirements vary significantly, making it even more difficult to map different applications to already heterogeneous hardware. Second, there are trends towards incorporating edge and cloud and multi-modal data. Together, these add further dimensions to the design space and increase the complexity significantly. In this thesis, we propose a novel framework to simplify application development and deployment over a continuum of edge to cloud. Our framework provides key connections between different dimensions of design considerations, corresponding to the application abstraction, data abstraction and resource management abstraction respectively. First, our framework masks hardware heterogeneity with abstract resource types through containerization, and abstracts away the application processing pipelines into generic flow graphs. Further, our framework further supports a notion of degradable computing for application scenarios at the edge that are driven by multimodal sensory input. Next, as video analytics is the killer app of edge computing, we include a generic data management service between video query systems and a video store to organize video data at the edge. We propose a video data unit abstraction based on a notion of distance between objects in the video, quantifying the semantic similarity among video data. Last, considering concurrent application execution, our framework supports multi-application offloading with device-centric control, with a userspace scheduler service that wraps over the operating system scheduler

Yale University

EG-ICE 2021 Workshop on Intelligent Computing in Engineering

Author
Publication venue
Publication date
Field of study

OAPEN Library

Recommended from our members

Application of Radial Distribution Functions to Diffraction and Imaging Data: Interfacial Structures, Amorphous, Disordered Materials

Author: Mukaddem Karim Tanju
Publication venue: University of Cambridge
Publication date: 16/04/2020
Field of study

The central theme of this thesis is the application of radial and pair distribution function analysis to materials characterisation problems for nanotechnology. These concepts are introduced in Chapter 1, and the associated methods are described in Chapter 2. Chapter 3 details the first of the results which discusses the design and development of a software tool called ImageDataExtractor. This auto-extracts microscopy images and then analyses them to afford quantitative information regarding particles in a sample, such as shape, size and distribution. It realises an opportunity for data-mining the ubiquity of readily available images in the literature. Chapter 4 presents results of the development and execution of a novel experimental technique, called glancing-angle pair distribution function (gaPDF) analysis, applied to the structure of the working electrode in dye-sensitised solar cells (DSSCs). This structure was successfully observed, validating this novel method. The investigation also suggested preferred binding modes of the carboxylic acid anchoring groups present in this interfacial structure. Chapters 5 and 6 demonstrate the application of PDF analysis to synchrotron-based powder diffraction data of two material case studies: the rare earth phosphate glass (REPG) (Gd2O3)0.230(P2O5)0.770, and four Ru based photo-isomers. The closest R…R rare earth separation, which governs optical properties of REPGs, was determined to be 4.2(1) Å, aided by various statistical techniques. Analysis on four Ru-based photo-isomers confirmed: the existence of local structure in such compounds, their ability to be photo-isomerised in powder form, the theoretical models constructed using computational techniques, and the lack of heterogeneity in photo-isomerisation throughout a given light-induced sample. Chapter 7 concludes the work and offers a future outlook

Apollo (Cambridge)