21,900 research outputs found
Adding Conditional Control to Text-to-Image Diffusion Models
We present ControlNet, a neural network architecture to add spatial
conditioning controls to large, pretrained text-to-image diffusion models.
ControlNet locks the production-ready large diffusion models, and reuses their
deep and robust encoding layers pretrained with billions of images as a strong
backbone to learn a diverse set of conditional controls. The neural
architecture is connected with "zero convolutions" (zero-initialized
convolution layers) that progressively grow the parameters from zero and ensure
that no harmful noise could affect the finetuning. We test various conditioning
controls, eg, edges, depth, segmentation, human pose, etc, with Stable
Diffusion, using single or multiple conditions, with or without prompts. We
show that the training of ControlNets is robust with small (<50k) and large
(>1m) datasets. Extensive results show that ControlNet may facilitate wider
applications to control image diffusion models.Comment: Codes and Supplementary Material:
https://github.com/lllyasviel/ControlNe
Assessing overall network structure in regional innovation policies: a case study of cluster policy in the West Midlands in the UK
Revisiting the theoretical roots of the key concepts of âembeddednessâ and ânetworksâ that underpin many recent regional innovation polices, this paper strives to achieve a more systematic understanding of the overall network structure of geographic agglomerations, which helps to form a more convincing model of regional development based on learning. This also helps to establish an analytical framework with indicators to assess the overall network structure in regional innovation policies. Employing the framework, the examination of cluster policy in the West Midlands highlights its weakness in addressing the overall cluster network structure and the contingent factors influencing the structure. The analysis suggests that there may be similar weaknesses in other regional innovation policies and the theories underpinning them as they share a common weakness in addressing the structural characteristics of overall networks
Neural radiance fields in the industrial and robotics domain: applications, research opportunities and use cases
The proliferation of technologies, such as extended reality (XR), has
increased the demand for high-quality three-dimensional (3D) graphical
representations. Industrial 3D applications encompass computer-aided design
(CAD), finite element analysis (FEA), scanning, and robotics. However, current
methods employed for industrial 3D representations suffer from high
implementation costs and reliance on manual human input for accurate 3D
modeling. To address these challenges, neural radiance fields (NeRFs) have
emerged as a promising approach for learning 3D scene representations based on
provided training 2D images. Despite a growing interest in NeRFs, their
potential applications in various industrial subdomains are still unexplored.
In this paper, we deliver a comprehensive examination of NeRF industrial
applications while also providing direction for future research endeavors. We
also present a series of proof-of-concept experiments that demonstrate the
potential of NeRFs in the industrial domain. These experiments include
NeRF-based video compression techniques and using NeRFs for 3D motion
estimation in the context of collision avoidance. In the video compression
experiment, our results show compression savings up to 48\% and 74\% for
resolutions of 1920x1080 and 300x168, respectively. The motion estimation
experiment used a 3D animation of a robotic arm to train Dynamic-NeRF (D-NeRF)
and achieved an average peak signal-to-noise ratio (PSNR) of disparity map with
the value of 23 dB and an structural similarity index measure (SSIM) 0.97
Non-destructive testing of composite fibre materials with hyperspectral imaging: evaluative studies in the EU H2020 FibreEUse project.
Through capturing spectral data from a wide frequency range along with the spatial information, hyperspectral imaging (HSI) can detect minor differences in terms of temperature, moisture and chemical composition. Therefore, HSI has been successfully applied in various applications, including remote sensing for security and defense, precision agriculture for vegetation and crop monitoring, food/drink, and pharmaceuticals quality control. However, for condition monitoring and damage detection in carbon fibre reinforced polymer (CFRP), the use of HSI is a relatively untouched area, as existing non-destructive testing (NDT) techniques focus mainly on delivering information about physical integrity of structures but not on material composition. To this end, HSI can provide a unique way to tackle this challenge. In this paper, with the use of a near-infrared HSI camera, applications of HSI for the non-destructive inspection of CFRP products are introduced, taking the EU H2020 FibreEUse project as the background. Technical challenges and solutions on three case studies are presented in detail, including adhesive residues detection, surface damage detection and Cobot based automated inspection. Experimental results have fully demonstrated the great potential of HSI and related vision techniques for NDT of CFRP, especially the potential to satisfy the industrial manufacturing environment
A Survey on Unsupervised Anomaly Detection Algorithms for Industrial Images
In line with the development of Industry 4.0, surface defect
detection/anomaly detection becomes a topical subject in the industry field.
Improving efficiency as well as saving labor costs has steadily become a matter
of great concern in practice, where deep learning-based algorithms perform
better than traditional vision inspection methods in recent years. While
existing deep learning-based algorithms are biased towards supervised learning,
which not only necessitates a huge amount of labeled data and human labor, but
also brings about inefficiency and limitations. In contrast, recent research
shows that unsupervised learning has great potential in tackling the above
disadvantages for visual industrial anomaly detection. In this survey, we
summarize current challenges and provide a thorough overview of recently
proposed unsupervised algorithms for visual industrial anomaly detection
covering five categories, whose innovation points and frameworks are described
in detail. Meanwhile, publicly available datasets for industrial anomaly
detection are introduced. By comparing different classes of methods, the
advantages and disadvantages of anomaly detection algorithms are summarized.
Based on the current research framework, we point out the core issue that
remains to be resolved and provide further improvement directions. Meanwhile,
based on the latest technological trends, we offer insights into future
research directions. It is expected to assist both the research community and
industry in developing a broader and cross-domain perspective
Towards Artificial General Intelligence (AGI) in the Internet of Things (IoT): Opportunities and Challenges
Artificial General Intelligence (AGI), possessing the capacity to comprehend,
learn, and execute tasks with human cognitive abilities, engenders significant
anticipation and intrigue across scientific, commercial, and societal arenas.
This fascination extends particularly to the Internet of Things (IoT), a
landscape characterized by the interconnection of countless devices, sensors,
and systems, collectively gathering and sharing data to enable intelligent
decision-making and automation. This research embarks on an exploration of the
opportunities and challenges towards achieving AGI in the context of the IoT.
Specifically, it starts by outlining the fundamental principles of IoT and the
critical role of Artificial Intelligence (AI) in IoT systems. Subsequently, it
delves into AGI fundamentals, culminating in the formulation of a conceptual
framework for AGI's seamless integration within IoT. The application spectrum
for AGI-infused IoT is broad, encompassing domains ranging from smart grids,
residential environments, manufacturing, and transportation to environmental
monitoring, agriculture, healthcare, and education. However, adapting AGI to
resource-constrained IoT settings necessitates dedicated research efforts.
Furthermore, the paper addresses constraints imposed by limited computing
resources, intricacies associated with large-scale IoT communication, as well
as the critical concerns pertaining to security and privacy
- âŠ