IOD-CNN: Integrating Object Detection Networks for Event Recognition

Author: Doermann David
Eum Sungmin
Kwon Heesung
Lee Hyungtae
Publication date: 21/03/2017

Many previous methods have shown the importance of considering semantically relevant objects when performing event recognition, yet none of them has exploited the power of deep convolutional neural networks to integrate relevant object information directly into a unified network. We present a novel unified deep CNN architecture that integrates architecturally different, yet semantically related, object detection networks to improve performance on the event recognition task. Our architecture allows the convolutional layers and a fully connected layer to be shared, which effectively integrates event recognition, rigid object detection and non-rigid object detection.

Comment: submitted to IEEE International Conference on Image Processing 201
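The layer sharing described in this abstract can be illustrated with a minimal sketch. It is not the authors' architecture: a single dense layer stands in for the shared convolutional and fully connected layers, and the class name `SharedTrunkMultiHead`, the head names, and all dimensions are hypothetical choices made only for illustration.

```python
import numpy as np


def relu(z):
    """Element-wise rectified linear unit."""
    return np.maximum(z, 0.0)


class SharedTrunkMultiHead:
    """Toy multi-task network: one shared layer feeding several task heads.

    Assumption: the shared dense layer is a stand-in for the shared
    convolutional and fully connected layers mentioned in the abstract.
    """

    def __init__(self, in_dim, hidden_dim, head_dims, seed=0):
        rng = np.random.default_rng(seed)
        # Weights used by every task (the shared part of the network).
        self.w_shared = rng.normal(0.0, 0.1, (in_dim, hidden_dim))
        # One independent output layer per task (hypothetical task names).
        self.heads = {
            name: rng.normal(0.0, 0.1, (hidden_dim, out_dim))
            for name, out_dim in head_dims.items()
        }

    def forward(self, x):
        # The shared features are computed once and reused by each head.
        h = relu(x @ self.w_shared)
        return {name: h @ w for name, w in self.heads.items()}


model = SharedTrunkMultiHead(
    in_dim=8,
    hidden_dim=4,
    head_dims={"event": 3, "rigid_object": 5, "nonrigid_object": 2},
)
outputs = model.forward(np.ones((2, 8)))
for task, scores in outputs.items():
    print(task, scores.shape)
```

Because the shared features are computed once per input, a training signal from any one head would update the weights that all three tasks depend on, which is the general idea behind integrating the tasks in one network.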

arXiv.org e-Print Archive

Crossref

Project RISE: Recognizing Industrial Smoke Emissions

Author: Dille Paul
Hoffman Ryan
Hsu Yen-Chia
Hu Ting-Yao
Huang Ting-Hao 'Kenneth'
Nourbakhsh Illah
Pachuta Jessica
Prendi Sean
Sargent Randy
Tsuhlares Anastasia
Publication date: 14/09/2020

Industrial smoke emissions pose a significant concern to human health. Prior work has shown that using Computer Vision (CV) techniques to identify smoke as visual evidence can influence the attitude of regulators and empower citizens to pursue environmental justice. However, existing datasets have neither the quality nor the quantity needed to train the robust CV models that air quality advocacy requires. We introduce RISE, the first large-scale video dataset for Recognizing Industrial Smoke Emissions. We adopted a citizen science approach, collaborating with local community members to annotate whether a video clip contains smoke emissions. Our dataset contains 12,567 clips from 19 distinct views from cameras that monitored three industrial facilities. These daytime clips span 30 days over two years, including all four seasons. We ran experiments using deep neural networks to establish a strong performance baseline and reveal smoke recognition challenges. Our survey study discussed community feedback, and our data analysis revealed opportunities for integrating citizen scientists and crowd workers into the application of Artificial Intelligence for social good.

Comment: Technical report

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Convolutional Neural Network on Three Orthogonal Planes for Dynamic Texture Classification

Author: Andrearczyk Vincent
Whelan Paul F.
Publication date: 16/03/2017

Dynamic Textures (DTs) are sequences of images of moving scenes, such as smoke, vegetation and fire, that exhibit certain stationarity properties in time. The analysis of DTs is important for recognition, segmentation, synthesis and retrieval in a range of applications including surveillance, medical imaging and remote sensing. Deep learning methods have shown impressive results and are now the state of the art for a wide range of computer vision tasks, including image and video recognition and segmentation. In particular, Convolutional Neural Networks (CNNs) have recently proven to be well suited to texture analysis, with a design similar to a filter bank approach. In this paper, we develop a new approach to DT analysis based on a CNN method applied on three orthogonal planes: xy, xt and yt. We train CNNs on spatial frames and temporal slices extracted from the DT sequences and combine their outputs to obtain a competitive DT classifier. Our results on a wide range of commonly used DT classification benchmark datasets demonstrate the robustness of our approach. A significant improvement over the state of the art is shown on the larger datasets.

Comment: 19 pages, 10 figures
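As a rough illustration of the three orthogonal planes mentioned in this abstract, the sketch below extracts one xy, one xt and one yt slice from a grayscale video volume and combines per-plane class scores. The function names, the (T, H, W) array layout, and the choice of averaging the scores are assumptions made for illustration; the abstract does not say how the outputs are combined.

```python
import numpy as np


def orthogonal_plane_slices(video, t=None, y=None, x=None):
    """Return one xy, xt and yt slice of a (T, H, W) grayscale volume.

    Slice positions default to the middle of each axis (an assumption).
    """
    n_frames, height, width = video.shape
    t = n_frames // 2 if t is None else t
    y = height // 2 if y is None else y
    x = width // 2 if x is None else x
    xy = video[t, :, :]  # spatial frame at time t, shape (H, W)
    xt = video[:, y, :]  # one image row followed over time, shape (T, W)
    yt = video[:, :, x]  # one image column followed over time, shape (T, H)
    return xy, xt, yt


def combine_scores(scores_xy, scores_xt, scores_yt):
    """Average the per-plane class scores and return the winning class index.

    Averaging is a hypothetical fusion rule, not taken from the abstract.
    """
    stacked = np.stack([scores_xy, scores_xt, scores_yt])
    return int(np.argmax(stacked.mean(axis=0)))


# Synthetic volume: 4 frames of 5 x 6 pixels.
video = np.arange(4 * 5 * 6, dtype=float).reshape(4, 5, 6)
xy, xt, yt = orthogonal_plane_slices(video)
print(xy.shape, xt.shape, yt.shape)
```

In a full pipeline each slice type would be fed to its own classifier, and `combine_scores` would receive the three score vectors those classifiers produce for a sequence.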

arXiv.org e-Print Archive

Irish Universities

DCU Online Research Access Service