Search CORE

7,669 research outputs found

Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection

Author: Bilen Hakan
Chen Shang-Fu
Chen Tianshui
Gong Yunchao
He Kaiming
Hu Hexiang
Lawrence Zitnick C.
Li Dong
Li Qiang
Lin Tsung-Yi
Liu Feng
Romero Adriana
Simonyan Karen
Tan Mingkui
Wang Jiang
Wang Zhouxia
Xavier Glorot
Xie Pengtao
Yang Hao
Yeh Chih-Kuan
Zagoruyko Sergey
Zhang Junjie
Zhu Feng
Zhu Yi
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 21/02/2019
Field of study

Multi-label image classification is a fundamental but challenging task towards general visual understanding. Existing methods found the region-level cues (e.g., features from RoIs) can facilitate multi-label classification. Nevertheless, such methods usually require laborious object-level annotations (i.e., object labels and bounding boxes) for effective learning of the object-level visual features. In this paper, we propose a novel and efficient deep framework to boost multi-label classification by distilling knowledge from weakly-supervised detection task without bounding box annotations. Specifically, given the image-level annotations, (1) we first develop a weakly-supervised detection (WSD) model, and then (2) construct an end-to-end multi-label image classification framework augmented by a knowledge distillation module that guides the classification model by the WSD model according to the class-level predictions for the whole image and the object-level visual features for object RoIs. The WSD model is the teacher model and the classification model is the student model. After this cross-task knowledge distillation, the performance of the classification model is significantly improved and the efficiency is maintained since the WSD model can be safely discarded in the test phase. Extensive experiments on two large-scale datasets (MS-COCO and NUS-WIDE) show that our framework achieves superior performances over the state-of-the-art methods on both performance and efficiency.Comment: accepted by ACM Multimedia 2018, 9 pages, 4 figures, 5 table

arXiv.org e-Print Archive

Crossref

Trash and recyclable material identification using convolutional neural networks (CNN)

Author: NC DOCKS at Western Carolina University
Sultana Rumana
Publication venue
Publication date: 01/01/2020
Field of study

The aim of this research is to improve municipal trash collection using image processing algorithms and deep learning technologies for detecting trash in public spaces. This research will help to improve trash management systems and create a smart city. Two Convolutional Neural Networks (CNN), both based on the AlexNet network architecture, were developed to search for trash objects in an image and separate recyclable items from the landfill trash objects, respectively. The two-stage CNN system was first trained and tested on the benchmark TrashNet indoor image dataset and achieved great performance to prove the concept. Then the system was trained and tested on outdoor images taken by the authors in the intended usage environment. Using the outdoor image dataset, the first CNN achieved a preliminary 93.6% accuracy to identify trash and non-trash items on an image database of assorted trash items. A second CNN was then trained to distinguish trash that will go to a landfill from the recyclable items with an accuracy ranging from 89.7% to 93.4% and overall, 92%. A future goal is to integrate this image processing-based trash identification system in a smart trashcan robot with a camera to take real-time photos that can detect and collect the trash all around it

The University of North Carolina at Greensboro

Exploring Object Relation in Mean Teacher for Cross-Domain Detection

Author: Cai Qi
Duan Lingyu
Ngo Chong-Wah
Pan Yingwei
Tian Xinmei
Yao Ting
Publication venue
Publication date: 01/06/2019
Field of study

Rendering synthetic data (e.g., 3D CAD-rendered images) to generate annotations for learning deep models in vision tasks has attracted increasing attention in recent years. However, simply applying the models learnt on synthetic images may lead to high generalization error on real images due to domain shift. To address this issue, recent progress in cross-domain recognition has featured the Mean Teacher, which directly simulates unsupervised domain adaptation as semi-supervised learning. The domain gap is thus naturally bridged with consistency regularization in a teacher-student scheme. In this work, we advance this Mean Teacher paradigm to be applicable for cross-domain detection. Specifically, we present Mean Teacher with Object Relations (MTOR) that novelly remolds Mean Teacher under the backbone of Faster R-CNN by integrating the object relations into the measure of consistency cost between teacher and student modules. Technically, MTOR firstly learns relational graphs that capture similarities between pairs of regions for teacher and student respectively. The whole architecture is then optimized with three consistency regularizations: 1) region-level consistency to align the region-level predictions between teacher and student, 2) inter-graph consistency for matching the graph structures between teacher and student, and 3) intra-graph consistency to enhance the similarity between regions of same class within the graph of student. Extensive experiments are conducted on the transfers across Cityscapes, Foggy Cityscapes, and SIM10k, and superior results are reported when comparing to state-of-the-art approaches. More remarkably, we obtain a new record of single model: 22.8% of mAP on Syn2Real detection dataset.Comment: CVPR 2019; The codes and model of our MTOR are publicly available at: https://github.com/caiqi/mean-teacher-cross-domain-detectio

arXiv.org e-Print Archive

Crossref

Institutional Knowledge at Singapore Management University

Generalized Anomaly Detection

Author: Li Yu
Luo Minwei
Singh Suresh
Publication venue
Publication date: 13/10/2022
Field of study

We study anomaly detection for the case when the normal class consists of more than one object category. This is an obvious generalization of the standard one-class anomaly detection problem. However, we show that jointly using multiple one-class anomaly detectors to solve this problem yields poorer results as compared to training a single one-class anomaly detector on all normal object categories together. We further develop a new anomaly detector called DeepMAD that learns compact distinguishing features by exploiting the multiple normal objects categories. This algorithm achieves higher AUC values for different datasets compared to two top performing one-class algorithms that either are trained on each normal object category or jointly trained on all normal object categories combined. In addition to theoretical results we present empirical results using the CIFAR-10, fMNIST, CIFAR-100, and a new dataset we developed called RECYCLE.Comment: 13 page

arXiv.org e-Print Archive

A Smart Management System For Garbage Classification Using Deep Learning

Author: Nagaraj et al.
Publication venue: Auricle Global Society of Education and Research
Publication date: 02/11/2023
Field of study

Thanks to the development of artificial intelligence (AI), the outdated trash system now offers better time monitoring and enables for better waste management. The purpose of this paper is to develop a smart sterile management system using a Tensor Flow-based deep learning model. In real time, it recognizes and categorizes items. Metal, plastic, and paper waste are separated from other sorts of trash in the bin's several divisions. Object detection and garbage classification are carried out using the Tensor Flow framework and a trained object recognition model. In order to create a frozen inference graph that can be used to recognize things using a camera, this trash detection model is trained on garbage photographs

International Journal on Recent and Innovation Trends in Computing and Communication