
    segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection

    In this paper, we propose an approach that exploits object segmentation in order to improve the accuracy of object detection. We frame the problem as inference in a Markov Random Field, in which each detection hypothesis scores object appearance as well as contextual information using Convolutional Neural Networks, and allows the hypothesis to choose and score a segment out of a large pool of accurate object segmentation proposals. This enables the detector to incorporate additional evidence when it is available and thus results in more accurate detections. Our experiments show an improvement of 4.1% in mAP over the R-CNN baseline on PASCAL VOC 2010, and 3.4% over the current state-of-the-art, demonstrating the power of our approach.
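    As a rough illustration of the scoring described above, the sketch below combines an appearance score, a context score, and the best-matching segment from a proposal pool into a single hypothesis score. The weights and toy inputs are placeholders, not the learned model from the paper.

```python
import numpy as np

def score_hypothesis(appearance, context, segment_scores,
                     w_app=1.0, w_ctx=0.5, w_seg=0.5):
    """Combine appearance and context scores with the best segment
    from a pool of segmentation proposals (illustrative weights)."""
    best_segment = segment_scores.max() if segment_scores.size else 0.0
    return w_app * appearance + w_ctx * context + w_seg * best_segment

# Toy usage: one detection hypothesis with three candidate segments.
print(score_hypothesis(0.9, 0.4, np.array([0.2, 0.7, 0.5])))
```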

    Improving Region-Based CNN Object Detector Using Bayesian Optimization

    Using Deep Neural Networks for object detection tasks has produced groundbreaking results on several object detection benchmarks. Although the trained models have high capacity and strong discriminative power, inaccurate localization remains a major source of error for these detection systems. In my work, I'm developing a sequential search algorithm based on Bayesian Optimization to propose better candidate bounding boxes for the objects of interest. The work focuses on formulating effective region proposal as an optimization problem and using a Bayesian Optimization algorithm as a black-box optimizer to solve it sequentially. The proposed algorithm demonstrates state-of-the-art performance on the PASCAL VOC 2007 benchmark under the standard localization requirements.
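    To make the formulation concrete, here is a minimal sketch of treating box proposal as black-box optimization with a Gaussian-process optimizer (scikit-optimize's gp_minimize). The detector_score function and the search bounds are hypothetical stand-ins; in the actual work the score would come from the trained CNN detector.

```python
# Requires scikit-optimize: pip install scikit-optimize
from skopt import gp_minimize

IMG_W, IMG_H = 640, 480

def detector_score(box):
    """Placeholder for the detector's confidence on box = (x, y, w, h)."""
    x, y, w, h = box
    cx, cy = x + w / 2, y + h / 2
    # Toy score peaking near the image centre; replace with a real CNN score.
    return -((cx - IMG_W / 2) ** 2 + (cy - IMG_H / 2) ** 2) / 1e5 + (w * h) / (IMG_W * IMG_H)

def objective(params):
    # gp_minimize minimizes, so negate the detection score.
    return -detector_score(params)

search_space = [
    (0, IMG_W // 2),   # x
    (0, IMG_H // 2),   # y
    (20, IMG_W // 2),  # width
    (20, IMG_H // 2),  # height
]
result = gp_minimize(objective, search_space, n_calls=25, random_state=0)
print("Best box:", result.x, "score:", -result.fun)
```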

    Scalable Content-Based Analysis of Images in Web Archives with TensorFlow and the Archives Unleashed Toolkit

    We demonstrate the integration of the Archives Unleashed Toolkit, a scalable platform for exploring web archives, with Google's TensorFlow deep learning toolkit to provide scholars with content-based image analysis capabilities. By applying pretrained deep neural networks for object detection, we are able to extract images of common objects from a 4TB web archive of GeoCities, which we then compile into browsable collages. This case study illustrates the types of interesting analyses enabled by combining big data and deep learning capabilities. This work was primarily supported by the Natural Sciences and Engineering Research Council of Canada. Additional funding for this project has come from the Andrew W. Mellon Foundation. Our sincerest thanks to the Internet Archive for providing us with the GeoCities web archive.
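    The kind of pipeline described, running a pretrained TensorFlow detector over images pulled out of a web archive, might look roughly like the sketch below. The TF Hub model URL, output keys, and the geocities_images directory are assumptions for illustration, not the authors' exact setup.

```python
import glob
import numpy as np
import tensorflow as tf
import tensorflow_hub as hub
from PIL import Image

# One commonly used pretrained detector on TF Hub (an assumption here).
detector = hub.load("https://tfhub.dev/tensorflow/ssd_mobilenet_v2/2")

def detect_objects(image_path, min_score=0.5):
    """Return (class_id, score) pairs for confident detections in one image."""
    img = np.array(Image.open(image_path).convert("RGB"))
    result = detector(tf.convert_to_tensor(img[np.newaxis, ...], dtype=tf.uint8))
    scores = result["detection_scores"][0].numpy()
    classes = result["detection_classes"][0].numpy().astype(int)
    keep = scores >= min_score
    return list(zip(classes[keep], scores[keep]))

# Images previously extracted from the WARC files, e.g. with the
# Archives Unleashed Toolkit; the directory name is illustrative.
for path in glob.glob("geocities_images/*.jpg"):
    for cls, score in detect_objects(path):
        print(path, cls, round(float(score), 2))
```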

    G-CAME: Gaussian-Class Activation Mapping Explainer for Object Detectors

    Nowadays, deep neural networks for object detection in images are very prevalent. However, due to the complexity of these networks, users find it hard to understand why a model detects particular objects. We propose the Gaussian Class Activation Mapping Explainer (G-CAME), which generates a saliency map as the explanation for object detection models. G-CAME can be considered a CAM-based method that uses the activation maps of selected layers combined with a Gaussian kernel to highlight the important regions in the image for the predicted box. Compared with other region-based methods, G-CAME largely avoids their time constraints, as it takes very little time to explain an object. We also evaluate our method qualitatively and quantitatively with YOLOX on the MS-COCO 2017 dataset and show how to apply G-CAME to the two-stage Faster-RCNN model.
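    A minimal numpy sketch of the core idea, a CAM-style weighted sum of one layer's activation maps masked by a Gaussian kernel centred on the predicted box so the explanation stays local to that detection, is shown below. The random activations and weights are placeholders; in practice they would come from the detector's chosen layer and its gradients.

```python
import numpy as np

def gaussian_cam(activations, weights, box_center, sigma=20.0, image_size=(320, 320)):
    """activations: (C, h, w) feature maps; weights: (C,) channel weights;
    box_center: (cx, cy) of the predicted box in image coordinates."""
    c, h, w = activations.shape
    cam = np.maximum((weights[:, None, None] * activations).sum(axis=0), 0)
    # Upsample to image size by simple repetition (bilinear in practice).
    cam = np.kron(cam, np.ones((image_size[0] // h, image_size[1] // w)))
    ys, xs = np.mgrid[0:image_size[0], 0:image_size[1]]
    gauss = np.exp(-((xs - box_center[0]) ** 2 + (ys - box_center[1]) ** 2) / (2 * sigma ** 2))
    saliency = cam * gauss
    return saliency / (saliency.max() + 1e-8)

# Toy usage with random feature maps for one predicted box.
sal = gaussian_cam(np.random.rand(16, 40, 40), np.random.rand(16), box_center=(160, 120))
print(sal.shape)  # (320, 320)
```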