Search CORE

7,846 research outputs found

Target-Tailored Source-Transformation for Scene Graph Generation

Author: Lan Cuiling
Liao Wentong
Rosenhahn Bodo
Yang Michael Ying
Zeng Wenjun
Publication venue
Publication date: 27/05/2020
Field of study

Scene graph generation aims to provide a semantic and structural description of an image, denoting the objects (with nodes) and their relationships (with edges). The best performing works to date are based on exploiting the context surrounding objects or relations,e.g., by passing information among objects. In these approaches, to transform the representation of source objects is a critical process for extracting information for the use by target objects. In this work, we argue that a source object should give what tar-get object needs and give different objects different information rather than contributing common information to all targets. To achieve this goal, we propose a Target-TailoredSource-Transformation (TTST) method to efficiently propagate information among object proposals and relations. Particularly, for a source object proposal which will contribute information to other target objects, we transform the source object feature to the target object feature domain by simultaneously taking both the source and target into account. We further explore more powerful representations by integrating language prior with the visual context in the transformation for the scene graph generation. By doing so the target object is able to extract target-specific information from the source object and source relation accordingly to refine its representation. Our framework is validated on the Visual Genome bench-mark and demonstrated its state-of-the-art performance for the scene graph generation. The experimental results show that the performance of object detection and visual relation-ship detection are promoted mutually by our method

arXiv.org e-Print Archive

University of Twente Research Information

Weakly Supervised Localization using Deep Feature Maps

Author: Bency Archith J.
Karthikeyan S.
Kwon Heesung
Lee Hyungtae
Manjunath B. S.
Publication venue
Publication date: 01/01/2016
Field of study

Object localization is an important computer vision problem with a variety of applications. The lack of large scale object-level annotations and the relative abundance of image-level labels makes a compelling case for weak supervision in the object localization task. Deep Convolutional Neural Networks are a class of state-of-the-art methods for the related problem of object recognition. In this paper, we describe a novel object localization algorithm which uses classification networks trained on only image labels. This weakly supervised method leverages local spatial and semantic patterns captured in the convolutional layers of classification networks. We propose an efficient beam search based approach to detect and localize multiple objects in images. The proposed method significantly outperforms the state-of-the-art in standard object localization data-sets with a 8 point increase in mAP scores

arXiv.org e-Print Archive

Crossref

eScholarship - University of California

Weakly- and Semi-Supervised Panoptic Segmentation

Author: A Arnab
A Arnab
A Bearman
A Kolesnikov
B Hariharan
DA Forsyth
DP Papadopoulos
H Scudder
J Pont-Tuset
J Shi
J Uhrig
J Zhang
Mark Everingham
MM Cheng
T-Y Lin
Y Wei
Publication venue
Publication date: 01/01/2018
Field of study

We present a weakly supervised model that jointly performs both semantic- and instance-segmentation -- a particularly relevant problem given the substantial cost of obtaining pixel-perfect annotation for these tasks. In contrast to many popular instance segmentation approaches based on object detectors, our method does not predict any overlapping instances. Moreover, we are able to segment both "thing" and "stuff" classes, and thus explain all the pixels in the image. "Thing" classes are weakly-supervised with bounding boxes, and "stuff" with image-level tags. We obtain state-of-the-art results on Pascal VOC, for both full and weak supervision (which achieves about 95% of fully-supervised performance). Furthermore, we present the first weakly-supervised results on Cityscapes for both semantic- and instance-segmentation. Finally, we use our weakly supervised framework to analyse the relationship between annotation quality and predictive performance, which is of interest to dataset creators.Comment: ECCV 2018. The first two authors contributed equall

arXiv.org e-Print Archive

Crossref

Oxford University Research Archive