12 research outputs found
GeoSay: A Geometric Saliency for Extracting Buildings in Remote Sensing Images
Automatic extraction of buildings in remote sensing images is an important
but challenging task and finds many applications in different fields such as
urban planning, navigation and so on. This paper addresses the problem of
buildings extraction in very high-spatial-resolution (VHSR) remote sensing (RS)
images, whose spatial resolution is often up to half meters and provides rich
information about buildings. Based on the observation that buildings in VHSR-RS
images are always more distinguishable in geometry than in texture or spectral
domain, this paper proposes a geometric building index (GBI) for accurate
building extraction, by computing the geometric saliency from VHSR-RS images.
More precisely, given an image, the geometric saliency is derived from a
mid-level geometric representations based on meaningful junctions that can
locally describe geometrical structures of images. The resulting GBI is finally
measured by integrating the derived geometric saliency of buildings.
Experiments on three public and commonly used datasets demonstrate that the
proposed GBI achieves the state-of-the-art performance and shows impressive
generalization capability. Additionally, GBI preserves both the exact position
and accurate shape of single buildings compared to existing methods
Holistically-Attracted Wireframe Parsing
This paper presents a fast and parsimonious parsing method to accurately and
robustly detect a vectorized wireframe in an input image with a single forward
pass. The proposed method is end-to-end trainable, consisting of three
components: (i) line segment and junction proposal generation, (ii) line
segment and junction matching, and (iii) line segment and junction
verification. For computing line segment proposals, a novel exact dual
representation is proposed which exploits a parsimonious geometric
reparameterization for line segments and forms a holistic 4-dimensional
attraction field map for an input image. Junctions can be treated as the
"basins" in the attraction field. The proposed method is thus called
Holistically-Attracted Wireframe Parser (HAWP). In experiments, the proposed
method is tested on two benchmarks, the Wireframe dataset, and the YorkUrban
dataset. On both benchmarks, it obtains state-of-the-art performance in terms
of accuracy and efficiency. For example, on the Wireframe dataset, compared to
the previous state-of-the-art method L-CNN, it improves the challenging mean
structural average precision (msAP) by a large margin ( absolute
improvements) and achieves 29.5 FPS on single GPU ( relative
improvement). A systematic ablation study is performed to further justify the
proposed method.Comment: Accepted by CVPR 202
Adaptively Transforming Graph Matching
Recently, many graph matching methods that incorporate pairwise constraint
and that can be formulated as a quadratic assignment problem (QAP) have been
proposed. Although these methods demonstrate promising results for the graph
matching problem, they have high complexity in space or time. In this paper, we
introduce an adaptively transforming graph matching (ATGM) method from the
perspective of functional representation. More precisely, under a
transformation formulation, we aim to match two graphs by minimizing the
discrepancy between the original graph and the transformed graph. With a linear
representation map of the transformation, the pairwise edge attributes of
graphs are explicitly represented by unary node attributes, which enables us to
reduce the space and time complexity significantly. Due to an efficient
Frank-Wolfe method-based optimization strategy, we can handle graphs with
hundreds and thousands of nodes within an acceptable amount of time. Meanwhile,
because transformation map can preserve graph structures, a domain
adaptation-based strategy is proposed to remove the outliers. The experimental
results demonstrate that our proposed method outperforms the state-of-the-art
graph matching algorithms
Learning Regional Attraction for Line Segment Detection
This paper presents regional attraction of line segment maps, and hereby
poses the problem of line segment detection (LSD) as a problem of region
coloring. Given a line segment map, the proposed regional attraction first
establishes the relationship between line segments and regions in the image
lattice. Based on this, the line segment map is equivalently transformed to an
attraction field map (AFM), which can be remapped to a set of line segments
without loss of information. Accordingly, we develop an end-to-end framework to
learn attraction field maps for raw input images, followed by a squeeze module
to detect line segments. Apart from existing works, the proposed detector
properly handles the local ambiguity and does not rely on the accurate
identification of edge pixels. Comprehensive experiments on the Wireframe
dataset and the YorkUrban dataset demonstrate the superiority of our method. In
particular, we achieve an F-measure of 0.831 on the Wireframe dataset,
advancing the state-of-the-art performance by 10.3 percent.Comment: Accepted to IEEE TPAMI. arXiv admin note: text overlap with
arXiv:1812.0212