216,187 research outputs found
CornerFormer: Boosting Corner Representation for Fine-Grained Structured Reconstruction
Structured reconstruction is a non-trivial dense prediction problem, which
extracts structural information (\eg, building corners and edges) from a raster
image, then reconstructs it to a 2D planar graph accordingly. Compared with
common segmentation or detection problems, it significantly relays on the
capability that leveraging holistic geometric information for structural
reasoning. Current transformer-based approaches tackle this challenging problem
in a two-stage manner, which detect corners in the first model and classify the
proposed edges (corner-pairs) in the second model. However, they separate
two-stage into different models and only share the backbone encoder. Unlike the
existing modeling strategies, we present an enhanced corner representation
method: 1) It fuses knowledge between the corner detection and edge prediction
by sharing feature in different granularity; 2) Corner candidates are proposed
in four heatmap channels w.r.t its direction. Both qualitative and quantitative
evaluations demonstrate that our proposed method can better reconstruct
fine-grained structures, such as adjacent corners and tiny edges. Consequently,
it outperforms the state-of-the-art model by +1.9\%@F-1 on Corner and
+3.0\%@F-1 on Edge
RCDN -- Robust X-Corner Detection Algorithm based on Advanced CNN Model
Accurate detection and localization of X-corner on both planar and non-planar
patterns is a core step in robotics and machine vision. However, previous works
could not make a good balance between accuracy and robustness, which are both
crucial criteria to evaluate the detectors performance. To address this
problem, in this paper we present a novel detection algorithm which can
maintain high sub-pixel precision on inputs under multiple interference, such
as lens distortion, extreme poses and noise. The whole algorithm, adopting a
coarse-to-fine strategy, contains a X-corner detection network and three
post-processing techniques to distinguish the correct corner candidates, as
well as a mixed sub-pixel refinement technique and an improved region growth
strategy to recover the checkerboard pattern partially visible or occluded
automatically. Evaluations on real and synthetic images indicate that the
presented algorithm has the higher detection rate, sub-pixel accuracy and
robustness than other commonly used methods. Finally, experiments of camera
calibration and pose estimation verify it can also get smaller re-projection
error in quantitative comparisons to the state-of-the-art.Comment: 15 pages, 8 figures and 4 tables. Unpublished further research and
experiments of Checkerboard corner detection network CCDN (arXiv:2302.05097)
and application exploration for robust camera calibration
(https://ieeexplore.ieee.org/abstract/document/9428389
Feature Based Multi View Image Registration by Detecting the Feature with Fuzzy Logic for Corner Detection
This paper aim to Present accurate feature base registration by detecting the feature with Fuzzy logic for corner detection. Image registration is process used to match two or more partially overlapping image taken for example at different times ,from different sensors, or from different viewpoints and stitch these image into one panoramic image comprising whole scene. It is a fundamental image processing technique very useful in integrating information from different sensors, finding changes in image taken at different time, inferring three-dimensional information from stereo images and recognizing model-based objects. The paper presents a corner detection algorithm for feature detection which employs such fuzzy reasoning. The robustness of the proposed algorithm is compared to well-known conventional Harris corner detectors and its performance is also tested over a noise image.
DOI: 10.17762/ijritcc2321-8169.150616
A model-based machine vision system using fuzzy logic
AbstractAn effective model-based machine vision system is designed for use in practical production lines. In the proposed system, the gray level corner is selected as a local feature, and a gray level corner detector is developed. The gray level corner detection problem is formulated as a pattern classification problem to determine whether a pixel belongs to the class of corners or not. The probability density function is estimated by means of fuzzy logic. A corner matching method is developed to minimize the amount of calculation. All available information obtained from the gray level corner detector is used to make the model. From a fuzzy inference procedure, a matched segment list is extracted, and the resulted segment list is used to calculate the transformations between the model object and each object in the scene. In order to reduce the fuzzy rule set, a notion of overlapping cost is introduced. To show the effectiveness of the developed algorithm, simulations are conducted for synthetic images, and an experiment is conducted on an image of a real industrial component
Automatic Detection and Rectification of Paper Receipts on Smartphones
We describe the development of a real-time smartphone app that allows the
user to digitize paper receipts in a novel way by "waving" their phone over the
receipts and letting the app automatically detect and rectify the receipts for
subsequent text recognition.
We show that traditional computer vision algorithms for edge and corner
detection do not robustly detect the non-linear and discontinuous edges and
corners of a typical paper receipt in real-world settings. This is particularly
the case when the colors of the receipt and background are similar, or where
other interfering rectangular objects are present. Inaccurate detection of a
receipt's corner positions then results in distorted images when using an
affine projective transformation to rectify the perspective.
We propose an innovative solution to receipt corner detection by treating
each of the four corners as a unique "object", and training a Single Shot
Detection MobileNet object detection model. We use a small amount of real data
and a large amount of automatically generated synthetic data that is designed
to be similar to real-world imaging scenarios.
We show that our proposed method robustly detects the four corners of a
receipt, giving a receipt detection accuracy of 85.3% on real-world data,
compared to only 36.9% with a traditional edge detection-based approach. Our
method works even when the color of the receipt is virtually indistinguishable
from the background.
Moreover, our method is trained to detect only the corners of the central
target receipt and implicitly learns to ignore other receipts, and other
rectangular objects. Including synthetic data allows us to train an even better
model. These factors are a major advantage over traditional edge
detection-based approaches, allowing us to deliver a much better experience to
the user
SuperPoint: Self-Supervised Interest Point Detection and Description
This paper presents a self-supervised framework for training interest point
detectors and descriptors suitable for a large number of multiple-view geometry
problems in computer vision. As opposed to patch-based neural networks, our
fully-convolutional model operates on full-sized images and jointly computes
pixel-level interest point locations and associated descriptors in one forward
pass. We introduce Homographic Adaptation, a multi-scale, multi-homography
approach for boosting interest point detection repeatability and performing
cross-domain adaptation (e.g., synthetic-to-real). Our model, when trained on
the MS-COCO generic image dataset using Homographic Adaptation, is able to
repeatedly detect a much richer set of interest points than the initial
pre-adapted deep model and any other traditional corner detector. The final
system gives rise to state-of-the-art homography estimation results on HPatches
when compared to LIFT, SIFT and ORB.Comment: Camera-ready version for CVPR 2018 Deep Learning for Visual SLAM
Workshop (DL4VSLAM2018
Reflectance Intensity Assisted Automatic and Accurate Extrinsic Calibration of 3D LiDAR and Panoramic Camera Using a Printed Chessboard
This paper presents a novel method for fully automatic and convenient
extrinsic calibration of a 3D LiDAR and a panoramic camera with a normally
printed chessboard. The proposed method is based on the 3D corner estimation of
the chessboard from the sparse point cloud generated by one frame scan of the
LiDAR. To estimate the corners, we formulate a full-scale model of the
chessboard and fit it to the segmented 3D points of the chessboard. The model
is fitted by optimizing the cost function under constraints of correlation
between the reflectance intensity of laser and the color of the chessboard's
patterns. Powell's method is introduced for resolving the discontinuity problem
in optimization. The corners of the fitted model are considered as the 3D
corners of the chessboard. Once the corners of the chessboard in the 3D point
cloud are estimated, the extrinsic calibration of the two sensors is converted
to a 3D-2D matching problem. The corresponding 3D-2D points are used to
calculate the absolute pose of the two sensors with Unified Perspective-n-Point
(UPnP). Further, the calculated parameters are regarded as initial values and
are refined using the Levenberg-Marquardt method. The performance of the
proposed corner detection method from the 3D point cloud is evaluated using
simulations. The results of experiments, conducted on a Velodyne HDL-32e LiDAR
and a Ladybug3 camera under the proposed re-projection error metric,
qualitatively and quantitatively demonstrate the accuracy and stability of the
final extrinsic calibration parameters.Comment: 20 pages, submitted to the journal of Remote Sensin
- …