234 research outputs found
Fast Compressed Automatic Target Recognition for a Compressive Infrared Imager
Many military systems utilize infrared sensors which allow an operator to see targets at night. Several of these are either mid-wave or long-wave high resolution infrared sensors, which are expensive to manufacture. But compressive sensing, which has primarily been demonstrated in medical applications, can be used to minimize the number of measurements needed to represent a high-resolution image. Using these techniques, a relatively low cost mid-wave infrared sensor can be realized which has a high effective resolution. In traditional military infrared sensing applications, like targeting systems, automatic targeting recognition algorithms are employed to locate and identify targets of interest to reduce the burden on the operator. The resolution of the sensor can increase the accuracy and operational range of a targeting system. When using a compressive sensing infrared sensor, traditional decompression techniques can be applied to form a spatial-domain infrared image, but most are iterative and not ideal for real-time environments. A more efficient method is to adapt the target recognition algorithms to operate directly on the compressed samples. In this work, we will present a target recognition algorithm which utilizes a compressed target detection method to identify potential target areas and then a specialized target recognition technique that operates directly on the same compressed samples. We will demonstrate our method on the U.S. Army Night Vision and Electronic Sensors Directorate ATR Algorithm Development Image Database which has been made available by the Sensing Information Analysis Center
One-Stage Cascade Refinement Networks for Infrared Small Target Detection
Single-frame InfraRed Small Target (SIRST) detection has been a challenging
task due to a lack of inherent characteristics, imprecise bounding box
regression, a scarcity of real-world datasets, and sensitive localization
evaluation. In this paper, we propose a comprehensive solution to these
challenges. First, we find that the existing anchor-free label assignment
method is prone to mislabeling small targets as background, leading to their
omission by detectors. To overcome this issue, we propose an all-scale
pseudo-box-based label assignment scheme that relaxes the constraints on scale
and decouples the spatial assignment from the size of the ground-truth target.
Second, motivated by the structured prior of feature pyramids, we introduce the
one-stage cascade refinement network (OSCAR), which uses the high-level head as
soft proposals for the low-level refinement head. This allows OSCAR to process
the same target in a cascade coarse-to-fine manner. Finally, we present a new
research benchmark for infrared small target detection, consisting of the
SIRST-V2 dataset of real-world, high-resolution single-frame targets, the
normalized contrast evaluation metric, and the DeepInfrared toolkit for
detection. We conduct extensive ablation studies to evaluate the components of
OSCAR and compare its performance to state-of-the-art model-driven and
data-driven methods on the SIRST-V2 benchmark. Our results demonstrate that a
top-down cascade refinement framework can improve the accuracy of infrared
small target detection without sacrificing efficiency. The DeepInfrared
toolkit, dataset, and trained models are available at
https://github.com/YimianDai/open-deepinfrared to advance further research in
this field.Comment: Submitted to TGR
An Adaptive Spatial-Temporal Local Feature Difference Method for Infrared Small-moving Target Detection
Detecting small moving targets accurately in infrared (IR) image sequences is
a significant challenge. To address this problem, we propose a novel method
called spatial-temporal local feature difference (STLFD) with adaptive
background suppression (ABS). Our approach utilizes filters in the spatial and
temporal domains and performs pixel-level ABS on the output to enhance the
contrast between the target and the background. The proposed method comprises
three steps. First, we obtain three temporal frame images based on the current
frame image and extract two feature maps using the designed spatial domain and
temporal domain filters. Next, we fuse the information of the spatial domain
and temporal domain to produce the spatial-temporal feature maps and suppress
noise using our pixel-level ABS module. Finally, we obtain the segmented binary
map by applying a threshold. Our experimental results demonstrate that the
proposed method outperforms existing state-of-the-art methods for infrared
small-moving target detection
Click on Mask: A Labor-efficient Annotation Framework with Level Set for Infrared Small Target Detection
Infrared Small Target Detection is a challenging task to separate small
targets from infrared clutter background. Recently, deep learning paradigms
have achieved promising results. However, these data-driven methods need plenty
of manual annotation. Due to the small size of infrared targets, manual
annotation consumes more resources and restricts the development of this field.
This letter proposed a labor-efficient and cursory annotation framework with
level set, which obtains a high-quality pseudo mask with only one cursory
click. A variational level set formulation with an expectation difference
energy functional is designed, in which the zero level contour is intrinsically
maintained during the level set evolution. It solves the issue that zero level
contour disappearing due to small target size and excessive regularization.
Experiments on the NUAA-SIRST and IRSTD-1k datasets reveal that our approach
achieves superior performance. Code is available at
https://github.com/Li-Haoqing/COM.Comment: 4 pages, 5 figures, references adde
ILNet: Low-level Matters for Salient Infrared Small Target Detection
Infrared small target detection is a technique for finding small targets from
infrared clutter background. Due to the dearth of high-level semantic
information, small infrared target features are weakened in the deep layers of
the CNN, which underachieves the CNN's representation ability. To address the
above problem, in this paper, we propose an infrared low-level network (ILNet)
that considers infrared small targets as salient areas with little semantic
information. Unlike other SOTA methods, ILNet pays greater attention to
low-level information instead of treating them equally. A new lightweight
feature fusion module, named Interactive Polarized Orthogonal Fusion module
(IPOF), is proposed, which integrates more important low-level features from
the shallow layers into the deep layers. A Dynamic One-Dimensional Aggregation
layers (DODA) are inserted into the IPOF, to dynamically adjust the aggregation
of low dimensional information according to the number of input channels. In
addition, the idea of ensemble learning is used to design a Representative
Block (RB) to dynamically allocate weights for shallow and deep layers.
Experimental results on the challenging NUAA-SIRST (78.22% nIoU and 1.33e-6 Fa)
and IRSTD-1K (68.91% nIoU and 3.23e-6 Fa) dataset demonstrate that the proposed
ILNet can get better performances than other SOTA methods. Moreover, ILNet can
obtain a greater improvement with the increasement of data volume. Training
code are available at https://github.com/Li-Haoqing/ILNet
Monte Carlo Linear Clustering with Single-Point Supervision is Enough for Infrared Small Target Detection
Single-frame infrared small target (SIRST) detection aims at separating small
targets from clutter backgrounds on infrared images. Recently, deep learning
based methods have achieved promising performance on SIRST detection, but at
the cost of a large amount of training data with expensive pixel-level
annotations. To reduce the annotation burden, we propose the first method to
achieve SIRST detection with single-point supervision. The core idea of this
work is to recover the per-pixel mask of each target from the given single
point label by using clustering approaches, which looks simple but is indeed
challenging since targets are always insalient and accompanied with background
clutters. To handle this issue, we introduce randomness to the clustering
process by adding noise to the input images, and then obtain much more reliable
pseudo masks by averaging the clustered results. Thanks to this "Monte Carlo"
clustering approach, our method can accurately recover pseudo masks and thus
turn arbitrary fully supervised SIRST detection networks into weakly supervised
ones with only single point annotation. Experiments on four datasets
demonstrate that our method can be applied to existing SIRST detection networks
to achieve comparable performance with their fully supervised counterparts,
which reveals that single-point supervision is strong enough for SIRST
detection. Our code will be available at:
https://github.com/YeRen123455/SIRST-Single-Point-Supervision
Infrared small-target detection based on background-suppression proximal gradient and GPU acceleration
Patch-based methods improve the performance of infrared small target detection, transforming the detection problem into a Low-Rank Sparse Decomposition (LRSD) problem. However, two challenges hinder the success of these methods: (1) The interference from strong edges of the background, and (2) the time-consuming nature of solving the model. To tackle these two challenges, we propose a novel infrared small-target detection method using a Background-Suppression Proximal Gradient (BSPG) and GPU parallelism. We first propose a new continuation strategy to suppress the strong edges. This strategy enables the model to simultaneously consider heterogeneous components while dealing with low-rank backgrounds. Then, the Approximate Partial Singular Value Decomposition (APSVD) is presented to accelerate solution of the LRSD problem and further improve the solution accuracy. Finally, we implement our method on GPU using multi-threaded parallelism, in order to further enhance the computational efficiency of the model. The experimental results demonstrate that our method out-performs existing advanced methods, in terms of detection accuracy and execution time
- …