178 research outputs found
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation
Recently, CLIP has been applied to pixel-level zero-shot learning tasks via a
two-stage scheme. The general idea is to first generate class-agnostic region
proposals and then feed the cropped proposal regions to CLIP to utilize its
image-level zero-shot classification capability. While effective, such a scheme
requires two image encoders, one for proposal generation and one for CLIP,
leading to a complicated pipeline and high computational cost. In this work, we
pursue a simpler-and-efficient one-stage solution that directly extends CLIP's
zero-shot prediction capability from image to pixel level. Our investigation
starts with a straightforward extension as our baseline that generates semantic
masks by comparing the similarity between text and patch embeddings extracted
from CLIP. However, such a paradigm could heavily overfit the seen classes and
fail to generalize to unseen classes. To handle this issue, we propose three
simple-but-effective designs and figure out that they can significantly retain
the inherent zero-shot capacity of CLIP and improve pixel-level generalization
ability. Incorporating those modifications leads to an efficient zero-shot
semantic segmentation system called ZegCLIP. Through extensive experiments on
three public benchmarks, ZegCLIP demonstrates superior performance,
outperforming the state-of-the-art methods by a large margin under both
"inductive" and "transductive" zero-shot settings. In addition, compared with
the two-stage method, our one-stage ZegCLIP achieves a speedup of about 5 times
faster during inference. We release the code at
https://github.com/ZiqinZhou66/ZegCLIP.git.Comment: 12 pages, 8 figure
LoS Sensing-based Channel Estimation in UAV-Assisted OFDM Systems
In unmanned aerial vehicle (UAV)-assisted orthogonal frequency division
multiplexing (OFDM) systems, the potential advantage of the line-of-sight (LoS)
path, characterized by its high probability of existence, has not been fully
harnessed, thereby impeding the improvement of channel estimation (CE)
accuracy. Inspired by the ideas of integrated sensing and communication (ISAC),
this letter develops a LoS sensing method aimed at detecting the presence of
LoS path. Leveraging the prior information obtained from LoS path detection,
the detection thresholds for resolvable paths are proposed for LoS and Non-LoS
(NLoS) scenarios, respectively. By employing these specifically designed
detection thresholds, denoising processing is applied to classical least square
(LS) CE, thereby improving the CE accuracy. Simulation results validate the
effectiveness of the proposed method in enhancing CE accuracy and demonstrate
its robustness against parameter variations
Secured green communication scheme for interference alignment based networks
In this paper, a new security and green communication scheme is proposed to the Interference-Alignment (IA) based networks. To achieve a secured communication, full-duplex receivers are utilized to transmit artificial noise (AN). Both the signals and the ANs are used to harvest energy to realize green communication. For these reasons, the feasible conditions of this scheme are analyzed first. Secondly, the average transmission rate, the secrecy performance and the harvested energy are investigated. Thirdly, an optimization scheme of simultaneous wireless information and power transfer (SWIPT) is given to optimize the information transmission and the energy harvesting efficiency. Meanwhile, an improved IA iteration algorithm is designed to eliminate both the AN and the interference. Furthermore, relay cooperation is considered and its system performance is analyzed. The simulations show that the target average transmission rate is not affected by AN, while the secrecy performance can be greatly improved. The energy harvesting efficiency is also better than the traditional schemes. As expected, the average transmission rate further is improved with the relay cooperation
- …