410 research outputs found
On -dimensional neither pointed nor semisimple Hopf algebras and the associated weak Hopf algebras
For a class of neither pointed nor semisimple Hopf algebras of
dimension , it is shown that they are quasi-triangular, which universal
-matrices are described. The corresponding weak Hopf algebras
and their representations are constructed. Finally, their
duality and their Green rings are established by generators and relations
explicitly. It turns out that the Green rings of the associated weak Hopf
algebras are not commutative even if the Green rings of are
commutative.Comment: 18 page
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation
Panoramic videos contain richer spatial information and have attracted
tremendous amounts of attention due to their exceptional experience in some
fields such as autonomous driving and virtual reality. However, existing
datasets for video segmentation only focus on conventional planar images. To
address the challenge, in this paper, we present a panoramic video dataset,
PanoVOS. The dataset provides 150 videos with high video resolutions and
diverse motions. To quantify the domain gap between 2D planar videos and
panoramic videos, we evaluate 15 off-the-shelf video object segmentation (VOS)
models on PanoVOS. Through error analysis, we found that all of them fail to
tackle pixel-level content discontinues of panoramic videos. Thus, we present a
Panoramic Space Consistency Transformer (PSCFormer), which can effectively
utilize the semantic boundary information of the previous frame for pixel-level
matching with the current frame. Extensive experiments demonstrate that
compared with the previous SOTA models, our PSCFormer network exhibits a great
advantage in terms of segmentation results under the panoramic setting. Our
dataset poses new challenges in panoramic VOS and we hope that our PanoVOS can
advance the development of panoramic segmentation/tracking
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection
Open-vocabulary object detection (OVOD) aims to detect the objects beyond the
set of classes observed during training. This work introduces a straightforward
and efficient strategy that utilizes pre-trained vision-language models (VLM),
like CLIP, to identify potential novel classes through zero-shot
classification. Previous methods use a class-agnostic region proposal network
to detect object proposals and consider the proposals that do not match the
ground truth as background. Unlike these methods, our method will select a
subset of proposals that will be considered as background during the training.
Then, we treat them as novel classes during training. We refer to this approach
as the self-training strategy, which enhances recall and accuracy for novel
classes without requiring extra annotations, datasets, and re-training.
Compared to previous pseudo methods, our approach does not require re-training
and offline labeling processing, which is more efficient and effective in
one-shot training. Empirical evaluations on three datasets, including LVIS,
V3Det, and COCO, demonstrate significant improvements over the baseline
performance without incurring additional parameters or computational costs
during inference. In addition, we also apply our method to various baselines.
In particular, compared with the previous method, F-VLM, our method achieves a
1.7% improvement on the LVIS dataset. Combined with the recent method CLIPSelf,
our method also achieves 46.7 novel class AP on COCO without introducing extra
data for pertaining. We also achieve over 6.5% improvement over the F-VLM
baseline in the recent challenging V3Det dataset. We release our code and
models at https://github.com/xushilin1/dst-det
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection
Grounding-DINO is a state-of-the-art open-set detection model that tackles
multiple vision tasks including Open-Vocabulary Detection (OVD), Phrase
Grounding (PG), and Referring Expression Comprehension (REC). Its effectiveness
has led to its widespread adoption as a mainstream architecture for various
downstream applications. However, despite its significance, the original
Grounding-DINO model lacks comprehensive public technical details due to the
unavailability of its training code. To bridge this gap, we present
MM-Grounding-DINO, an open-source, comprehensive, and user-friendly baseline,
which is built with the MMDetection toolbox. It adopts abundant vision datasets
for pre-training and various detection and grounding datasets for fine-tuning.
We give a comprehensive analysis of each reported result and detailed settings
for reproduction. The extensive experiments on the benchmarks mentioned
demonstrate that our MM-Grounding-DINO-Tiny outperforms the Grounding-DINO-Tiny
baseline. We release all our models to the research community. Codes and
trained models are released at
https://github.com/open-mmlab/mmdetection/tree/main/configs/mm_grounding_dino.Comment: 10 pages, 6 figure
Recommended from our members
Electron dynamics in laser-driven atoms near the continuum threshold
Strong-field ionization and Rydberg-state excitation (RSE) near the continuum threshold exhibit two phenomena that have attracted a lot of recent attention: the low-energy structure (LES) just above and frustrated tunneling ionization just below the threshold. The former becomes apparent for longer laser wavelengths, while the latter has been especially investigated in the near infrared; both have been treated as separate phenomena so far. Here we present a unified perspective based on electron trajectories, which emphasizes the very important role of the electron-ion Coulomb interaction as expected in this energy region. Namely, those trajectories that generate the LES can also be recaptured into a Rydberg state. The coherent superposition of the contributions of such trajectories with different travel times (each generating one of the various LES peaks) causes an oscillation in the intensity dependence of the RSE yield, which is especially noticeable for longer wavelengths. The theory is illustrated by RSE experiments at 1800 nm, which agree very well with the theory with respect to position and period of the oscillation. The wavelength scaling of the RSE oscillation is also discussed. Our work establishes a solid relationship between processes below and above the threshold and sheds new light on atomic dynamics driven by intense laser fields in this critical energy region
- …