262 research outputs found
RELAX: Reinforcement Learning Enabled 2D-LiDAR Autonomous System for Parsimonious UAVs
Unmanned Aerial Vehicles (UAVs) have become increasingly prominence in recent
years, finding applications in surveillance, package delivery, among many
others. Despite considerable efforts in developing algorithms that enable UAVs
to navigate through complex unknown environments autonomously, they often
require expensive hardware and sensors, such as RGB-D cameras and 3D-LiDAR,
leading to a persistent trade-off between performance and cost. To this end, we
propose RELAX, a novel end-to-end autonomous framework that is exceptionally
cost-efficient, requiring only a single 2D-LiDAR to enable UAVs operating in
unknown environments. Specifically, RELAX comprises three components: a
pre-processing map constructor; an offline mission planner; and a reinforcement
learning (RL)-based online re-planner. Experiments demonstrate that RELAX
offers more robust dynamic navigation compared to existing algorithms, while
only costing a fraction of the others. The code will be made public upon
acceptance
SportsMOT: A Large Multi-Object Tracking Dataset in Multiple Sports Scenes
Multi-object tracking in sports scenes plays a critical role in gathering
players statistics, supporting further analysis, such as automatic tactical
analysis. Yet existing MOT benchmarks cast little attention on the domain,
limiting its development. In this work, we present a new large-scale
multi-object tracking dataset in diverse sports scenes, coined as
\emph{SportsMOT}, where all players on the court are supposed to be tracked. It
consists of 240 video sequences, over 150K frames (almost 15\times MOT17) and
over 1.6M bounding boxes (3\times MOT17) collected from 3 sports categories,
including basketball, volleyball and football. Our dataset is characterized
with two key properties: 1) fast and variable-speed motion and 2) similar yet
distinguishable appearance. We expect SportsMOT to encourage the MOT trackers
to promote in both motion-based association and appearance-based association.
We benchmark several state-of-the-art trackers and reveal the key challenge of
SportsMOT lies in object association. To alleviate the issue, we further
propose a new multi-object tracking framework, termed as \emph{MixSort},
introducing a MixFormer-like structure as an auxiliary association model to
prevailing tracking-by-detection trackers. By integrating the customized
appearance-based association with the original motion-based association,
MixSort achieves state-of-the-art performance on SportsMOT and MOT17. Based on
MixSort, we give an in-depth analysis and provide some profound insights into
SportsMOT. The dataset and code will be available at
https://deeperaction.github.io/datasets/sportsmot.html
StableDrag: Stable Dragging for Point-based Image Editing
Point-based image editing has attracted remarkable attention since the
emergence of DragGAN. Recently, DragDiffusion further pushes forward the
generative quality via adapting this dragging technique to diffusion models.
Despite these great success, this dragging scheme exhibits two major drawbacks,
namely inaccurate point tracking and incomplete motion supervision, which may
result in unsatisfactory dragging outcomes. To tackle these issues, we build a
stable and precise drag-based editing framework, coined as StableDrag, by
designing a discirminative point tracking method and a confidence-based latent
enhancement strategy for motion supervision. The former allows us to precisely
locate the updated handle points, thereby boosting the stability of long-range
manipulation, while the latter is responsible for guaranteeing the optimized
latent as high-quality as possible across all the manipulation steps. Thanks to
these unique designs, we instantiate two types of image editing models
including StableDrag-GAN and StableDrag-Diff, which attains more stable
dragging performance, through extensive qualitative experiments and
quantitative assessment on DragBench
Self-Improvement Programming for Temporal Knowledge Graph Question Answering
Temporal Knowledge Graph Question Answering (TKGQA) aims to answer questions
with temporal intent over Temporal Knowledge Graphs (TKGs). The core challenge
of this task lies in understanding the complex semantic information regarding
multiple types of time constraints (e.g., before, first) in questions. Existing
end-to-end methods implicitly model the time constraints by learning time-aware
embeddings of questions and candidate answers, which is far from understanding
the question comprehensively. Motivated by semantic-parsing-based approaches
that explicitly model constraints in questions by generating logical forms with
symbolic operators, we design fundamental temporal operators for time
constraints and introduce a novel self-improvement Programming method for TKGQA
(Prog-TQA). Specifically, Prog-TQA leverages the in-context learning ability of
Large Language Models (LLMs) to understand the combinatory time constraints in
the questions and generate corresponding program drafts with a few examples
given. Then, it aligns these drafts to TKGs with the linking module and
subsequently executes them to generate the answers. To enhance the ability to
understand questions, Prog-TQA is further equipped with a self-improvement
strategy to effectively bootstrap LLMs using high-quality self-generated
drafts. Extensive experiments demonstrate the superiority of the proposed
Prog-TQA on MultiTQ and CronQuestions datasets, especially in the Hits@1
metric.Comment: Accepted by LREC-COLING 2024 (long paper
Possible Manifestation of a Non-Pointness of the Electron in Annihilation Reaction at Centre of Mass Energies 55-207 GeV
The experimental data from VENUS, TOPAS, OPAL, DELPHI, ALEPH and L3
collaborations, collected from 1989 to 2003, are applied to study the QED
framework through direct contact interaction terms approach, using the
annihilation reaction . The
analysis involves performing of a test to detect the presence of an
excited electron and evidence of non-point like behavior in the
annihilation zone. The results of the analysis indicate a strong
signal, with a confidence level of approximately , for the presence of
an excited electron with a mass of GeV, and a deviation from a
point-like behavior of the charge distribution of the electron. The radius of
this deviation is cm, which can be interpreted as
the size of the electron.Comment: 47 pages, 15 figures, 6 tables. The revised versio
- …