188 research outputs found
Neighbor Based Enhancement for the Long-Tail Ranking Problem in Video Rank Models
Rank models play a key role in industrial recommender systems, advertising,
and search engines. Existing works utilize semantic tags and user-item
interaction behaviors, e.g., clicks, views, etc., to predict the user interest
and the item hidden representation for estimating the user-item preference
score. However, these behavior-tag-based models encounter great challenges and
reduced effectiveness when user-item interaction activities are insufficient,
which we called "the long-tail ranking problem". Existing rank models ignore
this problem, but its common and important because any user or item can be
long-tailed once they are not consistently active for a short period. In this
paper, we propose a novel neighbor enhancement structure to help train the
representation of the target user or item. It takes advantage of similar
neighbors (static or dynamic similarity) with multi-level attention operations
balancing the weights of different neighbors. Experiments on the well-known
public dataset MovieLens 1M demonstrate the efficiency of the method over the
baseline behavior-tag-based model with an absolute CTR AUC gain of 0.0259 on
the long-tail user dataset.Comment: 5 page
Convolutional Sequence to Sequence Non-intrusive Load Monitoring
A convolutional sequence to sequence non-intrusive load monitoring model is
proposed in this paper. Gated linear unit convolutional layers are used to
extract information from the sequences of aggregate electricity consumption.
Residual blocks are also introduced to refine the output of the neural network.
The partially overlapped output sequences of the network are averaged to
produce the final output of the model. We apply the proposed model to the REDD
dataset and compare it with the convolutional sequence to point model in the
literature. Results show that the proposed model is able to give satisfactory
disaggregation performance for appliances with varied characteristics.Comment: This paper is submitted to IET-The Journal of Engineerin
HS-CAI: A Hybrid DCOP Algorithm via Combining Search with Context-based Inference
Search and inference are two main strategies for optimally solving
Distributed Constraint Optimization Problems (DCOPs). Recently, several
algorithms were proposed to combine their advantages. Unfortunately, such
algorithms only use an approximated inference as a one-shot preprocessing phase
to construct the initial lower bounds which lead to inefficient pruning under
the limited memory budget. On the other hand, iterative inference algorithms
(e.g., MB-DPOP) perform a context-based complete inference for all possible
contexts but suffer from tremendous traffic overheads. In this paper,
hybridizing search with context-based inference, we propose a complete
algorithm for DCOPs, named {HS-CAI} where the inference utilizes the contexts
derived from the search process to establish tight lower bounds while the
search uses such bounds for efficient pruning and thereby reduces contexts for
the inference. Furthermore, we introduce a context evaluation mechanism
to select the context patterns for the inference to further reduce the
overheads incurred by iterative inferences. Finally, we prove the
correctness of our algorithm and the experimental results demonstrate its
superiority over the state-of-the-art
Real-Time Vehicle Detection from Short-range Aerial Image with Compressed MobileNet
Vehicle detection from short-range aerial image faces challenges including vehicle blocking, irrelevant object interference, motion blurring, color variation etc., leading to the difficulty to achieve high detection accuracy and real-time detection speed. In this paper, benefiting from the recent development in MobileNet family network engineering, we propose a compressed MobileNet which is not only internally resistant to the above listed challenges but also gains the best detection accuracy/speed tradeoff when comparing with the original MobileNet. In a nutshell, we reduce the bottleneck architecture number during the feature map downsampling stage but add more bottlenecks during the feature map plateau stage, neither extra FLOPs nor parameters are thus involved but reduced inference time and better accuracy are expected. We conduct experiment on our collected 5-k short-range aerial images, containing six vehicle categories: truck, car, bus, bicycle, motorcycle, crowded bicycles and crowded motorcycles. Our proposed compressed MobileNet achieves 110 FPS (GPU), 31 FPS (CPU) and 15 FPS (mobile phone), 1.2 times faster and 2% more accurate (mAP) than the original MobileNet
Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks
To reduce the reliance on large-scale datasets, recent works in 3D
segmentation resort to few-shot learning. Current 3D few-shot semantic
segmentation methods first pre-train the models on `seen' classes, and then
evaluate their generalization performance on `unseen' classes. However, the
prior pre-training stage not only introduces excessive time overhead, but also
incurs a significant domain gap on `unseen' classes. To tackle these issues, we
propose an efficient Training-free Few-shot 3D Segmentation netwrok, TFS3D, and
a further training-based variant, TFS3D-T. Without any learnable parameters,
TFS3D extracts dense representations by trigonometric positional encodings, and
achieves comparable performance to previous training-based methods. Due to the
elimination of pre-training, TFS3D can alleviate the domain gap issue and save
a substantial amount of time. Building upon TFS3D, TFS3D-T only requires to
train a lightweight query-support transferring attention (QUEST), which
enhances the interaction between the few-shot query and support data.
Experiments demonstrate TFS3D-T improves previous state-of-the-art methods by
+6.93% and +17.96% mIoU respectively on S3DIS and ScanNet, while reducing the
training time by -90%, indicating superior effectiveness and efficiency.Comment: Code is available at https://github.com/yangyangyang127/TFS3
- …