167 research outputs found
Electroweak Penguin Contributions in and decays Beyond Leading Logarithms
Using the next-to-leading order low energy effective Hamiltonian for , transitions, the contributions of electroweak
penguin operators in and decays
are estimated in the standard model. We find that, for some channels, the
electroweak penguin effects can enhance or reduce the QCD penguin and/or tree
level contributions by at least , and can even play dominant role.Comment: 12 pages, late
Rethinking Object Detection in Retail Stores
The convention standard for object detection uses a bounding box to represent
each individual object instance. However, it is not practical in the
industry-relevant applications in the context of warehouses due to severe
occlusions among groups of instances of the same categories. In this paper, we
propose a new task, ie, simultaneously object localization and counting,
abbreviated as Locount, which requires algorithms to localize groups of objects
of interest with the number of instances. However, there does not exist a
dataset or benchmark designed for such a task. To this end, we collect a
large-scale object localization and counting dataset with rich annotations in
retail stores, which consists of 50,394 images with more than 1.9 million
object instances in 140 categories. Together with this dataset, we provide a
new evaluation protocol and divide the training and testing subsets to fairly
evaluate the performance of algorithms for Locount, developing a new benchmark
for the Locount task. Moreover, we present a cascaded localization and counting
network as a strong baseline, which gradually classifies and regresses the
bounding boxes of objects with the predicted numbers of instances enclosed in
the bounding boxes, trained in an end-to-end manner. Extensive experiments are
conducted on the proposed dataset to demonstrate its significance and the
analysis discussions on failure cases are provided to indicate future
directions. Dataset is available at
https://isrc.iscas.ac.cn/gitlab/research/locount-dataset.Comment: Information Erro
Dual encoding for abstractive text summarization
Recurrent Neural Network (RNN) based sequence-to-sequence attentional models have proven effective in abstractive text summarization. In this paper, we model abstractive text summarization using a dual encoding model. Different from the previous works only using a single encoder, the proposed method employs a dual encoder including the primary and the secondary encoders. Specifically, the primary encoder conducts coarse encoding in a regular way, while the secondary encoder models the importance of words and generates more fine encoding based on the input raw text and the previously generated output text summarization. The two level encodings are combined and fed into the decoder to generate more diverse summary that can decrease repetition phenomenon for long sequence generation. The experimental results on two challenging datasets (i.e., CNN/DailyMail and DUC 2004) demonstrate that our dual encoding model performs against existing methods
Scale Invariant Fully Convolutional Network: Detecting Hands Efficiently
Existing hand detection methods usually follow the pipeline of multiple
stages with high computation cost, i.e., feature extraction, region proposal,
bounding box regression, and additional layers for rotated region detection. In
this paper, we propose a new Scale Invariant Fully Convolutional Network
(SIFCN) trained in an end-to-end fashion to detect hands efficiently.
Specifically, we merge the feature maps from high to low layers in an iterative
way, which handles different scales of hands better with less time overhead
comparing to concatenating them simply. Moreover, we develop the Complementary
Weighted Fusion (CWF) block to make full use of the distinctive features among
multiple layers to achieve scale invariance. To deal with rotated hand
detection, we present the rotation map to get rid of complex rotation and
derotation layers. Besides, we design the multi-scale loss scheme to accelerate
the training process significantly by adding supervision to the intermediate
layers of the network. Compared with the state-of-the-art methods, our
algorithm shows comparable accuracy and runs a 4.23 times faster speed on the
VIVA dataset and achieves better average precision on Oxford hand detection
dataset at a speed of 62.5 fps.Comment: Accepted to AAAI201
- …