23 research outputs found
Context Autoencoder for Self-Supervised Representation Learning
We present a novel masked image modeling (MIM) approach, context autoencoder
(CAE), for self-supervised representation pretraining. The goal is to pretrain
an encoder by solving the pretext task: estimate the masked patches from the
visible patches in an image. Our approach first feeds the visible patches into
the encoder, extracting the representations. Then, we make predictions from
visible patches to masked patches in the encoded representation space. We
introduce an alignment constraint, encouraging that the representations for
masked patches, predicted from the encoded representations of visible patches,
are aligned with the masked patch presentations computed from the encoder. In
other words, the predicted representations are expected to lie in the encoded
representation space, which empirically shows the benefit to representation
learning. Last, the predicted masked patch representations are mapped to the
targets of the pretext task through a decoder. In comparison to previous MIM
methods (e.g., BEiT) that couple the encoding and pretext task completion
roles, our approach benefits the separation of the representation learning
(encoding) role and the pretext task completion role, improving the
representation learning capacity and accordingly helping more on downstream
tasks. In addition, we present the explanations about why contrastive
pretraining and supervised pretraining perform similarly and why MIM
potentially performs better. We demonstrate the effectiveness of our CAE
through superior transfer performance in downstream tasks: semantic
segmentation, and object detection and instance segmentation
Determination of the number of events at BESIII
The numbers of ψ(3686) events accumulated by the BESIII detector for the data taken during 2009 and 2012 are determined to be and , respectively, by counting inclusive hadronic events, where the uncertainties are systematic and the statistical uncertainties are negligible. The number of events for the sample taken in 2009 is consistent with that of the previous measurement. The total number of ψ(3686) events for the two data taking periods is