
    Context Autoencoder for Self-Supervised Representation Learning

    We present a novel masked image modeling (MIM) approach, the context autoencoder (CAE), for self-supervised representation pretraining. The goal is to pretrain an encoder by solving the pretext task: estimating the masked patches from the visible patches of an image. Our approach first feeds the visible patches into the encoder to extract their representations. Then, we predict the representations of the masked patches from those of the visible patches, in the encoded representation space. We introduce an alignment constraint encouraging the representations for masked patches, predicted from the encoded representations of visible patches, to match the masked-patch representations computed directly by the encoder. In other words, the predicted representations are expected to lie in the encoded representation space, which empirically benefits representation learning. Finally, the predicted masked-patch representations are mapped to the targets of the pretext task through a decoder. In contrast to previous MIM methods (e.g., BEiT) that couple the encoding and pretext-task-completion roles, our approach benefits from separating the representation learning (encoding) role from the pretext task completion role, which improves representation learning capacity and accordingly yields larger gains on downstream tasks. In addition, we offer explanations of why contrastive pretraining and supervised pretraining perform similarly and why MIM can potentially perform better. We demonstrate the effectiveness of CAE through superior transfer performance on downstream tasks: semantic segmentation, object detection, and instance segmentation.
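
    As a concrete illustration of the pipeline described above (encode visible patches, regress masked-patch representations under an alignment constraint, then decode to the pretext targets), the following minimal PyTorch sketch may help. It is not the authors' implementation: the module choices, shapes, the use of MSE for both losses, and the omission of positional embeddings are simplifying assumptions made here purely for illustration (CAE's regressor and decoder are richer transformer modules).

        # Minimal, illustrative CAE-style pretraining step (hypothetical stand-in, not the released code).
        import torch
        import torch.nn as nn
        import torch.nn.functional as F

        class CAESketch(nn.Module):
            def __init__(self, dim=768, target_dim=768):
                super().__init__()
                layer = lambda: nn.TransformerEncoderLayer(dim, nhead=12, batch_first=True)
                self.encoder = nn.TransformerEncoder(layer(), num_layers=12)   # representation learning role
                self.regressor = nn.TransformerEncoder(layer(), num_layers=4)  # visible -> masked prediction
                self.decoder = nn.Linear(dim, target_dim)                      # pretext task completion role
                self.mask_query = nn.Parameter(torch.zeros(1, 1, dim))         # learned query per masked patch

            def forward(self, patches, visible_idx, masked_idx, targets):
                # 1) Encode only the visible patches (positional embeddings omitted for brevity).
                z_vis = self.encoder(patches[:, visible_idx])

                # 2) Predict masked-patch representations from the visible ones.
                b = patches.size(0)
                queries = self.mask_query.expand(b, masked_idx.numel(), -1)
                z_pred = self.regressor(torch.cat([z_vis, queries], dim=1))[:, z_vis.size(1):]

                # 3) Alignment constraint: predicted representations should match what the
                #    encoder itself computes on the masked patches (no gradient through the target).
                with torch.no_grad():
                    z_tgt = self.encoder(patches[:, masked_idx])
                align_loss = F.mse_loss(z_pred, z_tgt)

                # 4) Decode the predicted representations into the pretext targets
                #    (e.g., pixel values or discrete visual tokens).
                recon_loss = F.mse_loss(self.decoder(z_pred), targets)
                return align_loss + recon_loss

    Here, patches would be flattened and linearly embedded image patches of shape (batch, num_patches, dim), while visible_idx and masked_idx index a random partition of the patch positions.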

    Determination of the number of ψ(3686) events at BESIII

    The numbers of ψ(3686) events accumulated by the BESIII detector during the 2009 and 2012 data-taking periods are determined to be and , respectively, by counting inclusive hadronic events; the quoted uncertainties are systematic, and the statistical uncertainties are negligible. The number of events for the sample taken in 2009 is consistent with the previous measurement. The total number of ψ(3686) events for the two data-taking periods is
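
    For context, a determination by counting inclusive hadronic events typically follows a relation of the generic form below; the symbols are illustrative and are not taken from the paper.

        N_{\psi(3686)} = \frac{N_{\mathrm{had}}^{\mathrm{obs}} - N_{\mathrm{bkg}}}{\epsilon_{\mathrm{had}}}

    where N_had^obs is the number of selected inclusive hadronic events, N_bkg the estimated background (e.g., continuum events scaled from off-resonance data), and ε_had the selection efficiency for ψ(3686) decays to hadrons.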