410 research outputs found

    ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training

    Full text link
    Recent Vision-Language Pre-trained (VLP) models based on dual encoder have attracted extensive attention from academia and industry due to their superior performance on various cross-modal tasks and high computational efficiency. They attempt to learn cross-modal representation using contrastive learning on image-text pairs, however, the built inter-modal correlations only rely on a single view for each modality. Actually, an image or a text contains various potential views, just as humans could capture a real-world scene via diverse descriptions or photos. In this paper, we propose ERNIE-ViL 2.0, a Multi-View Contrastive learning framework to build intra-modal and inter-modal correlations between diverse views simultaneously, aiming at learning a more robust cross-modal representation. Specifically, we construct multiple views within each modality to learn the intra-modal correlation for enhancing the single-modal representation. Besides the inherent visual/textual views, we construct sequences of object tags as a special textual view to narrow the cross-modal semantic gap on noisy image-text pairs. Pre-trained with 29M publicly available datasets, ERNIE-ViL 2.0 achieves competitive results on English cross-modal retrieval. Additionally, to generalize our method to Chinese cross-modal tasks, we train ERNIE-ViL 2.0 through scaling up the pre-training datasets to 1.5B Chinese image-text pairs, resulting in significant improvements compared to previous SOTA results on Chinese cross-modal retrieval. We release our pre-trained models in https://github.com/PaddlePaddle/ERNIE.Comment: 14 pages, 6 figure

    Boosting Studies of Multi-Agent Reinforcement Learning on Google Research Football Environment: the Past, Present, and Future

    Full text link
    Even though Google Research Football (GRF) was initially benchmarked and studied as a single-agent environment in its original paper, recent years have witnessed an increasing focus on its multi-agent nature by researchers utilizing it as a testbed for Multi-Agent Reinforcement Learning (MARL). However, the absence of standardized environment settings and unified evaluation metrics for multi-agent scenarios hampers the consistent understanding of various studies. Furthermore, the challenging 5-vs-5 and 11-vs-11 full-game scenarios have received limited thorough examination due to their substantial training complexities. To address these gaps, this paper extends the original environment by not only standardizing the environment settings and benchmarking cooperative learning algorithms across different scenarios, including the most challenging full-game scenarios, but also by discussing approaches to enhance football AI from diverse perspectives and introducing related research tools. Specifically, we provide a distributed and asynchronous population-based self-play framework with diverse pre-trained policies for faster training, two football-specific analytical tools for deeper investigation, and an online leaderboard for broader evaluation. The overall expectation of this work is to advance the study of Multi-Agent Reinforcement Learning on Google Research Football environment, with the ultimate goal of benefiting real-world sports beyond virtual games

    Skyrmion-Bubble Bundles in an X-type Sr2Co2Fe28O46 Hexaferrite above Room Temperature

    Full text link
    Magnetic skyrmions are spin swirls that possess topological nontriviality and are considered particle-like entities. They are distinguished by an integer topological charge Q. The presence of skyrmion bundles provides an opportunity to explore the range of values for Q, which is crucial for the advancement of topological spintronic devices with multi-Q properties. In this study, we present a new material candidate, Sr2Co2Fe28O46 hexaferrite of the X-type, which hosts small dipolar skyrmions at room temperature and above. By exploiting reversed magnetic fields from metastable skyrmion bubbles at zero fields, we can incorporate skyrmion-bubble bundles with different interior skyrmion/bubble numbers, topological charges, and morphologies at room temperature. Our experimental findings are consistently supported by micromagnetic simulations. Our results highlight the versatility of topological spin textures in centrosymmetric uniaxial magnets, thereby paving the way for the development of room-temperature topological spintronic devices with multi-Q characteristics.Comment: https://doi.org/10.1002/adma.20230611
    • …
    corecore