126 research outputs found

    BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling

    Full text link
    Like masked language modeling (MLM) in natural language processing, masked image modeling (MIM) aims to extract valuable insights from image patches to enhance the feature extraction capabilities of the underlying deep neural network (DNN). Contrasted with other training paradigms like supervised learning and unsupervised contrastive learning, masked image modeling (MIM) pretraining typically demands significant computational resources in order to manage large training data batches (e.g., 4096). The significant memory and computation requirements pose a considerable challenge to its broad adoption. To mitigate this, we introduce a novel learning framework, termed~\textit{Block-Wise Masked Image Modeling} (BIM). This framework involves decomposing the MIM tasks into several sub-tasks with independent computation patterns, resulting in block-wise back-propagation operations instead of the traditional end-to-end approach. Our proposed BIM maintains superior performance compared to conventional MIM while greatly reducing peak memory consumption. Moreover, BIM naturally enables the concurrent training of numerous DNN backbones of varying depths. This leads to the creation of multiple trained DNN backbones, each tailored to different hardware platforms with distinct computing capabilities. This approach significantly reduces computational costs in comparison with training each DNN backbone individually. Our framework offers a promising solution for resource constrained training of MIM

    Guideline Learning for In-context Information Extraction

    Full text link
    Large language models (LLMs) can perform a new task by merely conditioning on task instructions and a few input-output examples, without optimizing any parameters. This is called In-Context Learning (ICL). In-context Information Extraction (IE) has recently garnered attention in the research community. However, the performance of In-context IE generally lags behind the state-of-the-art supervised expert models. We highlight a key reason for this shortfall: underspecified task description. The limited-length context struggles to thoroughly express the intricate IE task instructions and various edge cases, leading to misalignment in task comprehension with humans. In this paper, we propose a Guideline Learning (GL) framework for In-context IE which reflectively learns and follows guidelines. During the learning phrase, GL automatically synthesizes a set of guidelines based on a few error cases, and during inference, GL retrieves helpful guidelines for better ICL. Moreover, we propose a self-consistency-based active learning method to enhance the efficiency of GL. Experiments on event extraction and relation extraction show that GL can significantly improve the performance of in-context IE.Comment: EMNLP 2023 main conferenc

    ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection

    Full text link
    Post-hoc out-of-distribution (OOD) detection has garnered intensive attention in reliable machine learning. Many efforts have been dedicated to deriving score functions based on logits, distances, or rigorous data distribution assumptions to identify low-scoring OOD samples. Nevertheless, these estimate scores may fail to accurately reflect the true data density or impose impractical constraints. To provide a unified perspective on density-based score design, we propose a novel theoretical framework grounded in Bregman divergence, which extends distribution considerations to encompass an exponential family of distributions. Leveraging the conjugation constraint revealed in our theorem, we introduce a \textsc{ConjNorm} method, reframing density function design as a search for the optimal norm coefficient pp against the given dataset. In light of the computational challenges of normalization, we devise an unbiased and analytically tractable estimator of the partition function using the Monte Carlo-based importance sampling technique. Extensive experiments across OOD detection benchmarks empirically demonstrate that our proposed \textsc{ConjNorm} has established a new state-of-the-art in a variety of OOD detection setups, outperforming the current best method by up to 13.25%\% and 28.19%\% (FPR95) on CIFAR-100 and ImageNet-1K, respectively.Comment: ICLR24 poste

    Global quantum phase diagram and non-Abelian chiral spin liquid in a spin-3/2 square lattice antiferromagnet

    Full text link
    Since strong quantum fluctuations are essential for the emergence of quantum spin liquids, there have been extensive exploration and identification of spin liquid candidates in spin-1/21/2 systems, while such activities are rare in higher spin systems. Here we report an example of non-Abelian chiral spin liquid emerging in spin-3/23/2 Heisenberg model on a square lattice. By tuning Heisenberg exchange interaction and scalar chirality interaction, we map out a quantum phase diagram enclosing three conventional magnetic orders and a chiral spin liquid based on density matrix renormalization group studies. The nature of the spin liquid is identified as a long-sought bosonic version of Read-Rezayi state that supports non-Abelian Fibonacci anyonic statistics, identified by the ground state entanglement spectrum. Significantly, we establish that the non-Abelian CSL emerges through the enlarged local degrees of freedom and enhanced quantum fluctuations near the classical phase boundaries of competing magnetic orders. Our numerical discovery of an exotic quantum spin liquid in spin-3/23/2 system suggests a new route for discovering fractionalized quantum phases in frustrated higher spin magnetic compounds.Comment: LA-UR-22-3320

    Double Self-Sustainable Reconfigurable Intelligent Surfaces Aided Wireless Communications

    Full text link
    A double self-sustainable reconfigurable intelligent surfaces (RISs) assisted multi-user multiple input multiple output (MIMO) system is investigated. Two RISs are equipped with energy harvesting circuit to achieve self-sustainable transmission. The aim is to minimize the transmission power at the base station (BS), while guaranteeing the quality of service (QoS) requirements of the users and meeting the power consumption requirements of the RISs. A block coordinate descent (BCD) algorithm based on the penalty-based method and successive convex approximation (SCA) is employed to alternatively optimize the active beamforming at the BS and the phase shifts, as well as amplitude coefficients of two RISs. Simulation results show that the required power consumption at the BS for the proposed double self-sustainable RISs system is significantly reduced compared to conventional RIS systems
    • …
    corecore