126 research outputs found
BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling
Like masked language modeling (MLM) in natural language processing, masked
image modeling (MIM) aims to extract valuable insights from image patches to
enhance the feature extraction capabilities of the underlying deep neural
network (DNN). Contrasted with other training paradigms like supervised
learning and unsupervised contrastive learning, masked image modeling (MIM)
pretraining typically demands significant computational resources in order to
manage large training data batches (e.g., 4096). The significant memory and
computation requirements pose a considerable challenge to its broad adoption.
To mitigate this, we introduce a novel learning framework,
termed~\textit{Block-Wise Masked Image Modeling} (BIM). This framework involves
decomposing the MIM tasks into several sub-tasks with independent computation
patterns, resulting in block-wise back-propagation operations instead of the
traditional end-to-end approach. Our proposed BIM maintains superior
performance compared to conventional MIM while greatly reducing peak memory
consumption. Moreover, BIM naturally enables the concurrent training of
numerous DNN backbones of varying depths. This leads to the creation of
multiple trained DNN backbones, each tailored to different hardware platforms
with distinct computing capabilities. This approach significantly reduces
computational costs in comparison with training each DNN backbone individually.
Our framework offers a promising solution for resource constrained training of
MIM
Guideline Learning for In-context Information Extraction
Large language models (LLMs) can perform a new task by merely conditioning on
task instructions and a few input-output examples, without optimizing any
parameters. This is called In-Context Learning (ICL). In-context Information
Extraction (IE) has recently garnered attention in the research community.
However, the performance of In-context IE generally lags behind the
state-of-the-art supervised expert models. We highlight a key reason for this
shortfall: underspecified task description. The limited-length context
struggles to thoroughly express the intricate IE task instructions and various
edge cases, leading to misalignment in task comprehension with humans. In this
paper, we propose a Guideline Learning (GL) framework for In-context IE which
reflectively learns and follows guidelines. During the learning phrase, GL
automatically synthesizes a set of guidelines based on a few error cases, and
during inference, GL retrieves helpful guidelines for better ICL. Moreover, we
propose a self-consistency-based active learning method to enhance the
efficiency of GL. Experiments on event extraction and relation extraction show
that GL can significantly improve the performance of in-context IE.Comment: EMNLP 2023 main conferenc
ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection
Post-hoc out-of-distribution (OOD) detection has garnered intensive attention
in reliable machine learning. Many efforts have been dedicated to deriving
score functions based on logits, distances, or rigorous data distribution
assumptions to identify low-scoring OOD samples. Nevertheless, these estimate
scores may fail to accurately reflect the true data density or impose
impractical constraints. To provide a unified perspective on density-based
score design, we propose a novel theoretical framework grounded in Bregman
divergence, which extends distribution considerations to encompass an
exponential family of distributions. Leveraging the conjugation constraint
revealed in our theorem, we introduce a \textsc{ConjNorm} method, reframing
density function design as a search for the optimal norm coefficient
against the given dataset. In light of the computational challenges of
normalization, we devise an unbiased and analytically tractable estimator of
the partition function using the Monte Carlo-based importance sampling
technique. Extensive experiments across OOD detection benchmarks empirically
demonstrate that our proposed \textsc{ConjNorm} has established a new
state-of-the-art in a variety of OOD detection setups, outperforming the
current best method by up to 13.25 and 28.19 (FPR95) on CIFAR-100 and
ImageNet-1K, respectively.Comment: ICLR24 poste
Global quantum phase diagram and non-Abelian chiral spin liquid in a spin-3/2 square lattice antiferromagnet
Since strong quantum fluctuations are essential for the emergence of quantum
spin liquids, there have been extensive exploration and identification of spin
liquid candidates in spin- systems, while such activities are rare in
higher spin systems. Here we report an example of non-Abelian chiral spin
liquid emerging in spin- Heisenberg model on a square lattice. By tuning
Heisenberg exchange interaction and scalar chirality interaction, we map out a
quantum phase diagram enclosing three conventional magnetic orders and a chiral
spin liquid based on density matrix renormalization group studies. The nature
of the spin liquid is identified as a long-sought bosonic version of
Read-Rezayi state that supports non-Abelian Fibonacci anyonic statistics,
identified by the ground state entanglement spectrum. Significantly, we
establish that the non-Abelian CSL emerges through the enlarged local degrees
of freedom and enhanced quantum fluctuations near the classical phase
boundaries of competing magnetic orders. Our numerical discovery of an exotic
quantum spin liquid in spin- system suggests a new route for discovering
fractionalized quantum phases in frustrated higher spin magnetic compounds.Comment: LA-UR-22-3320
Double Self-Sustainable Reconfigurable Intelligent Surfaces Aided Wireless Communications
A double self-sustainable reconfigurable intelligent surfaces (RISs) assisted
multi-user multiple input multiple output (MIMO) system is investigated. Two
RISs are equipped with energy harvesting circuit to achieve self-sustainable
transmission. The aim is to minimize the transmission power at the base station
(BS), while guaranteeing the quality of service (QoS) requirements of the users
and meeting the power consumption requirements of the RISs. A block coordinate
descent (BCD) algorithm based on the penalty-based method and successive convex
approximation (SCA) is employed to alternatively optimize the active
beamforming at the BS and the phase shifts, as well as amplitude coefficients
of two RISs. Simulation results show that the required power consumption at the
BS for the proposed double self-sustainable RISs system is significantly
reduced compared to conventional RIS systems
- …