Search CORE

126 research outputs found

BIM: Block-Wise Self-Supervised Learning with Masked Image Modeling

Author: Luo Yixuan
Ren Mengye
Zhang Sai Qian
Publication venue
Publication date: 28/11/2023
Field of study

Like masked language modeling (MLM) in natural language processing, masked image modeling (MIM) aims to extract valuable insights from image patches to enhance the feature extraction capabilities of the underlying deep neural network (DNN). Contrasted with other training paradigms like supervised learning and unsupervised contrastive learning, masked image modeling (MIM) pretraining typically demands significant computational resources in order to manage large training data batches (e.g., 4096). The significant memory and computation requirements pose a considerable challenge to its broad adoption. To mitigate this, we introduce a novel learning framework, termed~\textit{Block-Wise Masked Image Modeling} (BIM). This framework involves decomposing the MIM tasks into several sub-tasks with independent computation patterns, resulting in block-wise back-propagation operations instead of the traditional end-to-end approach. Our proposed BIM maintains superior performance compared to conventional MIM while greatly reducing peak memory consumption. Moreover, BIM naturally enables the concurrent training of numerous DNN backbones of varying depths. This leads to the creation of multiple trained DNN backbones, each tailored to different hardware platforms with distinct computing capabilities. This approach significantly reduces computational costs in comparison with training each DNN backbone individually. Our framework offers a promising solution for resource constrained training of MIM

arXiv.org e-Print Archive

Guideline Learning for In-context Information Extraction

Author: Cao Yixuan
Ding Qiang
Luo Ping
Pang Chaoxu
Publication venue
Publication date: 21/10/2023
Field of study

Large language models (LLMs) can perform a new task by merely conditioning on task instructions and a few input-output examples, without optimizing any parameters. This is called In-Context Learning (ICL). In-context Information Extraction (IE) has recently garnered attention in the research community. However, the performance of In-context IE generally lags behind the state-of-the-art supervised expert models. We highlight a key reason for this shortfall: underspecified task description. The limited-length context struggles to thoroughly express the intricate IE task instructions and various edge cases, leading to misalignment in task comprehension with humans. In this paper, we propose a Guideline Learning (GL) framework for In-context IE which reflectively learns and follows guidelines. During the learning phrase, GL automatically synthesizes a set of guidelines based on a few error cases, and during inference, GL retrieves helpful guidelines for better ICL. Moreover, we propose a self-consistency-based active learning method to enhance the efficiency of GL. Experiments on event extraction and relation extraction show that GL can significantly improve the performance of in-context IE.Comment: EMNLP 2023 main conferenc

arXiv.org e-Print Archive

ConjNorm: Tractable Density Estimation for Out-of-Distribution Detection

Author: Fang Zhen
Li Yixuan
Luo Yadan
Peng Bo
Zhang Yonggang
Publication venue
Publication date: 27/02/2024
Field of study

Post-hoc out-of-distribution (OOD) detection has garnered intensive attention in reliable machine learning. Many efforts have been dedicated to deriving score functions based on logits, distances, or rigorous data distribution assumptions to identify low-scoring OOD samples. Nevertheless, these estimate scores may fail to accurately reflect the true data density or impose impractical constraints. To provide a unified perspective on density-based score design, we propose a novel theoretical framework grounded in Bregman divergence, which extends distribution considerations to encompass an exponential family of distributions. Leveraging the conjugation constraint revealed in our theorem, we introduce a \textsc{ConjNorm} method, reframing density function design as a search for the optimal norm coefficient

p

against the given dataset. In light of the computational challenges of normalization, we devise an unbiased and analytically tractable estimator of the partition function using the Monte Carlo-based importance sampling technique. Extensive experiments across OOD detection benchmarks empirically demonstrate that our proposed \textsc{ConjNorm} has established a new state-of-the-art in a variety of OOD detection setups, outperforming the current best method by up to 13.25

\%

and 28.19

\%

(FPR95) on CIFAR-100 and ImageNet-1K, respectively.Comment: ICLR24 poste

arXiv.org e-Print Archive

Global quantum phase diagram and non-Abelian chiral spin liquid in a spin-3/2 square lattice antiferromagnet

Author: Huang Yixuan
Luo Wei-Wei
Sheng D. N.
Zhu W.
Publication venue
Publication date: 29/12/2022
Field of study

Since strong quantum fluctuations are essential for the emergence of quantum spin liquids, there have been extensive exploration and identification of spin liquid candidates in spin-

1/2

systems, while such activities are rare in higher spin systems. Here we report an example of non-Abelian chiral spin liquid emerging in spin-

3/2

Heisenberg model on a square lattice. By tuning Heisenberg exchange interaction and scalar chirality interaction, we map out a quantum phase diagram enclosing three conventional magnetic orders and a chiral spin liquid based on density matrix renormalization group studies. The nature of the spin liquid is identified as a long-sought bosonic version of Read-Rezayi state that supports non-Abelian Fibonacci anyonic statistics, identified by the ground state entanglement spectrum. Significantly, we establish that the non-Abelian CSL emerges through the enlarged local degrees of freedom and enhanced quantum fluctuations near the classical phase boundaries of competing magnetic orders. Our numerical discovery of an exotic quantum spin liquid in spin-

3/2

system suggests a new route for discovering fractionalized quantum phases in frustrated higher spin magnetic compounds.Comment: LA-UR-22-3320

arXiv.org e-Print Archive

Editorial: Emerging talents in water science: water and critical zone 2021/22

Author: Kenneth Hurst Williams
Pingping Luo
Pingping Luo
Pingping Luo
Quoc Bao Pham
Yixuan Zhang
Yixuan Zhang
Yixuan Zhang
Yubin Zhang
Yubin Zhang
Yubin Zhang
Publication venue: Frontiers Media S.A.
Publication date: 01/04/2024
Field of study

Directory of Open Access Journals

Double Self-Sustainable Reconfigurable Intelligent Surfaces Aided Wireless Communications

Author: Li Xingwang
Li Yixuan
Luo Suhong
Nallanathan Arumugam
Wang Ji
Xie Wenwu
Publication venue
Publication date: 07/05/2024
Field of study

A double self-sustainable reconfigurable intelligent surfaces (RISs) assisted multi-user multiple input multiple output (MIMO) system is investigated. Two RISs are equipped with energy harvesting circuit to achieve self-sustainable transmission. The aim is to minimize the transmission power at the base station (BS), while guaranteeing the quality of service (QoS) requirements of the users and meeting the power consumption requirements of the RISs. A block coordinate descent (BCD) algorithm based on the penalty-based method and successive convex approximation (SCA) is employed to alternatively optimize the active beamforming at the BS and the phase shifts, as well as amplitude coefficients of two RISs. Simulation results show that the required power consumption at the BS for the proposed double self-sustainable RISs system is significantly reduced compared to conventional RIS systems

arXiv.org e-Print Archive