Search CORE

320 research outputs found

ULTRA-DP: Unifying Graph Pre-training with Multi-task Graph Dual Prompt

Author: Chen Mouxiang
Li Jundong
Liu Chenghao
Liu Zemin
Mao Qiheng
Sun Jianling
Publication venue
Publication date: 17/12/2023
Field of study

Recent research has demonstrated the efficacy of pre-training graph neural networks (GNNs) to capture the transferable graph semantics and enhance the performance of various downstream tasks. However, the semantic knowledge learned from pretext tasks might be unrelated to the downstream task, leading to a semantic gap that limits the application of graph pre-training. To reduce this gap, traditional approaches propose hybrid pre-training to combine various pretext tasks together in a multi-task learning fashion and learn multi-grained knowledge, which, however, cannot distinguish tasks and results in some transferable task-specific knowledge distortion by each other. Moreover, most GNNs cannot distinguish nodes located in different parts of the graph, making them fail to learn position-specific knowledge and lead to suboptimal performance. In this work, inspired by the prompt-based tuning in natural language processing, we propose a unified framework for graph hybrid pre-training which injects the task identification and position identification into GNNs through a prompt mechanism, namely multi-task graph dual prompt (ULTRA-DP). Based on this framework, we propose a prompt-based transferability test to find the most relevant pretext task in order to reduce the semantic gap. To implement the hybrid pre-training tasks, beyond the classical edge prediction task (node-node level), we further propose a novel pre-training paradigm based on a group of

k

-nearest neighbors (node-group level). The combination of them across different scales is able to comprehensively express more structural semantics and derive richer multi-grained knowledge. Extensive experiments show that our proposed ULTRA-DP can significantly enhance the performance of hybrid pre-training methods and show the generalizability to other pre-training tasks and backbone architectures

arXiv.org e-Print Archive

Calibration of Time-Series Forecasting Transformers: Detecting and Adapting Context-Driven Distribution Shift

Author: Chen Mouxiang
Fu Han
Li Zhuo
Liu Chenghao
Shen Lefei
Sun Jianling
Publication venue
Publication date: 23/10/2023
Field of study

Recent years have witnessed the success of introducing Transformers to time series forecasting. From a data generation perspective, we illustrate that existing Transformers are susceptible to distribution shifts driven by temporal contexts, whether observed or unobserved. Such context-driven distribution shift (CDS) introduces biases in predictions within specific contexts and poses challenges for conventional training paradigm. In this paper, we introduce a universal calibration methodology for the detection and adaptation of CDS with a trained Transformer model. To this end, we propose a novel CDS detector, termed the "residual-based CDS detector" or "Reconditionor", which quantifies the model's vulnerability to CDS by evaluating the mutual information between prediction residuals and their corresponding contexts. A high Reconditionor score indicates a severe susceptibility, thereby necessitating model adaptation. In this circumstance, we put forth a straightforward yet potent adapter framework for model calibration, termed the "sample-level contextualized adapter" or "SOLID". This framework involves the curation of a contextually similar dataset to the provided test sample and the subsequent fine-tuning of the model's prediction layer with a limited number of steps. Our theoretical analysis demonstrates that this adaptation strategy is able to achieve an optimal equilibrium between bias and variance. Notably, our proposed Reconditionor and SOLID are model-agnostic and readily adaptable to a wide range of Transformers. Extensive experiments show that SOLID consistently enhances the performance of current SOTA Transformers on real-world datasets, especially on cases with substantial CDS detected by the proposed Reconditionor, thus validate the effectiveness of the calibration approach

arXiv.org e-Print Archive

Learning transferrable parameters for long-tailed sequential user behavior modeling

Author: HOI Steven C. H.
LIU Chenghao
SUN Jianling
WANG Weiqing
YIN Jianwen
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/08/2020
Field of study

National Research Foundation (NRF) Singapore under its AI Singapore Programm

arXiv.org e-Print Archive

Crossref

Institutional Knowledge at Singapore Management University

Compositional coding for collaborative filtering

Author: CHENG Zhiyong
HOI Steven C. H.
LIU Chenghao
LU Tao
SUN Jianling
WANG Xin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 09/05/2019
Field of study

National Research Foundation (NRF) Singapore under its AI Singapore Programm

arXiv.org e-Print Archive

Crossref

Institutional Knowledge at Singapore Management University