Search CORE

3 research outputs found

Debiased Fine-Tuning for Vision-language Models by Prompt Regularization

Author: Hur Minhoe
Lee Saeil
Niu Yulei
Zhang Hanwang
Zhu Beier
Publication venue
Publication date: 31/03/2023
Field of study

We present a new paradigm for fine-tuning large-scale visionlanguage pre-trained models on downstream task, dubbed Prompt Regularization (ProReg). Different from traditional fine-tuning which easily overfits to the downstream task data, ProReg uses the prediction by prompting the pretrained model to regularize the fine-tuning. The motivation is: by prompting the large model "a photo of a [CLASS]", the fil-lin answer is only dependent on the pretraining encyclopedic knowledge while independent of the task data distribution, which is usually biased. Specifically, given a training sample prediction during fine-tuning, we first calculate its KullbackLeibler loss of the prompt prediction and Cross-Entropy loss of the ground-truth label, and then combine them with a proposed sample-wise adaptive trade-off weight, which automatically adjusts the transfer between the pretrained and downstream domains. On various out-of-distribution benchmarks, we show the consistently strong performance of ProReg compared with conventional fine-tuning, zero-shot prompt, prompt tuning, and other state-of-the-art methods.Comment: AAAI2023 accepte

arXiv.org e-Print Archive

Debiased Fine-Tuning for Vision-Language Models by Prompt Regularization

Author: Hur Minhoe
Lee Saeil
Niu Yulei
Zhang Hanwang
Zhu Beier
Publication venue: Association for the Advancement of Artificial Intelligence
Publication date: 26/06/2023
Field of study

We present a new paradigm for fine-tuning large-scale vision-language pre-trained models on downstream task, dubbed Prompt Regularization (ProReg). Different from traditional fine-tuning which easily overfits to the downstream task data, ProReg uses the prediction by prompting the pretrained model to regularize the fine-tuning. The motivation is: by prompting the large model “a photo of a [CLASS]”, the fill-in answer is only dependent on the pretraining encyclopedic knowledge while independent of the task data distribution, which is usually biased. Specifically, given a training sample prediction during fine-tuning, we first calculate its Kullback-Leibler loss of the prompt prediction and Cross-Entropy loss of the ground-truth label, and then combine them with a proposed sample-wise adaptive trade- off weight, which automatically adjusts the transfer between the pretrained and downstream domains. On various out-of-distribution benchmarks, we show the consistently strong performance of ProReg compared with conventional fine-tuning, zero-shot prompt, prompt tuning, and other state-of-the-art methods

Association for the Advancement of Artificial Intelligence: AAAI Publications

Multi-disciplinary design optimization and performance evaluation of a single stage transonic axial compressor

Author: A Keskin
A Oyama
A Samad
Byeung Jun Lim
C Leyens
Dong-Ho Lee
E Benini
K-S Lee
K-Y Lee
Kyu-Hong Kim
L Reid
M G Neubauer
S Hong
S Hong
S Hong
S Jun
S Pierret
Saeil Lee
Tae Choon Park
Y Kim
Y Lian
Young-Seok Kang
Z-Z Li
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref