6 research outputs found
Towards Omni-supervised Referring Expression Segmentation
Referring Expression Segmentation (RES) is an emerging task in computer
vision, which segments target instances in images according to text
descriptions. However, its development is hampered by expensive segmentation
labels. To address this issue, we propose a new learning task for RES called
Omni-supervised Referring Expression Segmentation (Omni-RES), which aims to
make full use of unlabeled, fully labeled and weakly labeled data, e.g.,
referring points or grounding boxes, for efficient RES training. To accomplish
this task, we also propose a novel yet strong baseline method for Omni-RES
based on the recently popular teacher-student learning paradigm, where the weak
labels are not directly transformed into supervision signals but instead used
as a yardstick to select and refine high-quality pseudo-masks for the student. To
validate the proposed Omni-RES method, we apply it to a set of state-of-the-art
RES models and conduct extensive experiments on multiple RES datasets. The
experimental results show the clear merits of Omni-RES over both
fully-supervised and semi-supervised training schemes. For instance, with only
10% fully labeled data, Omni-RES helps the base model achieve 100% of fully
supervised performance, and it also outperforms the semi-supervised alternative
by a large margin, e.g., +14.93% on RefCOCO and +14.95% on RefCOCO+,
respectively. More importantly, Omni-RES also enables the use of large-scale
vision-language datasets such as Visual Genome to facilitate low-cost RES
training, achieving new SOTA RES performance, e.g., 80.66 on RefCOCO
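The "yardstick" idea above can be sketched as follows. This is a minimal illustration, not the paper's exact procedure: the function names, the tight-bounding-box comparison, and the IoU threshold are all assumptions for the sketch.

```python
import numpy as np

def box_iou(box_a, box_b):
    """IoU between two [x1, y1, x2, y2] boxes."""
    x1 = max(box_a[0], box_b[0]); y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2]); y2 = min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    return inter / (area_a + area_b - inter + 1e-9)

def select_pseudo_masks(pseudo_masks, weak_boxes, thresh=0.5):
    """Keep teacher pseudo-masks whose tight bounding box agrees
    with the weak grounding-box label (the 'yardstick')."""
    kept = []
    for mask, box in zip(pseudo_masks, weak_boxes):
        ys, xs = np.nonzero(mask)
        if len(xs) == 0:
            continue  # empty prediction: nothing to keep
        mask_box = [xs.min(), ys.min(), xs.max() + 1, ys.max() + 1]
        if box_iou(mask_box, box) >= thresh:
            kept.append(mask)
    return kept
```

A pseudo-mask whose extent disagrees with the weak box is discarded rather than used for training, so the weak label filters supervision instead of providing it directly.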
Towards Efficient Visual Adaption via Structural Re-parameterization
Parameter-efficient transfer learning (PETL) is an emerging research topic
aimed at inexpensively adapting large-scale pre-trained models to downstream
tasks. Recent advances have achieved great success in saving storage costs for
various vision tasks by updating or injecting a small number of parameters
instead of full fine-tuning. However, we notice that most existing PETL methods
still incur non-negligible latency during inference. In this paper, we propose
a parameter-efficient and computationally friendly adapter for giant vision
models, called RepAdapter. Specifically, we prove that the adaptation modules,
even with a complex structure, can be seamlessly integrated into most giant
vision models via structural re-parameterization. This property makes
RepAdapter zero-cost during inference. In addition to computational efficiency,
RepAdapter is also more effective and lightweight than existing PETL methods,
thanks to its sparse structure and careful deployment. To validate RepAdapter, we
conduct extensive experiments on 27 benchmark datasets of three vision tasks,
i.e., image and video classification and semantic segmentation. Experimental
results show the superior performance and efficiency of RepAdapter over
state-of-the-art PETL methods. For instance, by updating only 0.6% of the
parameters, we can improve the performance of ViT from 38.8 to 55.1 on Sun397.
Its generalizability is also well validated on a range of vision models, i.e., ViT,
CLIP, Swin-Transformer and ConvNeXt. Our source code is released at
https://github.com/luogen1996/RepAdapter
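The core of structural re-parameterization can be sketched in a few lines. Assuming a purely linear adapter placed before a linear layer (a simplification; RepAdapter's actual module is more structured), the adapter folds exactly into the layer's weights, so inference costs nothing extra:

```python
import numpy as np

# Sequential linear adapter before a linear layer:
#   y = W @ (x + (A @ x + a)) + b
# folds into a single linear layer:
#   W' = W @ (I + A),   b' = b + W @ a
rng = np.random.default_rng(0)
d_in, d_out = 8, 4
W = rng.standard_normal((d_out, d_in))
b = rng.standard_normal(d_out)
A = 0.01 * rng.standard_normal((d_in, d_in))  # adapter weight
a = 0.01 * rng.standard_normal(d_in)          # adapter bias

def adapted_forward(x):
    """Forward pass with the adapter module kept separate."""
    return W @ (x + (A @ x + a)) + b

# Merge the adapter into the layer (done once, after training).
W_rep = W @ (np.eye(d_in) + A)
b_rep = b + W @ a

def reparam_forward(x):
    """Single merged linear layer: no extra inference cost."""
    return W_rep @ x + b_rep
```

Because `W_rep` has the same shape as `W`, the merged model runs with the original architecture and latency.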
Progressive Training of A Two-Stage Framework for Video Restoration
As a widely studied task, video restoration aims to enhance the quality of
videos afflicted by multiple potential degradations, such as noise, blur and
compression artifacts. Among video restoration tasks, compressed video quality
enhancement and video super-resolution are two of the main tasks with
significant practical value. Recently, recurrent neural networks and
transformers have attracted increasing research interest in this field, due to
their impressive capability in sequence-to-sequence modeling. However, training
these models is not only costly but also relatively hard to converge, owing to
exploding and vanishing gradients. To cope with these
problems, we propose a two-stage framework including a multi-frame recurrent
network and a single-frame transformer. Besides, multiple training strategies,
such as transfer learning and progressive training, are developed to shorten
the training time and improve the model performance. Benefiting from the above
technical contributions, our solution won two championships and one runner-up
in the NTIRE 2022 super-resolution and quality enhancement of compressed video
challenges.
Comment: Winning two championships and one runner-up in the NTIRE 2022
challenge of super-resolution and quality enhancement of compressed video;
accepted to CVPRW 202
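One way progressive training for a recurrent video model is often organized is to grow the clip length (and decay the learning rate) in phases, so early training avoids long unrolls where gradients explode or vanish. The sketch below is an illustrative schedule only; the phase boundaries, clip lengths, and learning rates are assumptions, not the paper's settings:

```python
def progressive_schedule(epoch):
    """Map a training epoch to (clip_length, learning_rate).

    Short clips with a higher LR first, then progressively longer
    clips with a decayed LR (hypothetical values for illustration).
    """
    phases = [  # (last_epoch_of_phase, clip_length, lr)
        (10, 5, 2e-4),
        (25, 15, 1e-4),
        (40, 30, 5e-5),
    ]
    for last, clip_len, lr in phases:
        if epoch <= last:
            return clip_len, lr
    return phases[-1][1], phases[-1][2]  # final phase continues
```

Transfer learning fits the same pipeline: the single-frame transformer stage can be initialized from weights pre-trained on a related restoration task before the schedule above is applied.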