Search CORE

379 research outputs found

Parameter and Computation Efficient Transfer Learning for Vision-Language Pre-trained Models

Author: Huang Shubin
Ji Rongrong
Sun Xiaoshuai
Wu Qiong
Yu Wei
Zhou Yiyi
Publication venue
Publication date: 06/09/2023
Field of study

With ever increasing parameters and computation, vision-language pre-trained (VLP) models exhibit prohibitive expenditure in downstream task adaption. Recent endeavors mainly focus on parameter efficient transfer learning (PETL) for VLP models by only updating a small number of parameters. However, excessive computational overhead still plagues the application of VLPs. In this paper, we aim at parameter and computation efficient transfer learning (PCETL) for VLP models. In particular, PCETL not only needs to limit the number of trainable parameters in VLP models, but also to reduce the computational redundancy during inference, thus enabling a more efficient transfer. To approach this target, we propose a novel dynamic architecture skipping (DAS) approach towards effective PCETL. Instead of directly optimizing the intrinsic architectures of VLP models, DAS first observes the significances of their modules to downstream tasks via a reinforcement learning (RL) based process, and then skips the redundant ones with lightweight networks, i.e., adapters, according to the obtained rewards. In this case, the VLP model can well maintain the scale of trainable parameters while speeding up its inference on downstream tasks. To validate DAS, we apply it to two representative VLP models, namely ViLT and METER, and conduct extensive experiments on a bunch of VL tasks. The experimental results not only show the great advantages of DAS in reducing computational complexity, e.g. -11.97% FLOPs of METER on VQA2.0, but also confirm its competitiveness against existing PETL methods in terms of parameter scale and performance. Our source code is given in our appendix

arXiv.org e-Print Archive

Adapting Pre-trained Language Models to Vision-Language Tasks via Dynamic Visual Prompting

Author: Chen Weijie
Huang Shubin
Ji Rongrong
Sun Xiaoshuai
Wu Qiong
Zhang Rongsheng
Zhou Yiyi
Publication venue
Publication date: 01/06/2023
Field of study

Pre-trained language models (PLMs) have played an increasing role in multimedia research. In terms of vision-language (VL) tasks, they often serve as a language encoder and still require an additional fusion network for VL reasoning, resulting in excessive memory overhead. In this paper, we focus on exploring PLMs as a stand-alone model for VL reasoning tasks. Inspired by the recently popular prompt tuning, we first prove that the processed visual features can be also projected onto the semantic space of PLMs and act as prompt tokens to bridge the gap between single- and multi-modal learning. However, this solution exhibits obvious redundancy in visual information and model inference, and the placement of prompt tokens also greatly affects the final performance. Based on these observations, we further propose a novel transfer learning approach for PLMs, termed Dynamic Visual Prompting (DVP). Concretely, DVP first deploys a cross-attention module to obtain text-related and compact visual prompt tokens, thereby greatly reducing the input length of PLMs. To obtain the optimal placement, we also equip DVP with a reinforcement-learning based search algorithm, which can automatically merge DVP with PLMs for different VL tasks via a very short search process. In addition, we also experiment DVP with the recently popular adapter approach to keep the most parameters of PLMs intact when adapting to VL tasks, helping PLMs achieve a quick shift between single- and multi-modal tasks. We apply DVP to two representative PLMs, namely BERT and T5, and conduct extensive experiments on a set of VL reasoning benchmarks including VQA2.0, GQA and SNLIVE. The experimental results not only show the advantage of DVP on efficiency and performance, but also confirm its superiority in adapting pre-trained language models to VL tasks

arXiv.org e-Print Archive

Approximated Prompt Tuning for Vision-Language Pre-trained Models

Author: Dai Pingyang
Huang Shubin
Ji Rongrong
Jiang Guannan
Shu Annan
Wu Qiong
Zhou Yiyi
Publication venue
Publication date: 27/06/2023
Field of study

Prompt tuning is a parameter-efficient way to deploy large-scale pre-trained models to downstream tasks by adding task-specific tokens. In terms of vision-language pre-trained (VLP) models, prompt tuning often requires a large number of learnable tokens to bridge the gap between the pre-training and downstream tasks, which greatly exacerbates the already high computational overhead. In this paper, we revisit the principle of prompt tuning for Transformer-based VLP models and reveal that the impact of soft prompt tokens can be actually approximated via independent information diffusion steps, thereby avoiding the expensive global attention modeling and reducing the computational complexity to a large extent. Based on this finding, we propose a novel Approximated Prompt Tuning (APT) approach towards efficient VL transfer learning. To validate APT, we apply it to two representative VLP models, namely ViLT and METER, and conduct extensive experiments on a bunch of downstream tasks. Meanwhile, the generalization of APT is also validated on CLIP for image classification. The experimental results not only show the superior performance gains and computation efficiency of APT against the conventional prompt tuning methods, e.g., +6.6% accuracy and -64.62% additional computation overhead on METER, but also confirm its merits over other parameter-efficient transfer learning approaches

arXiv.org e-Print Archive

Xylem plasticity of root, stem, and branch in Cunninghamia lanceolata under drought stress: implications for whole-plant hydraulic integrity

Author: Heal Kate V.
Huang Xiaoyan
Li Shubin
Zhang Maxiao
Zheng Ruping
Zhou Lili
Zou Zhiguang
Publication venue
Publication date: 19/02/2024
Field of study

Edinburgh Research Explorer

What Goes beyond Multi-modal Fusion in One-stage Referring Expression Comprehension: An Empirical Study

Author: Huang Shubin
Ji Rongrong
Luo Gen
Sun Jiamu
Sun Xiaoshuai
Wu Yongjian
Ye Qixiang
Zhou Yiyi
Publication venue
Publication date: 16/04/2022
Field of study

Most of the existing work in one-stage referring expression comprehension (REC) mainly focuses on multi-modal fusion and reasoning, while the influence of other factors in this task lacks in-depth exploration. To fill this gap, we conduct an empirical study in this paper. Concretely, we first build a very simple REC network called SimREC, and ablate 42 candidate designs/settings, which covers the entire process of one-stage REC from network design to model training. Afterwards, we conduct over 100 experimental trials on three benchmark datasets of REC. The extensive experimental results not only show the key factors that affect REC performance in addition to multi-modal fusion, e.g., multi-scale features and data augmentation, but also yield some findings that run counter to conventional understanding. For example, as a vision and language (V&L) task, REC does is less impacted by language prior. In addition, with a proper combination of these findings, we can improve the performance of SimREC by a large margin, e.g., +27.12% on RefCOCO+, which outperforms all existing REC methods. But the most encouraging finding is that with much less training overhead and parameters, SimREC can still achieve better performance than a set of large-scale pre-trained models, e.g., UNITER and VILLA, portraying the special role of REC in existing V&L research

arXiv.org e-Print Archive

Biochar: A promising soil amendment to mitigate heavy metals toxicity in plants

Author: ALAMRI Saad
HASHEM Mohamed
HUANG Guoqin
LIU Ying
MOSTAFA Yasser S.
SONG Ying
TANG Haiying
UMAIR HASSAN Muhammad
WANG Shubin
Publication venue: University of Agricultural Sciences and Veterinary Medicine (UASVM), Cluj-Napoca, Romania
Publication date: 25/08/2022
Field of study

Heavy metals (HMs) toxicity is serious abiotic stress that is significantly reducing crop productivity and posing a serious threat to human health, soil and environmental quality. Therefore, it is urgently needed to find appropriate measures to mitigate the adverse impacts of HMs on soil, plants, humans and the environment. Biochar (BC) has emerged as an excellent soil amendment to minimize the adverse impacts of HMs and to improve soil fertility and environmental quality. Biochar application decreases HMs uptake and their translocation to plant parts by forming complexes and precipitation. Biochar also has improved soil pH, soil fertility and soil cation exchange capacity (CEC) and it also increases adsorption of HMs thus reduces their mobility and subsequent availability to plants. BC application also maintains membrane stability and improves uptake of nutrients, osmolytes accumulation, antioxidant activities, and gene expression, therefore, improves the plant performance under HMs stress. Biochar application also improves the photosynthetic performance by increasing the synthesis of photosynthetic pigments, stomata conductance and increasing the water uptake by plants. Besides this, BC also scavenges ROS by increasing the antioxidant activities, gene expression, and accumulation of proline in HMs contaminated soils. This review highlights the role of BC to mitigate the HMs toxicity in plants. We have discussed the role of BC in the modification of soil properties to induce tolerance against HMs toxicity. Moreover, we have discussed various mechanisms mediated by BC at the plant level to induce tolerance against HMs. Additionally, we also identified research gaps that must be fulfilled in future research studies

Notulae Botanicae Horti Agrobotanici Cluj-Napoca

Xylem plasticity of root, stem, and branch in Cunninghamia lanceolata under drought stress: implications for whole-plant hydraulic integrity

Author: Kate V. Heal
Lili Zhou
Lili Zhou
Maxiao Zhang
Maxiao Zhang
Ruping Zheng
Ruping Zheng
Shubin Li
Shubin Li
Xiaoyan Huang
Xiaoyan Huang
Zhiguang Zou
Zhiguang Zou
Publication venue: Frontiers Media S.A.
Publication date: 01/02/2024
Field of study

IntroductionA better understanding of xylem hydraulic characteristics in trees is critical to elucidate the mechanisms of forest decline and tree mortality from water deficit. As well as temperate forests and forests growing in arid regions, subtropical and tropical forests are also predicted to experience an increased frequency and intensity of climate change-induced drought in the near future.MethodsIn this study, 1-year-old Cunninghamia lanceolata seedlings (a typical subtropical species in southern China) were selected for a continuous controlled drought pot experiment of 45 days duration. The experimental treatments were non-drought (control), light drought, moderate drought and severe drought stress, which were 80%, 60%, 50%, and 40%, respectively of soil field maximum moisture capacity.ResultsThe hydraulic conductivity, specific conductivity and water potential of roots, stems, and branches of C. lanceolata all decreased with the prolonging of drought in the different drought intensities. The relative decrease in these hydraulic values were greater in roots than in stems and branches, indicating that roots are more sensitive to drought. Root tracheid diameters normally reduce to ensure security of water transport with prolonged drought, whilst the tracheid diameters of stems and branches expand initially to ensure water transport and then decrease to reduce the risk of embolism with continuing drought duration. The pit membrane diameter of roots, stems and branches generally increased to different extents during the 15–45 days drought duration, which is conducive to enhanced radial water transport ability. The tracheid density and pit density of stems generally decreased during drought stress, which decreased water transport efficiency and increased embolism occurrence. Correlation analysis indicated that anatomical plasticity greatly influenced the hydraulic properties, whilst the relationships varied among different organs. In roots, tracheid diameter decreased and tracheid density increased to enhance water transport security; stems and branches may increase tracheid diameter and pit membrane diameter to increase hydraulic conductivity ability, but may increase the occurrence of xylem embolism.DiscussionIn summary, under drought stress, the xylem anatomical characteristics of C. lanceolata organs were highly plastic to regulate water transport vertically and radially to maintain the trade-off between hydraulic conductivity efficiency and safety

Directory of Open Access Journals