Search CORE

27 research outputs found

LSPT: Long-term Spatial Prompt Tuning for Visual Representation Learning

Author: Li Dongsheng
Luo Xufang
Mo Shentong
Wang Yansen
Publication date: 27/02/2024

Visual Prompt Tuning (VPT) techniques have gained prominence for their capacity to adapt pre-trained Vision Transformers (ViTs) to downstream visual tasks using specialized learnable tokens termed prompts. Contemporary VPT methodologies, especially when employed with self-supervised vision transformers, often default to the introduction of new learnable prompts or gated prompt tokens predominantly sourced from the model's previous block. A pivotal oversight in such approaches is their failure to harness the potential of long-range previous blocks as sources of prompts within each self-supervised ViT. To bridge this crucial gap, we introduce Long-term Spatial Prompt Tuning (LSPT) - a revolutionary approach to visual representation learning. Drawing inspiration from the intricacies of the human brain, LSPT ingeniously incorporates long-term gated prompts. This feature serves as temporal coding, curbing the risk of forgetting parameters acquired from earlier blocks. Further enhancing its prowess, LSPT brings into play patch tokens, serving as spatial coding. This is strategically designed to perpetually amass class-conscious features, thereby fortifying the model's prowess in distinguishing and identifying visual categories. To validate the efficacy of our proposed method, we engaged in rigorous experimentation across 5 FGVC and 19 VTAB-1K benchmarks. Our empirical findings underscore the superiority of LSPT, showcasing its ability to set new benchmarks in visual prompt tuning performance.

arXiv.org e-Print Archive

Adaptive Policy Learning for Offline-to-Online Reinforcement Learning

Author: Jiang Jing
Li Dongsheng
Luo Xufang
Song Xuan
Wei Pengfei
Zheng Han
Publication date: 14/03/2023

Conventional reinforcement learning (RL) needs an environment to collect fresh data, which is impractical when online interactions are costly. Offline RL provides an alternative solution by directly learning from the previously collected dataset. However, it will yield unsatisfactory performance if the quality of the offline datasets is poor. In this paper, we consider an offline-to-online setting where the agent is first learned from the offline dataset and then trained online, and propose a framework called Adaptive Policy Learning for effectively taking advantage of offline and online data. Specifically, we explicitly consider the difference between the online and offline data and apply an adaptive update scheme accordingly, that is, a pessimistic update strategy for the offline dataset and an optimistic/greedy update scheme for the online dataset. Such a simple and effective method provides a way to mix the offline and online RL and achieve the best of both worlds. We further provide two detailed algorithms for implementing the framework through embedding value or policy-based RL algorithms into it. Finally, we conduct extensive experiments on popular continuous control tasks, and results show that our algorithm can learn the expert policy with high sample efficiency even when the quality of the offline dataset is poor, e.g., random dataset. Comment: AAAI202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression

Author: Jiang Huiqiang
Li Dongsheng
Lin Chin-Yew
Luo Xufang
Qiu Lili
Wu Qianhui
Yang Yuqing
Publication date: 10/10/2023

In long context scenarios, large language models (LLMs) face three main challenges: higher computational/financial cost, longer latency, and inferior performance. Some studies reveal that the performance of LLMs depends on both the density and the position of the key information (question relevant) in the input prompt. Inspired by these findings, we propose LongLLMLingua for prompt compression towards improving LLMs' perception of the key information to simultaneously address the three challenges. We conduct evaluation on a wide range of long context scenarios including single-/multi-document QA, few-shot learning, summarization, synthetic tasks, and code completion. The experimental results show that LongLLMLingua compressed prompt can derive higher performance with much less cost. The latency of the end-to-end system is also reduced. For example, on NaturalQuestions benchmark, LongLLMLingua gains a performance boost of up to 17.1% over the original prompt with ~4x fewer tokens as input to GPT-3.5-Turbo. It can derive cost savings of $28.5 and $27.4 per 1,000 samples from the LongBench and ZeroScrolls benchmark, respectively. Additionally, when compressing prompts of ~10k tokens at a compression rate of 2x-10x, LongLLMLingua can speed up the end-to-end latency by 1.4x-3.8x. Our code is available at https://aka.ms/LLMLingua

arXiv.org e-Print Archive

Unified Medical Image Pre-training in Language-Guided Common Semantic Space

Author: He Xiaoxuan
Hu Haoji
Jiang Xinyang
Li Dongsheng
Luo Xufang
Qiu Lili
Yang Yifan
Yang Yuqing
Zhao Siyun
Publication date: 24/11/2023

Vision-Language Pre-training (VLP) has shown its merits in analysing medical images, by leveraging the semantic congruence between medical images and their corresponding reports. It efficiently learns visual representations, which in turn facilitates enhanced analysis and interpretation of intricate imaging data. However, such observations are predominantly justified on single-modality data (mostly 2D images like X-rays); adapting VLP to learn unified representations for medical images in real scenarios remains an open challenge. This arises because medical images often encompass a variety of modalities, especially modalities with different numbers of dimensions (e.g., 3D images like Computed Tomography). To overcome the aforementioned challenges, we propose a Unified Medical Image Pre-training framework, namely UniMedI, which utilizes diagnostic reports as a common semantic space to create unified representations for diverse modalities of medical images (especially for 2D and 3D images). Under the text's guidance, we effectively uncover visual modality information, identifying the affected areas in 2D X-rays and slices containing lesions in sophisticated 3D CT scans, ultimately enhancing the consistency across various medical imaging modalities. To demonstrate the effectiveness and versatility of UniMedI, we evaluate its performance on both 2D and 3D images across 10 different datasets, covering a wide range of medical image tasks such as classification, segmentation, and retrieval. UniMedI has demonstrated superior performance in downstream tasks, showcasing its effectiveness in establishing a universal medical visual representation.

arXiv.org e-Print Archive

Protecting the Future: Neonatal Seizure Detection with Spatial-Temporal Modeling

Author: Duan Juanyong
Fang Yuchen
Huang Congrui
Li Dongsheng
Li You
Li Ziyue
Luo Xufang
Qiu Lili
Ren Kan
Wang Yansen
Publication date: 02/07/2023

Timely detection of seizures for newborn infants with electroencephalogram (EEG) has been a common yet life-saving practice in the Neonatal Intensive Care Unit (NICU). However, it requires great human effort for real-time monitoring, which calls for automated solutions to neonatal seizure detection. Moreover, the current automated methods focusing on adult epilepsy monitoring often fail due to (i) dynamic seizure onset location in human brains; (ii) different montages on neonates; and (iii) huge distribution shift among different subjects. In this paper, we propose a deep learning framework, namely STATENet, to address the exclusive challenges with exquisite designs at the temporal, spatial and model levels. The experiments over the real-world large-scale neonatal EEG dataset illustrate that our framework achieves significantly better seizure detection performance. Comment: Accepted in IEEE International Conference on Systems, Man, and Cybernetics (SMC) 202

arXiv.org e-Print Archive

Insight-HXMT on-orbit thermal control status and thermal deformation impact analysis

Author: Li Xiaobo
Li Xufang
Li Zhengwei
Liao Jinyuan
Liu Congzhan
Liu Xiaojing
Lu Fangjun
Lu Xuefeng
Luo Wenbo
Nie Jianyin
Qian Zhiying
Song Liming
Wang Juan
Wang Ruijie
Wang Yusa
Wu Di
Xu He
Xu Yupeng
Yang Sheng
Zhang Aimei
Zhang Fan
Zhang Shuangnan
Zhang Tong
Zhang Yifan
Zhang Yifei
Zhou Yupeng
Publication date: 11/11/2023

Purpose: The Hard X-ray Modulation Telescope is China's first X-ray astronomy satellite launched on June 15th, 2017, dubbed Insight-HXMT. Active and passive thermal control measures are employed to keep devices at suitable temperatures. In this paper, we analyzed the on-orbit thermal monitoring data of the first 5 years and investigated the effect of thermal deformation on the point spread function (PSF) of the telescopes. Methods: We examined the data of the on-orbit temperatures measured using 157 thermistors placed on the collimators, detectors and their support structures and compared the results with the thermal control requirements. The thermal deformation was evaluated by the relative orientation of the two star sensors installed on the main support structure. Its effect was estimated with the evolution of the PSF obtained with calibration scanning observations of the Crab nebula. Conclusion: The on-orbit temperatures met the thermal control requirements thus far, and the effect of thermal deformation on the PSF was negligible after the on-orbit pointing calibration. Comment: 25 pages, 35 figures, submitte

arXiv.org e-Print Archive

Overview to the Hard X-ray Modulation Telescope (Insight-HXMT) Satellite

Author: Bu QingCui
Cai Ce
Cao XueLei
Chang Zhi
Chen Gang
Chen Li
Chen TianXiang
Chen Wei
Chen YiBao
Chen Yong
Chen YuPeng
Cui Wei
Cui WeiWei
Deng JingKang
Dong YongWei
Du YuanYuan
Fu MinXue
Gao GuanHua
Gao He
Gao Min
Ge MingYu
Gu YuDong
Guan Ju
Gungor Can
Guo ChengCheng
Han DaWei
Hu Wei
Huang Yan
Huang Yue
Huo Jia
Jia ShuMei
Jiang LuHua
Jiang WeiChun
Jin Jing
Jin YongJie
Kong Lingda
Li Bing
Li ChengKui
Li Gang
Li MaoShun
Li TiPei
Li Wei
Li Xian
Li XiaoBo
Li XuFang
Li YanGuo
Li ZhengWei
Li ZiJian
Liang XiaoHua
Liao JinYuan
Liu Baisheng
Liu CongZhan
Liu GuoQing
Liu HongWei
Liu ShaoZhen
Liu XiaoJing
Liu YiNong
Liu Yuan
Lu Bo
Lu FangJun
Lu XueFeng
Luo Qi
Luo Tao
Ma Xiang
Meng Bin
Nang Yi
Nie JianYin
Ou Ge
Qu JinLu
Sai Na
Shang RenCheng
Shen GuoHong
Song LiMing
Song XinYing
Sun Liang
Tan Ying
Tao Lian
Tao WenHui
Tuo YouLi
Wang ChunQin
Wang GuoFeng
Wang HuanYu
Wang Juan
Wang WenShuai
Wang YuSa
Wen XiangYang
Wu BoBing
Wu Mei
Xiao GuangCheng
Xiao Shuo
Xiong ShaoLin
Xu He
Xu YuPeng
Yan LinLi
Yang JiaWei
Yang Sheng
Yang YanJi
Yi Qibin
Yu JiaXi
Yuan Bin
Zhang AiMei
Zhang ChengMo
Zhang ChunLei
Zhang Fan
Zhang HongMei
Zhang Juan
Zhang Liang
Zhang Qiang
Zhang ShenYi
Zhang Shu
Zhang ShuangNan
Zhang Tong
Zhang WanChang
Zhang Wei
Zhang WenZhao
Zhang Yi
Zhang YiFei
Zhang YongJie
Zhang Yue
Zhang Zhao
Zhang Zhi
Zhang ZiLiang
Zhao HaiSheng
Zhao JianLing
Zhao XiaoFan
Zheng ShiJie
Zhu Yue
Zhu YuXuan
Zhuang Renlin
Zou ChangLin
Publication venue: Springer Science and Business Media LLC
Publication date: 21/10/2019

As China's first X-ray astronomical satellite, the Hard X-ray Modulation Telescope (HXMT), which was dubbed Insight-HXMT after the launch on June 15, 2017, is a wide-band (1-250 keV) slat-collimator-based X-ray astronomy satellite with the capability of all-sky monitoring in 0.2-3 MeV. It was designed to perform pointing, scanning and gamma-ray burst (GRB) observations and, based on the Direct Demodulation Method (DDM), the image of the scanned sky region can be reconstructed. Here we give an overview of the mission and its progress, including payload, core sciences, ground calibration/facility, ground segment, data archive, software, in-orbit performance, calibration, background model, observations and some preliminary results. Comment: 29 pages, 40 figures, 6 tables, to appear in Sci. China-Phys. Mech. Astron. arXiv admin note: text overlap with arXiv:1910.0443

arXiv.org e-Print Archive

İstanbul Üniversitesi Açık Erişim Sistemi

Insight-HXMT observations of Swift J0243.6+6124 during its 2017-2018 outburst

Author: Chang Zhi
Chen Gang
Chen Li
Chen Tianxiang
Chen Yibao
Chen Yong
Cui Wei
Cui Weiwei
Deng Jingkang
Dong Yongwei
Doroshenko Victor
Du Yuanyuan
Fu Minxue
Gao Guanhua
Gao He
Gao Min
Ge Mingyu
Gu Yudong
Guan Ju
Guo Chengcheng
Güngör Can
Han Dawei
Hu Wei
Huang Yue
Huo Jia
Ji Long
Jia Shumei
Jiang Luhua
Jiang Weichun
Jin Jing
Jin Yongjie
Li Bing
Li Chengkui
Li Gang
Li Maoshun
Li Wei
Li Xian
Li Xiaobo
Li Xufang
Li Yanguo
Li Zhengwei
Li Zijian
Liang Xiaohua
Liao Jinyuan
Liu Congzhan
Liu Guoqing
Liu Hongwei
Liu Shaozhen
Liu Xiaojing
Liu Yinong
Liu Yuan
Lu Bo
Lu Xuefeng
Luo Tao
Ma Xiang
Meng Bin
Nang Yi
Nie Jianyin
Ou Ge
Qu Jinlu
Sai Na
Santangelo Andrea
Shi Changsheng
Song Liming
Sun Liang
Tan Ying
Tao Lian
Tao Wenhui
Tuo Youli
Wang Guofeng
Wang Huanyu
Wang Juan
Wang Wenshuai
Wang Yusa
Wen Xiangyang
Wu Bobing
Wu Mei
Xiao Guangcheng
Xiong Shaolin
Xu He
Xu Yupeng
Yan Linli
Yang Jiawei
Yang Sheng
Yang Yanji
Zhang Aimei
Zhang Chengmo
Zhang Chunlei
Zhang Fan
Zhang Hongmei
Zhang Juan
Zhang Shu
Zhang Shuangnan
Zhang Tong
Zhang Wanchang
Zhang Wei
Zhang Wenzhao
Zhang Yi
Zhang Yifei
Zhang Yongjie
Zhang Yue
Zhang Zhao
Zhang Ziliang
Zhao Haisheng
Zhao Jianling
Zhao Xiaofan
Zheng Shijie
Zhu Yue
Zhu Yuxuan
Zou Changlin
Publication venue: American Astronomical Society
Publication date: 01/01/2019

The recently discovered neutron star transient Swift J0243.6+6124 has been monitored by the Hard X-ray Modulation Telescope (Insight-HXMT). Based on the obtained data, we investigate the broadband spectrum of the source throughout the outburst. We estimate the broadband flux of the source and search for possible cyclotron line in the broadband spectrum. No evidence of line-like features is, however, found up to $150~\mathrm{keV}$. In the absence of any cyclotron line in its energy spectrum, we estimate the magnetic field of the source based on the observed spin evolution of the neutron star by applying two accretion torque models. In both cases, we get consistent results with $B \sim 10^{13}~\mathrm{G}$, $D \sim 6~\mathrm{kpc}$ and peak luminosity of $>10^{39}~\mathrm{erg~s^{-1}}$, which makes the source the first Galactic ultraluminous X-ray source hosting a neutron star. Comment: publishe

arXiv.org e-Print Archive

İstanbul Üniversitesi Açık Erişim Sistemi