Search CORE

20 research outputs found

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Author: He Yinan
Li Kunchang
Li Yizhuo
Qiao Yu
Wang Limin
Wang Yali
Wang Yi
Publication venue
Publication date: 28/03/2023
Field of study

Video Foundation Models (VFMs) have received limited exploration due to high computational costs and data scarcity. Previous VFMs rely on Image Foundation Models (IFMs), which face challenges in transferring to the video domain. Although VideoMAE has trained a robust ViT from limited data, its low-level reconstruction poses convergence difficulties and conflicts with high-level cross-modal alignment. This paper proposes a training-efficient method for temporal-sensitive VFMs that integrates the benefits of existing methods. To increase data efficiency, we mask out most of the low-semantics video tokens, but selectively align the unmasked tokens with IFM, which serves as the UnMasked Teacher (UMT). By providing semantic guidance, our method enables faster convergence and multimodal friendliness. With a progressive pre-training framework, our model can handle various tasks including scene-related, temporal-related, and complex video-language understanding. Using only public sources for pre-training in 6 days on 32 A100 GPUs, our scratch-built ViT-L/16 achieves state-of-the-art performances on various video tasks. The code and models will be released at https://github.com/OpenGVLab/unmasked_teacher.Comment: 16 pages, 5 figures, 28 table

arXiv.org e-Print Archive

Harvest Video Foundation Models via Efficient Post-Pretraining

Author: He Yinan
Li Kunchang
Li Yizhuo
Luo Ping
Qiao Yu
Wang Limin
Wang Yali
Wang Yi
Publication venue
Publication date: 30/10/2023
Field of study

Building video-language foundation models is costly and difficult due to the redundant nature of video data and the lack of high-quality video-language datasets. In this paper, we propose an efficient framework to harvest video foundation models from image ones. Our method is intuitively simple by randomly dropping input video patches and masking out input text during the post-pretraining procedure. The patch dropping boosts the training efficiency significantly and text masking enforces the learning of cross-modal fusion. We conduct extensive experiments to validate the effectiveness of our method on a wide range of video-language downstream tasks including various zero-shot tasks, video question answering, and video-text retrieval. Despite its simplicity, our method achieves state-of-the-art performances, which are comparable to some heavily pretrained video foundation models. Our method is extremely efficient and can be trained in less than one day on 8 GPUs, requiring only WebVid-10M as pretraining data. We hope our method can serve as a simple yet strong counterpart for prevalent video foundation models, provide useful insights when building them, and make large pretrained models more accessible and sustainable. This is part of the InternVideo project \url{https://github.com/OpenGVLab/InternVideo}

arXiv.org e-Print Archive

Mendelian randomization supports genetic liability to hospitalization for COVID-19 as a risk factor of pre-eclampsia

Author: Baoxing Liu
Junning Zhang
Weizhen Wu
Weizhen Wu
Xuezhi Rao
Yizhuo Qiao
Yuehan Ren
Zhijie Xu
Publication venue: Frontiers Media S.A.
Publication date: 01/03/2024
Field of study

BackgroundPre-eclampsia and eclampsia are among the major threats to pregnant women and fetuses, but they can be mitigated by prevention and early screening. Existing observational research presents conflicting evidence regarding the causal effects of coronavirus disease 2019 (COVID-19) on pre-eclampsia risk. Through Mendelian randomization (MR), this study aims to investigate the causal effect of three COVID-19 severity phenotypes on the risk of pre-eclampsia and eclampsia to provide more rigorous evidence.MethodsTwo-sample MR was utilized to examine causal effects. Summary-level data from genome-wide association studies (GWAS) of individuals of European ancestry were acquired from the GWAS catalog and FinnGen databases. Single-nucleotide polymorphisms associated with COVID-19 traits at p < 5 × −8 were obtained and pruned for linkage disequilibrium to generate instrumental variables for COVID-19. Inverse variance weighted estimates were used as the primary MR results, with weighted median and MR-Egger as auxiliary analyses. The robustness of the MR findings was also evaluated through sensitivity analyses. Bonferroni correction was applied to primary results, with a p < 0.0083 considered significant evidence and a p within 0.083–0.05 considered suggestive evidence.ResultsCritical ill COVID-19 [defined as hospitalization for COVID-19 with either a death outcome or respiratory support, OR (95% CI): 1.17 (1.03–1.33), p = 0.020] and hospitalized COVID-19 [defined as hospitalization for COVID-19, OR (95% CI): 1.10 (1.01–1.19), p = 0.026] demonstrated suggestive causal effects on pre-eclampsia, while general severe acute respiratory syndrome coronavirus 2 infection did not exhibit a significant causal effect on pre-eclampsia. None of the three COVID-19 severity phenotypes exhibited a significant causal effect on eclampsia.ConclusionsOur investigation demonstrates a suggestive causal effect of genetic susceptibility to critical ill COVID-19 and hospitalized COVID-19 on pre-eclampsia. The COVID-19 severity exhibited a suggestive positive dose–response relationship with the risk of pre-eclampsia. Augmented attention should be paid to pregnant women hospitalized for COVID-19, especially those needing respiratory support

Directory of Open Access Journals

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Author: Chen Xinyuan
He Yinan
Li Kunchang
Li Yizhuo
Liu Ziwei
Luo Ping
Ma Xin
Qiao Yu
Wang Limin
Wang Yali
Wang Yaohui
Wang Yi
Yu Jiashuo
Publication venue
Publication date: 13/07/2023
Field of study

This paper introduces InternVid, a large-scale video-centric multimodal dataset that enables learning powerful and transferable video-text representations for multimodal understanding and generation. The InternVid dataset contains over 7 million videos lasting nearly 760K hours, yielding 234M video clips accompanied by detailed descriptions of total 4.1B words. Our core contribution is to develop a scalable approach to autonomously build a high-quality video-text dataset with large language models (LLM), thereby showcasing its efficacy in learning video-language representation at scale. Specifically, we utilize a multi-scale approach to generate video-related descriptions. Furthermore, we introduce ViCLIP, a video-text representation learning model based on ViT-L. Learned on InternVid via contrastive learning, this model demonstrates leading zero-shot action recognition and competitive video retrieval performance. Beyond basic video understanding tasks like recognition and retrieval, our dataset and model have broad applications. They are particularly beneficial for generating interleaved video-text data for learning a video-centric dialogue system, advancing video-to-text and text-to-video generation research. These proposed resources provide a tool for researchers and practitioners interested in multimodal video understanding and generation.Comment: Data and Code: https://github.com/OpenGVLab/InternVideo/tree/main/Data/InternVi

arXiv.org e-Print Archive

Expert Consensus on Microtransplant for Acute Myeloid Leukemia in Elderly Patients -Report From the International Microtransplant Interest Group

Author: Ai Huisheng
Blaise Didier
Cai Bo
Chao Nelson J
Chaudhary Preet M
Chen Baoan
Chen Jieping
Chen Wenming
Chen Xiequn
Ding Banghe
Duarte Rafael F
Fan Chuanbo
Fuchs Ephraim
Gao Chunji
Gergis Usama
Gottlieb David
Guo Mei
Han Weidong
Hu Kaixun
Hu Liangding
Huang He
Huang Xiaojun
Huang Yajing
Jia Yongqian
Jin Jie
Keating Armand
Krakow Elizabeth F
Li Fei
Li Jianyong
Li Lijuan
Liu Daihong
Liu Kaiyan
Liu Mingyao
Liu Qifa
Lu Xuzhang
Ma Jun
Ma Liangming
Qiao Jianhui
Reagan John
Rizzieri David A
Schultz Kirk R
Shen Xuliang
Shi Xiaodong
Spitzer Thomas R
Sun Jie
Sun Qiyun
Sun Xiuli
Sun Zimin
Tian Yuyang
Wang Jianmin
Wang Jianxiang
Wang Jingbo
Wang Zhao
Wei Li
Wu Depei
Wu Xiaoxiong
Xia Bing
Xiao Yi
Xiao Zhijian
Yang Linhua
Yu Changlin
Zhan Xinrong
Zhang Bolong
Zhang Liansheng
Zhang Mei
Zhang Weijing
Zhang Xi
Zhang Xiaohui
Zhang Yizhuo
Zhang Yongqing
Zheng Bo
Zhou Daobin
Zhou Fan
Zhu Jun
Publication venue: Jefferson Digital Commons
Publication date: 31/03/2023
Field of study

Recent studies have shown that microtransplant (MST) could improve outcome of patients with elderly acute myeloid leukemia (EAML). To further standardize the MST therapy and improve outcomes in EAML patients, based on analysis of the literature on MST, especially MST with EAML from January 1st, 2011 to November 30th, 2022, the International Microtransplant Interest Group provides recommendations and considerations for MST in the treatment of EAML. Four major issues related to MST for treating EAML were addressed: therapeutic principle of MST (1), candidates for MST (2), induction chemotherapy regimens (3), and post-remission therapy based on MST (4). Others included donor screening, infusion of donor cells, laboratory examinations, and complications of treatment

Jefferson Digital Commons

A Novel Address-Matching Framework Based on Region Proposal

Author: Chengbo Wang
Linlin Liang
Yanyou Qiao
Yizhuo Quan
Yuanfei Chang
Publication venue: MDPI AG
Publication date: 01/04/2024
Field of study

Geocoding is a fundamental component of geographic information science that plays a crucial role in various geographical studies and applications involving text data. Current mainstream geocoding methods fall into two categories: geodesic-grid prediction and address matching. However, the geodesic-grid-prediction method’s localization accuracy is hindered by the density of grid partitioning, struggling to strike a balance between prediction accuracy and grid density. Address-matching methods mainly focus on the semantics of query text. However, they tend to ignore keyword information that can be used to distinguish candidates and introduce potential interference, which reduces matching accuracy. Inspired by the human map-usage process, we propose a two-stage address-matching approach that integrates geodesic-grid prediction and text-matching models. Initially, a multi-level text-classification model is used to generate a retrieval region proposal for an input query text. Subsequently, we search for the most relevant point of interest (POI) within the region-proposal area using a semantics-based text-retrieval model. We evaluated the proposed method using POI data from the Beijing Chaoyang District. The experimental results indicate that the proposed method provides high address-matching accuracy, increasing Recall@1 by 0.55 to 1.56 percentage points and MRR@5 by 0.54 to 1.68 percentage points

Directory of Open Access Journals

Detection of Genetic Mutations by Next-Generation Sequencing for Predicting Prognosis of Extensive-Stage Small-Cell Lung Cancer

Author: Baohui Han
Dongfang Chen
Jianlin Xu
Rong Qiao
Runbo Zhong
Tianqing Chu
Yizhuo Zhao
Publication venue: 'Hindawi Limited'
Publication date: 01/01/2020
Field of study

Some studies have revealed that specific genetic mutations could be associated with chemotherapy response or even survival in small-cell lung cancer (SCLC). Our retrospective study aimed to identify the correlation between genetic mutations and progression-free survival (PFS) in extensive-stage SCLC after first-line chemotherapy. A total of 75 patients with extensive-stage SCLC confirmed by histopathology from February 2018 to February 2019 were retrospectively analyzed. The biopsy specimens of all patients were analyzed by Next-Generation Sequencing (NGS). All patients received first-line chemotherapy and follow-up at Shanghai Chest Hospital. Eleven genes were mutated in, at least, 10% of the 75 patients, including TP53 (96%), RB1 (77%), SMAD4 (32%), NOTCH1 (21%), PTEN (16%), FGFR1 (16%), KDR (15%), PIK3CA (15%), ROS1 (15%), BRCA2 (13%), and ERBB4 (10%). The median number of mutated genes among all patients was 5. Patients with more than 5 mutated genes (PFS = 6.7 months, P=0.004), mutant TP53 (PFS = 5.0 months, P=0.011), and mutant BRCA2 (PFS = 6.7 months, P=0.046) had better PFS after first-line chemotherapy than other patients. Multivariate Cox regression analysis showed that patients who achieved a PR (HR 3.729, 95% CI 2.038–6.822), had more than 5 mutated genes (HR 1.929, 95% CI 1.096–3.396), had BRCA2 mutations (HR 4.581, 95% CI 1.721–12.195), and had no liver metastasis (HR 0.415, 95% CI 0.181–0.951) showed improvements in PFS after first-line chemotherapy. In conclusion, the number of mutated genes and BRCA2 mutation status in extensive-stage SCLC were significantly related to PFS after first-line chemotherapy

Directory of Open Access Journals

VideoChat: Chat-Centric Video Understanding

Author: He Yinan
Li KunChang
Li Yizhuo
Luo Ping
Qiao Yu
Wang Limin
Wang Wenhai
Wang Yali
Wang Yi
Publication venue
Publication date: 10/05/2023
Field of study

In this study, we initiate an exploration into video understanding by introducing VideoChat, an end-to-end chat-centric video understanding system. It integrates video foundation models and large language models via a learnable neural interface, excelling in spatiotemporal reasoning, event localization, and causal relationship inference. To instructively tune this system, we propose a video-centric instruction dataset, composed of thousands of videos matched with detailed descriptions and conversations. This dataset emphasizes spatiotemporal reasoning and causal relationships, providing a valuable asset for training chat-centric video understanding systems. Preliminary qualitative experiments reveal our system's potential across a broad spectrum of video applications and set the standard for future research. Access our code and data at https://github.com/OpenGVLab/Ask-AnythingComment: Technical repor

arXiv.org e-Print Archive

Table3_Mendelian randomization supports genetic liability to hospitalization for COVID-19 as a risk factor of pre-eclampsia.xlsx

Author: Baoxing Liu (4369039)
Junning Zhang (8614701)
Weizhen Wu (2057200)
Xuezhi Rao (17354749)
Yizhuo Qiao (18123676)
Yuehan Ren (15467393)
Zhijie Xu (3929918)
Publication venue
Publication date: 08/03/2024
Field of study

BackgroundPre-eclampsia and eclampsia are among the major threats to pregnant women and fetuses, but they can be mitigated by prevention and early screening. Existing observational research presents conflicting evidence regarding the causal effects of coronavirus disease 2019 (COVID-19) on pre-eclampsia risk. Through Mendelian randomization (MR), this study aims to investigate the causal effect of three COVID-19 severity phenotypes on the risk of pre-eclampsia and eclampsia to provide more rigorous evidence.MethodsTwo-sample MR was utilized to examine causal effects. Summary-level data from genome-wide association studies (GWAS) of individuals of European ancestry were acquired from the GWAS catalog and FinnGen databases. Single-nucleotide polymorphisms associated with COVID-19 traits at p −8 were obtained and pruned for linkage disequilibrium to generate instrumental variables for COVID-19. Inverse variance weighted estimates were used as the primary MR results, with weighted median and MR-Egger as auxiliary analyses. The robustness of the MR findings was also evaluated through sensitivity analyses. Bonferroni correction was applied to primary results, with a p ResultsCritical ill COVID-19 [defined as hospitalization for COVID-19 with either a death outcome or respiratory support, OR (95% CI): 1.17 (1.03–1.33), p = 0.020] and hospitalized COVID-19 [defined as hospitalization for COVID-19, OR (95% CI): 1.10 (1.01–1.19), p = 0.026] demonstrated suggestive causal effects on pre-eclampsia, while general severe acute respiratory syndrome coronavirus 2 infection did not exhibit a significant causal effect on pre-eclampsia. None of the three COVID-19 severity phenotypes exhibited a significant causal effect on eclampsia.ConclusionsOur investigation demonstrates a suggestive causal effect of genetic susceptibility to critical ill COVID-19 and hospitalized COVID-19 on pre-eclampsia. The COVID-19 severity exhibited a suggestive positive dose–response relationship with the risk of pre-eclampsia. Augmented attention should be paid to pregnant women hospitalized for COVID-19, especially those needing respiratory support.</p

FigShare

Table2_Mendelian randomization supports genetic liability to hospitalization for COVID-19 as a risk factor of pre-eclampsia.xlsx

Author: Baoxing Liu (4369039)
Junning Zhang (8614701)
Weizhen Wu (2057200)
Xuezhi Rao (17354749)
Yizhuo Qiao (18123676)
Yuehan Ren (15467393)
Zhijie Xu (3929918)
Publication venue
Publication date: 08/03/2024
Field of study

BackgroundPre-eclampsia and eclampsia are among the major threats to pregnant women and fetuses, but they can be mitigated by prevention and early screening. Existing observational research presents conflicting evidence regarding the causal effects of coronavirus disease 2019 (COVID-19) on pre-eclampsia risk. Through Mendelian randomization (MR), this study aims to investigate the causal effect of three COVID-19 severity phenotypes on the risk of pre-eclampsia and eclampsia to provide more rigorous evidence.MethodsTwo-sample MR was utilized to examine causal effects. Summary-level data from genome-wide association studies (GWAS) of individuals of European ancestry were acquired from the GWAS catalog and FinnGen databases. Single-nucleotide polymorphisms associated with COVID-19 traits at p −8 were obtained and pruned for linkage disequilibrium to generate instrumental variables for COVID-19. Inverse variance weighted estimates were used as the primary MR results, with weighted median and MR-Egger as auxiliary analyses. The robustness of the MR findings was also evaluated through sensitivity analyses. Bonferroni correction was applied to primary results, with a p ResultsCritical ill COVID-19 [defined as hospitalization for COVID-19 with either a death outcome or respiratory support, OR (95% CI): 1.17 (1.03–1.33), p = 0.020] and hospitalized COVID-19 [defined as hospitalization for COVID-19, OR (95% CI): 1.10 (1.01–1.19), p = 0.026] demonstrated suggestive causal effects on pre-eclampsia, while general severe acute respiratory syndrome coronavirus 2 infection did not exhibit a significant causal effect on pre-eclampsia. None of the three COVID-19 severity phenotypes exhibited a significant causal effect on eclampsia.ConclusionsOur investigation demonstrates a suggestive causal effect of genetic susceptibility to critical ill COVID-19 and hospitalized COVID-19 on pre-eclampsia. The COVID-19 severity exhibited a suggestive positive dose–response relationship with the risk of pre-eclampsia. Augmented attention should be paid to pregnant women hospitalized for COVID-19, especially those needing respiratory support.</p

FigShare