Search CORE

55 research outputs found

D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Author: Chang Angel X.
Chen Dave Zhenyu
Nießner Matthias
Wu Qirui
Publication venue
Publication date: 22/07/2022
Field of study

Recent studies on dense captioning and visual grounding in 3D have achieved impressive results. Despite developments in both areas, the limited amount of available 3D vision-language data causes overfitting issues for 3D visual grounding and 3D dense captioning methods. Also, how to discriminatively describe objects in complex 3D environments is not fully studied yet. To address these challenges, we present D3Net, an end-to-end neural speaker-listener architecture that can detect, describe and discriminate. Our D3Net unifies dense captioning and visual grounding in 3D in a self-critical manner. This self-critical property of D3Net also introduces discriminability during object caption generation and enables semi-supervised training on ScanNet data with partially annotated descriptions. Our method outperforms SOTA methods in both tasks on the ScanRefer dataset, surpassing the SOTA 3D dense captioning method by a significant margin.Comment: Project website: https://daveredrum.github.io/D3Net

arXiv.org e-Print Archive

QoS-Oriented Sensing-Communication-Control Co-Design for UAV-Enabled Positioning

Author: Han Lincong
Liu Qirui
Liu Rongke
Thompson John
Wu Yuan
Zijie Wang
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/03/2023
Field of study

Edinburgh Research Explorer

Dual Modality Prompt Tuning for Vision-Language Pre-Trained Model

Author: Cheng De
Liang Guoqiang
Wang Peng
Wu Qirui
Xing Yinghui
Zhang Shizhou
Zhang Yanning
Publication venue
Publication date: 16/02/2023
Field of study

With the emergence of large pre-trained vison-language model like CLIP, transferable representations can be adapted to a wide range of downstream tasks via prompt tuning. Prompt tuning tries to probe the beneficial information for downstream tasks from the general knowledge stored in the pre-trained model. A recently proposed method named Context Optimization (CoOp) introduces a set of learnable vectors as text prompt from the language side. However, tuning the text prompt alone can only adjust the synthesized "classifier", while the computed visual features of the image encoder can not be affected , thus leading to sub-optimal solutions. In this paper, we propose a novel Dual-modality Prompt Tuning (DPT) paradigm through learning text and visual prompts simultaneously. To make the final image feature concentrate more on the target visual concept, a Class-Aware Visual Prompt Tuning (CAVPT) scheme is further proposed in our DPT, where the class-aware visual prompt is generated dynamically by performing the cross attention between text prompts features and image patch token embeddings to encode both the downstream task-related information and visual instance information. Extensive experimental results on 11 datasets demonstrate the effectiveness and generalization ability of the proposed method. Our code is available in https://github.com/fanrena/DPT.Comment: 12 pages, 7 figure

arXiv.org e-Print Archive

Taxonomic and phylogenetic characterisations of six species of Pleosporales (in Didymosphaeriaceae, Roussoellaceae and Nigrogranaceae) from China

Author: Hongmin Hu
Jichuan Kang
Lili Liu
Minghui He
Nalin N. Wijayawardene
Qingde Long
Qirui Li
Sihan Long
Xiangchun Shen
Xu Zhang
Youpeng Wu
Zebin Meng
Publication venue: Pensoft Publishers
Publication date: 01/01/2023
Field of study

Pleosporales comprise a diverse group of fungi with a global distribution and significant ecological importance. A survey on Pleosporales (in Didymosphaeriaceae, Roussoellaceae and Nigrogranaceae) in Guizhou Province, China, was conducted. Specimens were identified, based on morphological characteristics and phylogenetic analyses using a dataset composed of ITS, LSU, SSU, tef1 and rpb2 loci. Maximum Likelihood (ML) and Bayesian analyses were performed. As a result, three new species (Neokalmusia karka, Nigrograna schinifolium and N. trachycarpus) have been discovered, along with two new records for China (Roussoella neopustulans and R. doimaesalongensis) and a known species (Roussoella pseudohysterioides). Morphologically similar species and phylogenetically close taxa are compared and discussed. This study provides detailed information and descriptions of all newly-identified taxa

Directory of Open Access Journals

ARPHA OAI-PMH Endpoint

Corrigendum: Hu H et al. (2023) Taxonomic and phylogenetic characterisations of six species of Pleosporales (in Didymosphaeriaceae, Roussoellaceae and Nigrogranaceae) from China. MycoKeys 100: 123–151. https://doi.org/10.3897/mycokeys.100.109423

Author: Hongmin Hu
Jichuan Kang
Lili Liu
Minghui He
Nalin N. Wijayawardene
Qingde Long
Qirui Li
Sihan Long
Xiangchun Shen
Xu Zhang
Youpeng Wu
Zebin Meng
Publication venue: Pensoft Publishers
Publication date: 01/03/2024
Field of study

Four new species, Xynobius azonius sp. nov., X. brevifemora sp. nov., X. duoferus sp. nov., and X. stipitoides sp. nov., are described and illustrated, and one species X. geniculatus (Thomson, 1895) is newly reported from South Korea. Xynobius geniculatus (Thomson, 1895) is redescribed and illustrated, and a new combination, Xynobius (Stigmatopoea) cubitalis (Fischer, 1959), comb. nov. is suggested. An identification key to the Xynobius species known from South Korea is provided

Directory of Open Access Journals

Vertical Federated Learning Based Privacy-Preserving Cooperative Sensing in Cognitive Radio Networks

Author: Shikh-Bahaei Mohammad
Wu Qirui
Zhang Yirun
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/12/2020
Field of study

King's Research Portal

A Novel Sustainable Processing Mode for Burr Classified Prediction of Weak Rigid Drilling Process Using a Fusion Modeling Method

Author: Mingyu Wu
Qirui Yang
Siyi Ding
Xiaohu Zheng
Publication venue: 'MDPI AG'
Publication date: 17/06/2022
Field of study

Weakly rigid drilling systems, such as the industrial robot, are widely used in aerospace, military, and other fields due to its good flexibility and large scope of operation. However, the weak rigidity can easily cause burrs, seriously affecting the precision of parts and product performance. To reduce the heavy deburring process and to improve continuous production and sustainable processing capacity, accurate prediction of burr quality is a prerequisite. Traditional burr forming theory cannot accurately predict the drilling defects. Data-driven approaches can be independent of prior knowledge and discover relationships between process parameters and machining precision directly from the data structure itself. Therefore, to take advantage of both approaches, a fusion model was established for burr classified prediction. On the one hand, the drilling and burr forming process was firstly modeled, and preliminary classification results for burrs were calculated. On the other hand, according to the measured data, the errors between initial calculation results and actual classification results were obtained and selected as the tag values of dataset, which served as inputs for the error compensation model of burrs. Finally, by training the network of TCN–DNN using the drilling data, the burr classified prediction in a weak rigid hole-making system was realized. Experimental results showed that compared with traditional drilling theory, the prediction accuracy of the proposed model improved by 25%, reaching 91.67%. The results can provide a basis for judging the process of burr post-treatment, which has practical guiding significance. This method is beneficial to reduce the heavy deburring process and to improve sustainable processing capacity

Multidisciplinary Digital Publishing Institute