Search CORE

48 research outputs found

Learning with Silver Standard Data for Zero-shot Relation Extraction

Author: Wang Jianwei
Wang Tianyin
Zeng Ziqian
Publication venue
Publication date: 24/11/2022
Field of study

The superior performance of supervised relation extraction (RE) methods heavily relies on a large amount of gold standard data. Recent zero-shot relation extraction methods converted the RE task to other NLP tasks and used off-the-shelf models of these NLP tasks to directly perform inference on the test data without using a large amount of RE annotation data. A potentially valuable by-product of these methods is the large-scale silver standard data. However, there is no further investigation on the use of potentially valuable silver standard data. In this paper, we propose to first detect a small amount of clean data from silver standard data and then use the selected clean data to finetune the pretrained model. We then use the finetuned model to infer relation types. We also propose a class-aware clean data detection module to consider class information when selecting clean data. The experimental results show that our method can outperform the baseline by 12% and 11% on TACRED and Wiki80 dataset in the zero-shot RE task. By using extra silver standard data of different distributions, the performance can be further improved.Comment: 21 pages, 6 figure

arXiv.org e-Print Archive

Network Traffic Classification Based on External Attention by IP Packet Header

Author: Hu Yahui
Song Junping
Xu Luyang
Zeng Ziqian
Zhou Xu
Publication venue
Publication date: 17/09/2023
Field of study

As the emerging services have increasingly strict requirements on quality of service (QoS), such as millisecond network service latency ect., network traffic classification technology is required to assist more advanced network management and monitoring capabilities. So far as we know, the delays of flow-granularity classification methods are difficult to meet the real-time requirements for too long packet-waiting time, whereas the present packet-granularity classification methods may have problems related to privacy protection due to using excessive user payloads. To solve the above problems, we proposed a network traffic classification method only by the IP packet header, which satisfies the requirements of both user's privacy protection and classification performances. We opted to remove the IP address from the header information of the network layer and utilized the remaining 12-byte IP packet header information as input for the model. Additionally, we examined the variations in header value distributions among different categories of network traffic samples. And, the external attention is also introduced to form the online classification framework, which performs well for its low time complexity and strong ability to enhance high-dimensional classification features. The experiments on three open-source datasets show that our average accuracy can reach upon 94.57%, and the classification time is shortened to meet the real-time requirements (0.35ms for a single packet).Comment: 12 pages, 5 figure

arXiv.org e-Print Archive

Adaptive Policy with Wait- $k$ Model for Simultaneous Translation

Author: Fan Kai
Huang Zhongqiang
Luo Wei
Wang Shushu
Wu Jing
Zeng Ziqian
Zhao Libo
Publication venue
Publication date: 23/10/2023
Field of study

Simultaneous machine translation (SiMT) requires a robust read/write policy in conjunction with a high-quality translation model. Traditional methods rely on either a fixed wait-

k

policy coupled with a standalone wait-

k

translation model, or an adaptive policy jointly trained with the translation model. In this study, we propose a more flexible approach by decoupling the adaptive policy model from the translation model. Our motivation stems from the observation that a standalone multi-path wait-

k

model performs competitively with adaptive policies utilized in state-of-the-art SiMT approaches. Specifically, we introduce DaP, a divergence-based adaptive policy, that makes read/write decisions for any translation model based on the potential divergence in translation distributions resulting from future information. DaP extends a frozen wait-

k

model with lightweight parameters, and is both memory and computation efficient. Experimental results across various benchmarks demonstrate that our approach offers an improved trade-off between translation accuracy and latency, outperforming strong baselines.Comment: Accept to EMNLP 2023 main conference. 17 pages, 12 figures, 5 table

arXiv.org e-Print Archive

Expression of ALCAM in clinical colon cancer and relationship with patients' treatment responses

Author: Fang Ziqian
Hargest Rachel
Jiang Wen
Lane Jane
Ruge Fiona
Yang Yiming
Zeng Jimmy Jianyuan
Publication venue: International Institute of Anticancer Research
Publication date: 30/06/2023
Field of study

Background/Aim: Activated leukocyte cell adhesion molecule (ALCAM) plays an important role in cancer via its homotypical and heterotypical interactions with ALCAM or other proteins and can also mediate cell-cell interactions. The present study investigated the expression of ALCAM in relation to epithelial–to–mesenchymal transition (EMT) markers and its downstream signal proteins including Ezrin-Moesin-Radixin (ERM), in clinical colon cancer and in the progression of the disease. Materials and Methods: Expression of ALCAM was determined in a clinical colon cancer cohort and assessed against the clinical pathological factors and outcome, together with the expression patterns of the ERM family and EMT markers. ALCAM protein was detected using immunohistochemistry. Cell line models, with ALCAM knock-down and over-expression, were established and used to test cells’ responses to drugs. Results: Tumours from patients who had distant metastasis and died of colon cancer had low levels of ALCAM. Dukes B and C tumours also had lower ALCAM expression than Dukes A tumours. Patients with high levels of ALCAM had a significantly longer overall and disease-free survival than those with lower ALCAM levels (p=0.040 and p=0.044). ALCAM is not only significantly correlated with SNAI1 and TWIST, also positively correlated with SNAI2. ALCAM enhanced the adhesiveness of colorectal cancer, an effect inhibited by both sALCAM and SRC inhibitors. Finally, high ALCAM expression rendered cells resistant, especially to 5-fluorouracil. Conclusion: Reduced expression of ALCAM in colon cancer is an indicator of disease progression and a poor prognostic indicator for patient’s survival. However, ALCAM can enhance the adhesion ability of cancer cells and render them resistant to chemotherapy drugs

Online Research @ Cardiff

Fluorescently conjugated annular fibrin clot for multiplexed real-time digestion analysis

Author: Alves Nathan J.
Hall Abigail
Linnemann Amelia K.
Muralidharan Charanya
Nallan Chakravarthula Tanmaye
Zeng Ziqian
Publication venue: 'Royal Society of Chemistry (RSC)'
Publication date: 01/12/2021
Field of study

Impaired fibrinolysis has long been considered as a risk factor for venous thromboembolism. Fibrin clots formed at physiological concentrations are promising substrates for monitoring fibrinolytic performance as they offer clot microstructures resembling in vivo. Here we introduce a fluorescently labeled fibrin clot lysis assay which leverages a unique annular clot geometry assayed using a microplate reader. A physiologically relevant fibrin clotting formulation was explored to achieve high assay sensitivity while minimizing labeling impact as fluorescence isothiocyanate (FITC)-fibrin(ogen) conjugations significantly affect both fibrin polymerization and fibrinolysis. Clot characteristics were examined using thromboelastography (TEG), turbidity, scanning electron microscopy, and confocal microscopy. Sample fibrinolytic activities at varying plasmin, plasminogen, and tissue plasminogen activator (tPA) concentrations were assessed in the present study and results were compared to an S2251 chromogenic assay. The optimized physiologically relevant clot substrate showed minimal reporter-conjugation impact with nearly physiological clot properties. The assay demonstrated good reproducibility, wide working range, kinetic read ability, low limit of detection, and the capability to distinguish fibrin binding-related lytic performance. In combination with its ease for multiplexing, it also has applications as a convenient platform for assessing patient fibrinolytic potential and screening thrombolytic drug activities in personalized medical applications

IUPUIScholarWorks