Search CORE

381 research outputs found

Context Modeling for Ranking and Tagging Bursty Features in Text Streams

Author: HE Jing
JIANG Jing
LI Xiaoming
Shan Dongdong
YAN Hongfei
ZHAO Xin
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2010
Field of study

Bursty features in text streams are very useful in many text mining applications. Most existing studies detect bursty features based purely on term frequency changes without taking into account the semantic contexts of terms, and as a result the detected bursty features may not always be interesting or easy to interpret. In this paper we propose to model the contexts of bursty features using a language modeling approach. We then propose a novel topic diversity-based metric using the context models to find newsworthy bursty features. We also propose to use the context models to automatically assign meaningful tags to bursty features. Using a large corpus of a stream of news articles, we quantitatively show that the proposed context language models for bursty features can effectively help rank bursty features based on their newsworthiness and to assign meaningful tags to annotate bursty features. ? 2010 ACM.EI

Crossref

Institutional Knowledge at Singapore Management University

TWO-STAGE MODEL SELECTION WITH PARAMETERS WEIGHTED HIDDEN MARKOV MODELS AND LIKELIHOOD RATIO FOR PART-OF-SPEECH TAGGING

Author: Hongbo Liu
Hongfei Lin
Hongfei Lin
Pixi Zhao
Pixi Zhao
Shichang Sun
Shichang Sun
Publication venue
Publication date: 10/04/2020
Field of study

Abstract: In many natural language processing applications two or more models usually have to be involved for accuracy. But it is difficult for minor models, such as "backoff" taggers in part-of-speech tagging, to cooperate smoothly with the major probabilistic model. We introduce a two-stage approach for model selection between hidden Markov models and other minor models. In the first stage, the major model is extended to give a set of candidates for model selection. Parameters weighted hidden Markov model is presented using weighted ratio to create the candidate set. In the second stage, heuristic rules and features are used as evaluation functions to give extra scores to candidates in the set. Such scores are calculated using a diagnostic likelihood ratio test based on sensitivity and specificity criteria. The selection procedure can be fulfilled using swarm optimization technique. Experiment results on public tagging data sets show the applicability of the proposed approach

CiteSeerX

Estimated ultimate recovery prediction of fractured horizontal wells in tight oil reservoirs based on deep neural networks

Author: Cheng Hongfei
Ding Chao
Liu Lingfu
Luo Shangui
Zhang Boning
Zhao Yulong
Publication venue: 'Yandy Scientific Press'
Publication date: 25/04/2022
Field of study

Accurate estimated ultimate recovery prediction of fractured horizontal wells in tight reservoirs is crucial to economic evaluation and oil field development plan formulation. Advances in artificial intelligence and big data have provided a new tool for rapid production prediction of unconventional reservoirs. In this study, the estimated ultimate recovery prediction model based on deep neural networks was established using the data of 58 horizontal wells in Mahu tight oil reservoirs. First, the estimated ultimate recovery of oil wells was calculated based on the stretched exponential production decline model and a five-region flow model. Then, the calculated estimated ultimate recovery, geological attributes, engineering parameters, and production data of each well were used to build a machine learning database. Before the model training, the number of input parameters was reduced from 14 to 9 by feature selection. The prediction accuracy of the model was improved by data normalization, the early stopping technique, and 10-fold cross validation. The optimal activation function, hidden layers, number of neurons in each layer, and learning rate of the deep neural network model were obtained through hyperparameter optimization. The average determination coefficient on the testing set was 0.73. The results indicate that compared with the traditional estimated ultimate recovery prediction methods, the established deep neural network model has the strengths of a simple procedure and low time consumption, and the deep neural network model can be easily updated to improve prediction accuracy when new well information is obtained.Cited as: Luo, S., Ding, C., Cheng, H., Zhang, B., Zhao, Y., Liu, L. Estimated ultimate recovery prediction of fractured horizontal wells in tight oil reservoirs based on deep neural networks. Advances in Geo-Energy Research, 2022, 6(2): 111-122. https://doi.org/10.46690/ager.2022.02.0

Yandy Scientific Press

A Transformer-Based Substitute Recommendation Model Incorporating Weakly Supervised Customer Behavior Data

Author: Fang Haoyang
Neppalli Naveen
Shi Xingjian
Yang Hongfei
Ye Wenting
Zhao Shuai
Publication venue
Publication date: 04/11/2022
Field of study

The substitute-based recommendation is widely used in E-commerce to provide better alternatives to customers. However, existing research typically uses the customer behavior signals like co-view and view-but-purchase-another to capture the substitute relationship. Despite its intuitive soundness, we find that such an approach might ignore the functionality and characteristics of products. In this paper, we adapt substitute recommendation into language matching problem by taking product title description as model input to consider product functionality. We design a new transformation method to de-noise the signals derived from production data. In addition, we consider multilingual support from the engineering point of view. Our proposed end-to-end transformer-based model achieves both successes from offline and online experiments. The proposed model has been deployed in a large-scale E-commerce website for 11 marketplaces in 6 languages. Our proposed model is demonstrated to increase revenue by 19% based on an online A/B experiment.Comment: 6 pages, 3 figures, 5 tables, accepted in 21st IEEE International Conference on Machine Learning and Application

arXiv.org e-Print Archive

Deletion of the meq gene significantly decreases immunosuppression in chickens caused by pathogenic marek's disease virus

Author: Cui Zhizhong
Li Yanpeng
Su Shuai
Sun Aijun
Zhao Peng
Zhu Hongfei
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background Marek's disease virus (MDV) causes an acute lymphoproliferative disease in chickens, resulting in immunosuppression, which is considered to be an integral aspect of the pathogenesis of Marek's disease (MD). A recent study showed that deletion of the Meq gene resulted in loss of transformation of T-cells in chickens and a Meq-null virus, rMd5ΔMeq, could provide protection superior to CVI988/Rispens. Results In the present study, to investigate whether the Meq-null virus could be a safe vaccine candidate, we constructed a Meq deletion strain, GX0101ΔMeq, by deleting both copies of the Meq gene from a pathogenic MDV, GX0101 strain, which was isolated in China. Pathogenesis experiments showed that the GX0101ΔMeq virus was fully attenuated in specific pathogen-free chickens because none of the infected chickens developed Marek's disease-associated lymphomas. The study also evaluated the effects of GX0101ΔMeq on the immune system in chickens after infection with GX0101ΔMeq virus. Immune system variables, including relative lymphoid organ weight, blood lymphocytes and antibody production following vaccination against AIV and NDV were used to assess the immune status of chickens. Experimental infection with GX0101ΔMeq showed that deletion of the Meq gene significantly decreased immunosuppression in chickens caused by pathogenic MDV. Conclusion These findings suggested that the Meq gene played an important role not only in tumor formation but also in inducing immunosuppressive effects in MDV-infected chickens.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue

Author: Jia Yuxiang
Xu Hongfei
Yang Songhua
Zan Hongying
Zhao Hanjie
Zhou Guangyu
Zhu Senbin
Publication venue
Publication date: 13/08/2023
Field of study

Recent advances in Large Language Models (LLMs) have achieved remarkable breakthroughs in understanding and responding to user intents. However, their performance lag behind general use cases in some expertise domains, such as Chinese medicine. Existing efforts to incorporate Chinese medicine into LLMs rely on Supervised Fine-Tuning (SFT) with single-turn and distilled dialogue data. These models lack the ability for doctor-like proactive inquiry and multi-turn comprehension and cannot always align responses with safety and professionalism experts. In this work, we introduce Zhongjing, the first Chinese medical LLaMA-based LLM that implements an entire training pipeline from pre-training to reinforcement learning with human feedback (RLHF). Additionally, we introduce a Chinese multi-turn medical dialogue dataset of 70,000 authentic doctor-patient dialogues, CMtMedQA, which significantly enhances the model's capability for complex dialogue and proactive inquiry initiation. We define a refined annotation rule and evaluation criteria given the biomedical domain's unique characteristics. Results show that our model outperforms baselines in various capacities and matches the performance of ChatGPT in a few abilities, despite having 50x training data with previous best model and 100x parameters with ChatGPT. RLHF further improves the model's instruction-following ability and safety.We also release our code, datasets and model for further research

arXiv.org e-Print Archive

Beyond linguistic cues: fine-grained conversational emotion recognition via belief-desire modelling

Author: Li L.ongjiao
Lin Hongfei
Luo Wei
Naseriparsa Mehdi
Xia Feng
Xu Bo
Zhao Zhehuan
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2024
Field of study

Emotion recognition in conversation (ERC) is essential for dialogue systems to identify the emotions expressed by speakers. Although previous studies have made significant progress, accurate recognition and interpretation of similar fine-grained emotion properly accounting for individual variability remains a challenge. One particular under-explored area is the role of individual beliefs and desires in modelling emotion. Inspired by the Belief-Desire Theory of Emotion, we propose a novel method for conversational emotion recognition that incorporates both belief and desire to accurately identify emotions. We extract emotion-eliciting events from utterances and construct graphs that represent beliefs and desires in conversations. By applying message passing between nodes, our graph effectively models the utterance context, speaker's global state, and the interaction between emotional beliefs, desires, and utterances. We evaluate our model's performance by conducting extensive experiments on four popular ERC datasets and comparing it with multiple state-of-the-art models. The experimental results demonstrate the superiority of our proposed model and validate the effectiveness of each module in the model. © 2024 ELRA Language Resource Association: CC BY-NC 4.0

Federation ResearchOnline