14,066 research outputs found
Role of sentiment classification in sentiment analysis: a survey
Through a survey of literature, the role of sentiment classification in sentiment analysis has been reviewed. The review identifies the research challenges involved in tackling sentiment classification. A total of 68 articles during 2015 – 2017 have been reviewed on six dimensions viz., sentiment classification, feature extraction, cross-lingual sentiment classification, cross-domain sentiment classification, lexica and corpora creation and multi-label sentiment classification. This study discusses the prominence and effects of sentiment classification in sentiment evaluation and a lot of further research needs to be done for productive results
An Intelligent Hybrid Sentiment Analyzer for Personal Protective Medical Equipments Based on Word Embedding Technique: The COVID-19 Era
Due to the accelerated growth of symmetrical sentiment data across different platforms,
experimenting with different sentiment analysis (SA) techniques allows for better decision-making
and strategic planning for different sectors. Specifically, the emergence of COVID-19 has enriched
the data of people’s opinions and feelings about medical products. In this paper, we analyze people’s
sentiments about the products of a well-known e-commerce website named Alibaba.com. People’s
sentiments are experimented with using a novel evolutionary approach by applying advanced
pre-trained word embedding for word presentations and combining them with an evolutionary
feature selection mechanism to classify these opinions into different levels of ratings. The proposed
approach is based on harmony search algorithm and different classification techniques including
random forest, k-nearest neighbor, AdaBoost, bagging, SVM, and REPtree to achieve competitive
results with the least possible features. The experiments are conducted on five different datasets
including medical gloves, hand sanitizer, medical oxygen, face masks, and a combination of all these
datasets. The results show that the harmony search algorithm successfully reduced the number of
features by 94.25%, 89.5%, 89.25%, 92.5%, and 84.25% for the medical glove, hand sanitizer, medical
oxygen, face masks, and whole datasets, respectively, while keeping a competitive performance in
terms of accuracy and root mean square error (RMSE) for the classification techniques and decreasing
the computational time required for classification
DrugExBERT for Pharmacovigilance – A Novel Approach for Detecting Drug Experiences from User-Generated Content
Pharmaceutical companies have to maintain drug safety through pharmacovigilance systems by monitoring various sources of information about adverse drug experiences. Recently, user-generated content (UGC) has emerged as a valuable source of real-world drug experiences, posing new challenges due to its high volume and variety. We present DrugExBERT, a novel approach to extract adverse drug experiences (adverse reaction, lack of effect) and supportive drug experiences (effectiveness, intervention, indication, and off-label use) from UGC. To be able to verify the extracted drug experiences, DrugExBERT additionally provides explications in the form of UGC phrases that were critical for the extraction. In our evaluation, we demonstrate that DrugExBERT outperforms state-of-the-art pharmacovigilance approaches as well as ChatGPT on several performance measures and that DrugExBERT is data- and drug-agnostic. Thus, our novel approach can help pharmaceutical companies meet their legal obligations and ethical responsibility while ensuring patient safety and monitoring drug effectiveness
Drug Reviews: Cross-condition and Cross-source Analysis by Review Quantification Using Regional CNN-LSTM Models
Pharmaceutical drugs are usually rated by customers or patients (i.e. in a scale from 1 to 10). Often, they also give reviews or comments on the drug and its side effects. It is desirable to quantify the reviews to help analyze drug favorability in the market, in the absence of ratings. Since these reviews are in the form of text, we should use lexical methods for the analysis. The intent of this study was two-fold: First, to understand how better the efficiency will be if CNN-LSTM models are used to predict ratings or sentiment from reviews. These models are known to perform better than usual machine learning models in the case of textual data sequences. Second, how effective is it to migrate such information extraction models across different drug review data sets and across different disease conditions. Therefore three experiments were designed, first, an In-domain experiment where train and test data are from the same dataset. Two more experiments were conducted to examine the migration capability of models, namely cross-data source, where train and test are from different sources and cross-disease condition model training, where train and test data belong to different disease conditions in the same dataset. The experiments were evaluated using popular metrics such as RMSE, MAE, R2 and Pearson’s coefficient and the results showed that the proposed deep learning regression model works less successfully when compared to the machine learning sentiment extraction models in the literature, which were done on the same datasets. But, this study contributes to the existing literature in the quantity of research work done and in quality of the model and also suggests the future researchers on how to improve. This work also addressed the shortcomings in the literature by introducin
Text pre-processing of multilingual for sentiment analysis based on social network data
Sentiment analysis (SA) is an enduring area for research especially in the field of text analysis. Text pre-processing is an important aspect to perform SA accurately. This paper presents a text processing model for SA, using natural language processing techniques for twitter data. The basic phases for machine learning are text collection, text cleaning, pre-processing, feature extractions in a text and then categorize the data according to the SA techniques. Keeping the focus on twitter data, the data is extracted in domain specific manner. In data cleaning phase, noisy data, missing data, punctuation, tags and emoticons have been considered. For pre-processing, tokenization is performed which is followed by stop word removal (SWR). The proposed article provides an insight of the techniques, that are used for text pre-processing, the impact of their presence on the dataset. The accuracy of classification techniques has been improved after applying text pre-processing and dimensionality has been reduced. The proposed corpus can be utilized in the area of market analysis, customer behaviour, polling analysis, and brand monitoring. The text pre-processing process can serve as the baseline to apply predictive analysis, machine learning and deep learning algorithms which can be extended according to problem definition
A Comparative Study of Machine Learning Models for Sentiment Analysis: Customer Reviews of E-Commerce Platforms
Understanding customers\u27 preferences can be vital for companies to improve customer satisfaction. Reviews of products and services written by customers and published on various online platforms offer tremendous potential to gain important insights about customers\u27 opinions. Sentiment classification with various machine learning models has been of great interest to academia and practice for a while, however, the emergence of language transformer models brings forth new avenues of research. In this article, we compare the performance of traditional machine learning models and recently introduced transformer-based techniques on a dataset of customer reviews published on the Trustpilot platform. We found that transformer-based models outperform traditional models, and one can achieve over 98% accuracy. The best performing model shows the same excellent performance independently of the store considered. We also illustrate why it can be sometimes more reliable to use the sentiment polarity assigned by the machine learning model, rather than a numeric rating that is provided by the customer
RECOMED: A Comprehensive Pharmaceutical Recommendation System
A comprehensive pharmaceutical recommendation system was designed based on
the patients and drugs features extracted from Drugs.com and Druglib.com.
First, data from these databases were combined, and a dataset of patients and
drug information was built. Secondly, the patients and drugs were clustered,
and then the recommendation was performed using different ratings provided by
patients, and importantly by the knowledge obtained from patients and drug
specifications, and considering drug interactions. To the best of our
knowledge, we are the first group to consider patients conditions and history
in the proposed approach for selecting a specific medicine appropriate for that
particular user. Our approach applies artificial intelligence (AI) models for
the implementation. Sentiment analysis using natural language processing
approaches is employed in pre-processing along with neural network-based
methods and recommender system algorithms for modeling the system. In our work,
patients conditions and drugs features are used for making two models based on
matrix factorization. Then we used drug interaction to filter drugs with severe
or mild interactions with other drugs. We developed a deep learning model for
recommending drugs by using data from 2304 patients as a training set, and then
we used data from 660 patients as our validation set. After that, we used
knowledge from critical information about drugs and combined the outcome of the
model into a knowledge-based system with the rules obtained from constraints on
taking medicine.Comment: 39 pages, 14 figures, 13 table
- …