Search CORE

5,878 research outputs found

Big data techniques in auditing research and practice: Current trends and future opportunities

Author: Gepp Adrian
Linnenluecke Martina K
O'Neill Terence J
Publication venue: 'Elsevier BV'
Publication date: 01/06/2018
Field of study

Data-driven solution to identify sentiments from online drug reviews.

Author: Haque Rezaul
Hasan Md Junayed
Khushbu Katura Gania
Laskar Saddam Hossain
Uddin Jia
Publication venue: 'MDPI AG'
Publication date: 21/04/2023
Field of study

With the proliferation of the internet, social networking sites have become a primary source of user-generated content, including vast amounts of information about medications, diagnoses, treatments, and disorders. Comments on previously used medicines, contained within these data, can be leveraged to identify crucial adverse drug reactions, and machine learning (ML) approaches such as sentiment analysis (SA) can be employed to derive valuable insights. However, given the sheer volume of comments, it is often impractical for consumers to manually review all of them before determining a purchase decision. Therefore, drug assessments can serve as a valuable source of medical information for both healthcare professionals and the general public, aiding in decision making and improving public monitoring systems by revealing collective experiences. Nonetheless, the unstructured and linguistic nature of the comments poses a significant challenge for effective categorization, with previous studies having utilized machine and deep learning (DL) algorithms to address this challenge. Despite both approaches showing promising results, DL classifiers outperformed ML classifiers in previous studies. Therefore, the objective of our study was to improve upon earlier research by applying SA to medication reviews and training five ML algorithms on two distinct feature extractions and four DL classifiers on two different word-embedding approaches to obtain higher categorization scores. Our findings indicated that the random forest trained on the count vectorizer outperformed all other ML algorithms, achieving an accuracy and F1 score of 96.65% and 96.42%, respectively. Furthermore, the bidirectional LSTM (Bi-LSTM) model trained on GloVe embedding resulted in an even better accuracy and F1 score, reaching 97.40% and 97.42%, respectively. Hence, by utilizing appropriate natural language processing and ML algorithms, we were able to achieve superior results compared to earlier studies

Open Access Institutional Repository at Robert Gordon University

2020 SDSU Data Science Symposium Program

Author: South Dakota State University
Publication venue: Open PRAIRIE: Open Public Research Access Institutional Repository and Information Exchange
Publication date: 01/01/2020
Field of study

https://openprairie.sdstate.edu/ds_symposium_programs/1002/thumbnail.jp

Public Research Access Institutional Repository and Information Exchange

Emotion Expression Extraction Method for Chinese Microblog Sentences

Author: Ren Fuji
Zhang Qian
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 07/02/2021
Field of study

With the rapid spread of Chinese microblog, a large number of microblog topics are being generated in real-time. More and more users pay attention to emotion expressions of these opinionated sentences in different topics. It is challenging to label the emotion expressions of opinionated sentences manually. For this endeavor, an emotion expression extraction method is proposed to process millions of user-generated opinionated sentences automatically in this paper. Specifically, the proposed method mainly contains two tasks: emotion classification and opinion target extraction. We first use a lexicon-based emotion classification method to compute different emotion values in emotion label vectors of opinionated sentences. Then emotion label vectors of opinionated sentences are revised by an unsupervised emotion label propagation algorithm. After extracting candidate opinion targets of opinionated sentences, the opinion target extraction task is performed on a random walk-based ranking algorithm, which considers the connection between candidate opinion targets and the textual similarity between opinionated sentences, ranks candidate opinion targets of opinionated sentences. Experimental results demonstrate the effectiveness of algorithms in the proposed method

Tokushima University Institutional Repository

Text Mining Promise and Reality

Author: Durfee Antonina
Publication venue: AIS Electronic Library (AISeL)
Publication date: 31/12/2006
Field of study

AIS Electronic Library (AISeL)

Essays on text mining for improved decision making

Author: Thorleuchter Dirk
Publication venue: Ghent University. Faculty of Economics and Business Administration
Publication date: 01/01/2011
Field of study

Ghent University Academic Bibliography

A Review of Various Sentiment Analysis Techniques

Author: Shubhashree Acharya, Prof. Manali Modi
Publication venue: Auricle Global Society of Education and Research
Publication date: 30/04/2018
Field of study

This paper focuses on the utilization of sentiment analysis techniques in various application domains. Here we present major part of the research work done in the field of sentiment mining or opinion mining using the techniques and tools of sentiment analysis. We get a brief idea regarding the comparison of the techniques and the importance of the data set in acquiring the desired outcomes. This paper gives a comparison on the solutions presented in the research paper

International Journal on Future Revolution in Computer Science & Communication Engineering

Big Data and the Internet of Things

Author: A Baaziz
A Kleiner
ED Feigelson
MA Waller
S Boyd
S Vandermerwe
Z Zhou
Publication venue
Publication date: 24/03/2015
Field of study

Advances in sensing and computing capabilities are making it possible to embed increasing computing power in small devices. This has enabled the sensing devices not just to passively capture data at very high resolution but also to take sophisticated actions in response. Combined with advances in communication, this is resulting in an ecosystem of highly interconnected devices referred to as the Internet of Things - IoT. In conjunction, the advances in machine learning have allowed building models on this ever increasing amounts of data. Consequently, devices all the way from heavy assets such as aircraft engines to wearables such as health monitors can all now not only generate massive amounts of data but can draw back on aggregate analytics to "improve" their performance over time. Big data analytics has been identified as a key enabler for the IoT. In this chapter, we discuss various avenues of the IoT where big data analytics either is already making a significant impact or is on the cusp of doing so. We also discuss social implications and areas of concern.Comment: 33 pages. draft of upcoming book chapter in Japkowicz and Stefanowski (eds.) Big Data Analysis: New algorithms for a new society, Springer Series on Studies in Big Data, to appea

arXiv.org e-Print Archive

Crossref