Development of an Automated Physician Review Classification System: A Hybrid Machine Learning Approach
Patients are increasingly turning to physician rating websites to help them make important healthcare decisions, such as selecting primary care doctors, specialists, and supplementary medical care providers. Previous research has identified a variety of topics and themes that emerge on these review platforms, but little work has been done to build an automated classifier that categorizes these reviews into the distinct topics already explored in this context. Such a classifier could help IS developers and other stakeholders automatically classify patient reviews and understand patient needs. Furthermore, using design science research, we derive design guidelines for building such machine learning systems, guidelines that have the potential to generalize to other context-specific problem spaces. Our work lays the foundation for design guidelines to be followed when building automated classification systems in specific contexts.
Mapping (Dis-)Information Flow about the MH17 Plane Crash
Digital media enables not only fast sharing of information, but also of disinformation. One prominent case of an event leading to the circulation of disinformation on social media is the MH17 plane crash. Studies analysing the spread of information about this event on Twitter have focused on small, manually annotated datasets, or have used proxies for data annotation. In this work, we examine to what extent text classifiers can be used to label data for subsequent content analysis; in particular, we focus on predicting pro-Russian and pro-Ukrainian Twitter content related to the MH17 plane crash. Even though we find that a neural classifier improves over a hashtag-based baseline, labeling pro-Russian and pro-Ukrainian content with high precision remains a challenging problem. We provide an error analysis underlining the difficulty of the task and identify factors that might help improve classification in future work. Finally, we show how the classifier can facilitate the annotation task for human annotators.
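As a concrete illustration of the kind of hashtag-based baseline the abstract compares against, the sketch below labels a tweet by counting hashtags from small stance lexicons. The hashtag lists and the function name here are hypothetical placeholders for illustration, not the study's actual lexicons:

```python
# Hypothetical stance lexicons; placeholders, not the study's actual hashtag lists.
PRO_RUSSIAN = {"#stancetag_ru_a", "#stancetag_ru_b"}
PRO_UKRAINIAN = {"#stancetag_ua_a", "#stancetag_ua_b"}


def hashtag_stance(tweet: str) -> str:
    """Label a tweet 'pro-russian', 'pro-ukrainian', or 'neutral' by hashtag counts."""
    tokens = {t.lower() for t in tweet.split()}
    pro_ru = len(tokens & PRO_RUSSIAN)
    pro_ua = len(tokens & PRO_UKRAINIAN)
    if pro_ru > pro_ua:
        return "pro-russian"
    if pro_ua > pro_ru:
        return "pro-ukrainian"
    return "neutral"
```

A baseline like this is cheap and high-precision on tweets that carry partisan hashtags, but it abstains on everything else, which is exactly the gap a trained neural classifier tries to close.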
BERT-Deep CNN: State-of-the-Art for Sentiment Analysis of COVID-19 Tweets
The rapid development of social media technology has accelerated the free flow of information. The outbreak of Coronavirus disease (COVID-19) has had a significant social and psychological impact on the population, and the pandemic is one of the most discussed current events on social media platforms. To safeguard societies from this pandemic, studying people's emotions on social media is crucial. Because of their particular characteristics, sentiment analysis of texts such as tweets remains challenging. Sentiment analysis is a powerful text analysis tool that automatically detects and analyzes opinions and emotions in unstructured data. A sentiment analysis tool examines texts from a wide range of sources, including emails, surveys, reviews, social media posts, and web articles, and extracts meaning from them. To evaluate sentiment, natural language processing (NLP) and machine learning techniques assign weights to entities, topics, themes, and categories in sentences or phrases. Machine learning tools learn to detect sentiment without human intervention by examining examples of emotion in text. In a pandemic situation, analyzing social media texts to uncover sentiment trends can be very helpful for understanding society's needs and predicting future trends. We study society's perception of the COVID-19 pandemic through social media using state-of-the-art BERT and Deep CNN models. The superiority of BERT models over other deep models in sentiment analysis is evident from the comparison of the various research studies discussed in this article.
Comment: 20 pages, 5 figures
An assessment of deep learning models and word embeddings for toxicity detection within online textual comments
Today, increasing numbers of people are interacting online, and a great deal of textual commentary is being produced by the explosion of online communication. However, a major drawback of online environments is that comments shared on digital platforms can conceal hazards, such as fake news, insults, harassment and, more generally, comments that may hurt someone's feelings. In this scenario, detecting this kind of toxicity plays an important role in moderating online communication. Deep learning technologies have recently delivered impressive performance in Natural Language Processing applications, including Sentiment Analysis and emotion detection, across numerous datasets. Such models do not need any pre-defined hand-picked features; they learn sophisticated features from the input datasets by themselves. In this domain, word embeddings have been widely and effectively used as a way of representing words in Sentiment Analysis tasks. Therefore, in this paper, we investigate the use of deep learning and word embeddings to detect six different types of toxicity within online comments. In doing so, we evaluate the most suitable deep learning layers and state-of-the-art word embeddings for identifying toxicity. The results suggest that Long Short-Term Memory layers in combination with mimicked word embeddings are a good choice for this task.
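The combination the abstract recommends, an LSTM layer over word embeddings with one independent sigmoid output per toxicity type, can be sketched as a single forward pass. This is a minimal NumPy illustration with random weights, not the paper's model; the six label names below follow the widely used Jigsaw toxic-comment categories and are an assumption, since the abstract does not name the types.

```python
import numpy as np

rng = np.random.default_rng(1)
emb_dim, hid = 50, 32               # illustrative sizes, not the paper's
# Assumed label set (common Jigsaw categories); the paper's six types are not named.
LABELS = ["toxic", "severe_toxic", "obscene", "threat", "insult", "identity_hate"]


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


# One LSTM cell with all four gates packed into W, randomly initialised.
W = rng.standard_normal((4 * hid, emb_dim + hid)) * 0.1
b = np.zeros(4 * hid)
W_out = rng.standard_normal((len(LABELS), hid)) * 0.1


def lstm_multilabel(embedded):
    """Run an LSTM over embedded tokens (seq_len, emb_dim); return per-label probs."""
    h, c = np.zeros(hid), np.zeros(hid)
    for x in embedded:
        z = W @ np.concatenate([x, h]) + b
        i, f, o, g = np.split(z, 4)
        i, f, o, g = sigmoid(i), sigmoid(f), sigmoid(o), np.tanh(g)
        c = f * c + i * g               # update cell state
        h = o * np.tanh(c)              # update hidden state
    # Independent sigmoids: a comment can carry several toxicity types at once.
    return dict(zip(LABELS, sigmoid(W_out @ h)))


scores = lstm_multilabel(rng.standard_normal((12, emb_dim)))
```

The key design point is the sigmoid (not softmax) output layer: toxicity types are not mutually exclusive, so each label gets its own independent probability.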