Search CORE

8,643 research outputs found

TA-COS 2018 : 2nd Workshop on Text Analytics for Cybersecurity and Online Safety : Proceedings

Author: De Pauw Guy
Desmet Bart
Lefever Els
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2018
Field of study

Detection of Hate-Speech Tweets Based on Deep Learning: A Review

Author: Abdulazeez Adnan Mohsin
Miran Ara Zozan
Publication venue: Universitas Trilogi
Publication date: 27/12/2023
Field of study

Cybercrime, cyberbullying, and hate speech have all increased in conjunction with the use of the internet and social media. The scope of hate speech knows no bounds or organizational or individual boundaries. This disorder affects many people in diverse ways. It can be harsh, offensive, or discriminating depending on the target's gender, race, political opinions, religious intolerance, nationality, human color, disability, ethnicity, sexual orientation, or status as an immigrant. Authorities and academics are investigating new methods for identifying hate speech on social media platforms like Facebook and Twitter. This study adds to the ongoing discussion about creating safer digital spaces while balancing limiting hate speech and protecting freedom of speech. Partnerships between researchers, platform developers, and communities are crucial in creating efficient and ethical content moderation systems on Twitter and other social media sites. For this reason, multiple methodologies, models, and algorithms are employed. This study presents a thorough analysis of hate speech in numerous research publications. Each article has been thoroughly examined, including evaluating the algorithms or methodologies used, databases, classification techniques, and the findings achieved. In addition, comprehensive discussions were held on all the examined papers, explicitly focusing on consuming deep learning techniques to detect hate speech

E-Journal Portal System Universitas Trilogi

Towards multidomain and multilingual abusive language detection: a survey

Author: Basile V.
Pamungkas E. W.
Patti V.
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2021
Field of study

Institutional Research Information System University of Turin

Time of your hate: The challenge of time in hate speech detection on social media

Author: Basile P.
Basile V.
Florio K.
Patti V.
Polignano M.
Publication venue: 'MDPI AG'
Publication date: 01/01/2020
Field of study

Institutional Research Information System University of Turin

SoK: Content Moderation in Social Media, from Guidelines to Enforcement, and Research to Practice

Author: Kumarswamy Nihal
Ling Chen
Nilizadeh Shirin
Paudel Pujan
Singhal Mohit
Stringhini Gianluca
Thota Poojitha
Publication venue
Publication date: 27/10/2022
Field of study

To counter online abuse and misinformation, social media platforms have been establishing content moderation guidelines and employing various moderation policies. The goal of this paper is to study these community guidelines and moderation practices, as well as the relevant research publications to identify the research gaps, differences in moderation techniques, and challenges that should be tackled by the social media platforms and the research community at large. In this regard, we study and analyze in the US jurisdiction the fourteen most popular social media content moderation guidelines and practices, and consolidate them. We then introduce three taxonomies drawn from this analysis as well as covering over one hundred interdisciplinary research papers about moderation strategies. We identified the differences between the content moderation employed in mainstream social media platforms compared to fringe platforms. We also highlight the implications of Section 230, the need for transparency and opacity in content moderation, why platforms should shift from a one-size-fits-all model to a more inclusive model, and lastly, we highlight why there is a need for a collaborative human-AI system

arXiv.org e-Print Archive

Like trainer, like bot? Inheritance of bias in algorithmic content moderation

Author: A Caliskan
A Centivany
AA Anderson
AF Hayes
D Halpern
FL Johnson
I Gagliardone
J Feinberg
J Wolak
JS Mill
K Crawford
L Dahlberg
LA Sutton
NJ Stroud
P Burnap
RS Tokunaga
T Calders
T Gillespie
T Jay
TB Ksiazek
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2017
Field of study

The internet has become a central medium through which `networked publics' express their opinions and engage in debate. Offensive comments and personal attacks can inhibit participation in these spaces. Automated content moderation aims to overcome this problem using machine learning classifiers trained on large corpora of texts manually annotated for offence. While such systems could help encourage more civil debate, they must navigate inherently normatively contestable boundaries, and are subject to the idiosyncratic norms of the human raters who provide the training data. An important objective for platforms implementing such measures might be to ensure that they are not unduly biased towards or against particular norms of offence. This paper provides some exploratory methods by which the normative biases of algorithmic content moderation systems can be measured, by way of a case study using an existing dataset of comments labelled for offence. We train classifiers on comments labelled by different demographic subsets (men and women) to understand how differences in conceptions of offence between these groups might affect the performance of the resulting models on various test sets. We conclude by discussing some of the ethical choices facing the implementers of algorithmic moderation systems, given various desired levels of diversity of viewpoints amongst discussion participants.Comment: 12 pages, 3 figures, 9th International Conference on Social Informatics (SocInfo 2017), Oxford, UK, 13--15 September 2017 (forthcoming in Springer Lecture Notes in Computer Science

arXiv.org e-Print Archive

Crossref

UCL Discovery

Oxford University Research Archive

Towards Automated Moderation: Enabling Toxic Language Detection with Transfer Learning and Attention-Based Models

Author: Bäumer Frederik S.
Caron Matthew
Müller Oliver
Publication venue: 'HICSS Conference Office'
Publication date: 03/01/2022
Field of study

Our world is more connected than ever before. Sadly, however, this highly connected world has made it easier to bully, insult, and propagate hate speech on the cyberspace. Even though researchers and companies alike have started investigating this real-world problem, the question remains as to why users are increasingly being exposed to hate and discrimination online. In fact, the noticeable and persistent increase in harmful language on social media platforms indicates that the situation is, actually, only getting worse. Hence, in this work, we show that contemporary ML methods can help tackle this challenge in an accurate and cost-effective manner. Our experiments demonstrate that a universal approach combining transfer learning methods and state-of-the-art Transformer architectures can trigger the efficient development of toxic language detection models. Consequently, with this universal approach, we provide platform providers with a simplistic approach capable of enabling the automated moderation of user-generated content, and as a result, hope to contribute to making the web a safer place

ScholarSpace at University of Hawai'i at Manoa

AIS Electronic Library (AISeL)