3 research outputs found

Understanding Misogynoir Online: Challenges In Identifying Intersectional Hate

Author: Kwarteng Joseph
Publication date: 15/04/2024

In an era of unprecedented digital connectivity, social networking sites have become a global crossroads of cultures, nations, and ethnicities. Yet, this digital expansion has been paralleled by the proliferation of hate speech, particularly targeted at specific genders and ethnicities. The speed at which digital networks operate allows hate speech to spread more rapidly than ever before, crossing geographical, social, and political boundaries with ease and often compounding the harm it causes. This scalability of hate speech through these digital platforms presents a formidable challenge in monitoring and moderating online spaces to protect vulnerable groups. In this thesis, we investigate the phenomenon of "Misogynoir", a specific form of intersectional hate speech that combines racial and gender-based prejudice and is directed at Black women. We present a comprehensive study on Misogynoir by converging methodologies from qualitative and quantitative analysis to investigate and highlight the challenges and nuances in identifying and understanding misogynoir. We start by conducting an extensive literature review of the subject, identifying the models of misogynoir. Based on these models, we created a lexicon of terms and expressions to examine further the online presence of this type of hate, particularly targeting Black women within the Science and Technology sector through a detailed analysis of public responses to their experiences of misogynoir shared on X (formerly Twitter). However, given the nuanced and context-dependent nature of misogynoir, which the lexicon approach struggled to capture effectively, we proceeded to assess the effectiveness of the current state-of-the-art automated hate speech detection tools. Our evaluation spanned across specially curated datasets, composed of sampled tweets that potentially exemplified misogynoir and those that displayed support towards Black women. 
This analysis aimed to determine the proficiency of these tools in identifying the multifaceted expressions of misogynoir within online discourse. We further delved into the impact of annotators' identities and lived experiences on their perception and labelling of potential misogynoir instances, recognising that automated detection systems are trained on datasets annotated by humans. An in-depth qualitative analysis of the justifications provided by annotators from four distinct demographic groups made it evident that an annotator's background profoundly influences their interpretation of content. Findings from this work shed light on annotator behaviour and their diverse rationales for annotating intersectional hate, and on how identity and lived experiences influence labelling decisions. These findings highlight the inadequacies of present automated hate speech detection tools in detecting misogynoir, setting the stage for future technological improvements. We emphasise the necessity for more advanced, context-sensitive tools tailored to the unique challenges encountered by Black women on digital platforms, steering us toward a more just and equitable online environment.

Open Research Online (The Open University)

Tune your Brown clustering, please

Authors: Bøgh K.S., Chester S., Derczynski L.
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2015

Brown clustering, an unsupervised hierarchical clustering technique based on n-gram mutual information, has proven useful in many NLP applications. However, most uses of Brown clustering employ the same default configuration; the appropriateness of this configuration has gone predominantly unexplored. Accordingly, we present information for practitioners on the behaviour of Brown clustering in order to assist hyper-parameter tuning, in the form of a theoretical model of Brown clustering utility. This model is then evaluated empirically in two sequence labelling tasks over two text types. We explore the dynamic between the input corpus size, chosen number of classes, and quality of the resulting clusters, which has an impact on any approach using Brown clustering. In every scenario that we examine, our results reveal that the values most commonly used for the clustering are sub-optimal.

White Rose Research Online

Geographic information extraction from texts

Authors: Hu Xuke, Hu Yingjie, Kersten Jens, Resch Bernd
Publication date: 05/12/2023

A large volume of unstructured texts, containing valuable geographic information, is available online. This information – provided implicitly or explicitly – is useful not only for scientific studies (e.g., spatial humanities) but also for many practical applications (e.g., geographic information retrieval). Although considerable progress has been achieved in geographic information extraction from texts, there are still unsolved challenges and issues, ranging from methods, systems, and data to applications and privacy. Therefore, this workshop will provide a timely opportunity to discuss recent advances, new ideas, and concepts, and to identify research gaps in geographic information extraction.

Institute of Transport Research:Publications