21 research outputs found

    Toward a Critical Toponymy Framework for Named Entity Recognition: A Case Study of Airbnb in New York City

    Full text link
    Critical toponymy examines the dynamics of power, capital, and resistance through place names and the sites to which they refer. Studies here have traditionally focused on the semantic content of toponyms and the top-down institutional processes that produce them. However, they have generally ignored the ways in which toponyms are used by ordinary people in everyday discourse, as well as the other strategies of geospatial description that accompany and contextualize toponymic reference. Here, we develop computational methods to measure how cultural and economic capital shape the ways in which people refer to places, through a novel annotated dataset of 47,440 New York City Airbnb listings from the 2010s. Building on this dataset, we introduce a new named entity recognition (NER) model able to identify important discourse categories integral to the characterization of place. Our findings point toward new directions for critical toponymy and to a range of previously understudied linguistic signals relevant to research on neighborhood status, housing and tourism markets, and gentrification.Comment: Accepted at EMNLP 2023 (main track

    On the use of Jargon and Word Embeddings to Explore Subculture within the Reddit’s Manosphere

    Get PDF
    Understanding the identities, needs, realities and development of subcultures has been a long term target of sociology and cultural studies. Socio-cultural linguistics, in particular, examines the use of language and, in particular, the existence and use of neologisms, slang and jargon. These terms capture concepts and expressions that are not in common use and represent the new realities, norms and values of subcommunities. Identifying and understanding such terms, however, is a very complex task, particularly considering the vast amount of content that is currently available online for many such groups. In this paper, we propose a combination of computational and socio-linguistic methods to automatically extract new terminology from large amounts of data, using word-embeddings to semantically contextualise their meaning. As a use case, we explore subculture on the platform Reddit. More specifically, we investigate groups considered part of the manosphere, a loose online community where men’s perspectives, gripes, frustrations and desires are explicitly expressed and where women are typically targets of hostility. Characterisations of this group as a subculture are then provided, based on an in-depth analysis of the identified jargon

    Table S5 from The Epigenetic Evolution of Glioma Is Determined by the <i>IDH1</i> Mutation Status and Treatment Regimen

    Full text link
    Table S5: Treatment-related probes and samples (Related to Figure 3) S5A: List of 69 IDHmut pairs with treatment information (ID of Initial and Recurrent samples and group assignment) S5B: List of differentially methylated probes associated with treatment in IDHmut gliomas S5C: List of differentially methylated probes associated with treatment in IDHmut astrocytomas S5D: List of CpG-gene pairs (epigenetic regulation associated with treatment)</p
    corecore