Search CORE

4,777 research outputs found

Computational approaches to semantic change (Volume 6)

Author
Publication venue: Language Science Press
Publication date: 16/10/2021
Field of study

Quantifying the dynamics of topical fluctuations in language

Author: Blythe Richard
Karjus Andres
Kirby Simon
Smith Kenneth
Publication venue: 'Brill'
Publication date: 21/06/2019
Field of study

The availability of large diachronic corpora has provided the impetus for a growing body of quantitative research on language evolution and meaning change. The central quantities in this research are token frequencies of linguistic elements in texts, with changes in frequency taken to reflect the popularity or selective fitness of an element. However, corpus frequencies may change for a wide variety of reasons, including purely random sampling effects, or because corpora are composed of contemporary media and fiction texts within which the underlying topics ebb and flow with cultural and socio-political trends. In this work, we introduce a simple model for controlling for topical fluctuations in corpora - the topical-cultural advection model - and demonstrate how it provides a robust baseline of variability in word frequency changes over time. We validate the model on a diachronic corpus spanning two centuries, and a carefully-controlled artificial language change scenario, and then use it to correct for topical fluctuations in historical time series. Finally, we use the model to show that the emergence of new words typically corresponds with the rise of a trending topic. This suggests that some lexical innovations occur due to growing communicative need in a subspace of the lexicon, and that the topical-cultural advection model can be used to quantify this.Comment: Code to run the analyses described in this paper is now available at https://github.com/andreskarjus/topical_cultural_advection_model . A previous shorter version of this paper outlining the basic model appeared as an extended abstract in the proceedings of the Society for Computation in Linguistics (Karjus et al. 2018, Topical advection as a baseline model for corpus-based lexical dynamics

arXiv.org e-Print Archive

Edinburgh Research Explorer

Computational modeling of semantic change

Author: Dubossarsky Haim
Tahmasebi Nina
Publication venue
Publication date: 13/04/2023
Field of study

In this chapter we provide an overview of computational modeling for semantic change using large and semi-large textual corpora. We aim to provide a key for the interpretation of relevant methods and evaluation techniques, and also provide insights into important aspects of the computational study of semantic change. We discuss the pros and cons of different classes of models with respect to the properties of the data from which one wishes to model semantic change, and which avenues are available to evaluate the results.Comment: This chapter is submitted to Routledge Handbook of Historical Linguistics, 2nd Editio

arXiv.org e-Print Archive

Computational approaches to semantic change

Author: Batista-Navarro Riza
Boons Frank
Borin Lars
Ciobanu Alina Maria
Dinu Liviu P.
Duan Yijun
Dubossarsky Haim
Grewal Karan
Handl Julia
Haslam Nick
Hengchen Simon
Jatowt Adam
Mahanty Sampriti
McGillivray Barbara
Palma Marco
Perrone Valerio
Peterson Stellan
Schlechtweg Dominik
Sköldberg Emma
Smith Jim Q.
Tahmasebi Nina
Uban Ana-Sabina
Vatri Alessandro
Vylomova Ekaterina
Xu Yang
Yoshikawa Masatoshi
Zhang Zheng-sheng
Publication venue: Language Science Press
Publication date: 26/02/2021
Field of study

Semantic change — how the meanings of words change over time — has preoccupied scholars since well before modern linguistics emerged in the late 19th and early 20th century, ushering in a new methodological turn in the study of language change. Compared to changes in sound and grammar, semantic change is the least  understood. Ever since, the study of semantic change has progressed steadily, accumulating a vast store of knowledge for over a century, encompassing many languages and language families. Historical linguists also early on realized the potential of computers as research tools, with papers at the very first international conferences in computational linguistics in the 1960s. Such computational studies still tended to be small-scale, method-oriented, and qualitative. However, recent years have witnessed a sea-change in this regard. Big-data empirical quantitative investigations are now coming to the forefront, enabled by enormous advances in storage capability and processing power. Diachronic corpora have grown beyond imagination, defying exploration by traditional manual qualitative methods, and language technology has become increasingly data-driven and semantics-oriented. These developments present a golden opportunity for the empirical study of semantic change over both long and short time spans. A major challenge presently is to integrate the hard-earned  knowledge and expertise of traditional historical linguistics with  cutting-edge methodology explored primarily in computational linguistics. The idea for the present volume came out of a concrete response to this challenge.  The 1st International Workshop on Computational Approaches to Historical Language Change (LChange'19), at ACL 2019, brought together scholars from both fields. This volume offers a survey of this exciting new direction in the study of semantic change, a discussion of the many remaining challenges that we face in pursuing it, and considerably updated and extended versions of a selection of the contributions to the LChange'19 workshop, addressing both more theoretical problems —  e.g., discovery of "laws of semantic change" — and practical applications, such as information retrieval in longitudinal text archives

Language Science Press

Computational approaches to semantic change

Author: Batista-Navarro Riza
Boons Frank
Borin Lars
Ciobanu Alina Maria
Dinu Liviu P.
Duan Yijun
Dubossarsky Haim
Grewal Karan
Handl Julia
Haslam Nick
Hengchen Simon
Jatowt Adam
Mahanty Sampriti
McGillivray Barbara
Palma Marco
Perrone Valerio
Peterson Stellan
Schlechtweg Dominik
Sköldberg Emma
Smith Jim Q.
Tahmasebi Nina
Uban Ana-Sabina
Vatri Alessandro
Vylomova Ekaterina
Xu Yang
Yoshikawa Masatoshi
Zhang Zheng-sheng
Publication venue: Language Science Press
Publication date: 26/02/2021
Field of study

Language Science Press

Computational approaches to semantic change

Author: Batista-Navarro Riza
Boons Frank
Borin Lars
Ciobanu Alina Maria
Dinu Liviu P.
Duan Yijun
Dubossarsky Haim
Grewal Karan
Handl Julia
Haslam Nick
Hengchen Simon
Jatowt Adam
Mahanty Sampriti
McGillivray Barbara
Palma Marco
Perrone Valerio
Peterson Stellan
Schlechtweg Dominik
Sköldberg Emma
Smith Jim Q.
Tahmasebi Nina
Uban Ana-Sabina
Vatri Alessandro
Vylomova Ekaterina
Xu Yang
Yoshikawa Masatoshi
Zhang Zheng-sheng
Publication venue: Language Science Press
Publication date: 26/02/2021
Field of study

Language Science Press