Search CORE

87 research outputs found

Quantifying change

Author: Nevalainen Terttu
Publication venue: John Benjamins
Publication date: 01/09/2018
Field of study

Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Early Modern English

Author: Nevalainen Taimi Terttu Annikki
Publication venue: Oxford University Press
Publication date: 01/08/2017
Field of study

In the Early Modern English period (1500–1700), steps were taken toward Standard English, and this was also the time when Shakespeare wrote, but these perspectives are only part of the bigger picture. This chapter looks at Early Modern English as a variable and changing language not unlike English today. Standardization is found particularly in spelling, and new vocabulary was created as a result of the spread of English into various professional and occupational specializations. New research using digital corpora, dictionaries, and databases reveals the gradual nature of these processes. Ongoing developments were no less gradual in pronunciation, with processes such as the Great Vowel Shift, or in grammar, where many changes resulted in new means of expression and greater transparency. Word order was also subject to gradual change, becoming more fixed over time.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Volatile organic compounds in commonly used beddings before and after autoclaving

Author: Nevalainen Timo
Vartiainen Terttu
Publication venue: Scandinavian Journal of Laboratory Animal Science
Publication date: 01/12/1996
Field of study

How to describe bedding, that is the question. So far it has been acceptable to write down the type and manufacturer of the bedding. And maybe for quality purposes screen the bedding for pesticide and hcav y metal residues. This study focused on assessing of ten volatile organic compounds, which as a group has been combined with negative effects on animals. The commonly used European beddings were found to contain tremendously variable concentrations of these arbitrarily chosen volatile compounds. Furthermore, in most cases the concentrations went down by an order of magnitude after autoclaving. In conclusion, description of bedding in sensitive studies remains vague unless there is data on volatile organic compounds of the bedding at its final destination, the animal cage

Journals from University of Tartu

Sociolinguistics and Language History: The Helsinki Corpus of Early English Correspondence

Author: Nevalainen Terttu
Raumolin-Brunberg Helena
Publication venue: Aarhus University, Faculty of Arts, School of Communication and Culture
Publication date: 04/01/2017
Field of study

The paper introduces our new project on diachronic sociolinguistics, focusing on the problems of compiling a representative corpus for this purpose. We study long-term linguistic change in the Late Middle and Early Modern English periods (1420-1680) in a computer-readable corpus of personal letters, which is designed specifically for the purposes of sociohistorical research. When completed, the Helsinki Corpus of Early English Correspondence will comprise some 1.5 million running words representing all the literate social ranks of the time, both sexes, and different ages and occupations. In our case, the issues that a corpus compiler must deal with include the coverage of all the sociolinguistically relevant categories of data, authenticity of extant materials, and the quality of editing

Tidsskrift.dk (Det Kongelige Bibliotek)

Variation in noun and pronoun frequencies in a sociohistorical corpus of English

Author: Nevalainen Terttu
Siirtola Harri
Säily Tanja
Publication venue
Publication date: 01/01/2011
Field of study

WOS:000291063000003Many corpus linguists make the tacit assumption that part-of-speech frequencies remain constant during the period of observation. In this article, we will consider two related issues: (1) the reliability of part-of-speech tagging in a diachronic corpus and (2) shifts in tag ratios over time. The purpose is both to serve the users of the corpus by making them aware of potential problems, and to obtain linguistically interesting results. We use noun and pronoun ratios as diagnostics indicative of opposing stylistic tendencies, but we are also interested in testing whether any observed variation in the ratios could be accounted for in sociolinguistic terms. The material for our study is provided by the Parsed Corpus of Early English Correspondence (PCEEC), which consists of 2.2 million running words covering the period 1415–1681. The part-of-speech tagging of the PCEEC has its problems, which we test by reannotating the corpus according to our own principles and comparing the two annotations. While there are quite a few changes, the mean percentage of change is very small for both nouns and pronouns. As for variation over time, the mean frequency of nouns declines somewhat, while the mean frequency of pronouns fluctuates with no clear diachronic trend. However, women consistently use more pronouns than men, while men use more nouns than women. More fine-grained distinctions are needed to uncover further regularities and possible reasons for this variation.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Comparative sociolinguistic perspectives on the rate of linguistic change

Author: Nevalainen Terttu
Säily Tanja
Vartiainen Turo
Publication venue
Publication date: 01/01/2020
Field of study

This issue of the Journal of Historical Sociolinguistics aims to contribute to our understanding of language change in real time by presenting a group of articles particularly focused on social and sociocultural factors underlying language diversification and change. By analysing data from a varied set of languages, including Greek, English, and the Finnic and Mongolic language families, and mainly focussing their investigation on the Middle Ages, the authors connect various social and cultural factors with the specific topic of the issue, the rate of linguistic change. The sociolinguistic themes addressed include community and population size, conflict and conquest, migration and mobility, bi- and multilingualism, diglossia and standardization. In this introduction, the field of comparative historical sociolinguistics is considered a cross-disciplinary enterprise with a sociolinguistic agenda at the crossroads of contact linguistics, historical comparative linguistics and linguistic typology.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

The burden of legacy : Producing the Tagged Corpus of Early English Correspondence Extension (TCEECE)

Author: Kaislaniemi Samuli
Nevalainen Terttu
Saario Lassi
Säily Tanja
Publication venue
Publication date: 01/01/2021
Field of study

Special issue, Challenges of Combining Structured and Unstructured Data in Corpus Development, ed. by Tanja Säily & Jukka Tyrkkö.This paper discusses the process of part-of-speech tagging the Corpus of Early English Correspondence Extension (CEECE), as well as the end result. The process involved normalisation of historical spelling variation, conversion from a legacy format into TEI-XML, and finally, tokenisation and tagging by the CLAWS software. At each stage, we had to face and work around problems such as whether to retain original spelling variants in corpus markup, how to implement overlapping hierarchies in XML, and how to calculate the accuracy of tagging in a way that acknowledges errors in tokenisation. The final tagged corpus is estimated to have an accuracy of 94.5 per cent (in the C7 tagset), which is circa two percentage points (pp) lower than that of present-day corpora but respectable for Late Modern English. The most accurate tag groups include pronouns and numerals, whereas adjectives and adverbs are among the least accurate. Normalisation increased the overall accuracy of tagging by circa 3.7pp. The combination of POS tagging and social metadata will make the corpus attractive to linguists interested in the interplay between language-internal and -external factors affecting variation and change.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

Exploring meta-analysis for historical corpus linguistics based on linked data

Author: Kesäniemi Joonas
Nevalainen Terttu
Säily Tanja
Vartiainen Turo
Publication venue
Publication date: 01/01/2018
Field of study

Empirical work on English historical corpus linguistics is plentiful but fragmented, and some of it is hard to come by. This paper proposes a solution for making it more accessible and reusable for meta-analysis. We present an online Language Change Database (LCD), which provides comparative, real-time baseline data from earlier corpus-based studies. LCD entries summarize the findings and include numerical data from the articles. We discuss the LCD from the perspective of database design and linked data management. Furthermore, we illustrate the reuse of LCD data through a meta-analysis of the history of English connectives. For this purpose, we have developed an application called the LCD Aggregated Data Analysis workbench (LADA). We show how researchers can use LADA to filter, refine and visualize LCD data. Thus we are paving the way for a future where both research results and research data are regularly available for verification, validation and re-use.Peer reviewe

Helsingin yliopiston digitaalinen arkisto

History of English as punctuated equilibria? A meta-analysis of the rate of linguistic change in Middle English

Author: Liimatta Aatu
Lijffijt Jefrey
Nevalainen Terttu
Säily Tanja
Vartiainen Turo
Publication venue
Publication date: 01/01/2020
Field of study

In this paper, we explore the rate of language change in the history of English. Our main focus is on detecting periods of accelerated change in Middle English (1150–1500), but we also compare the Middle English data with the Early Modern period (1500–1700) in order to establish a longer diachrony for the pace at which English has changed over time. Our study is based on a meta-analysis of existing corpus research, which is made available through a new linguistic resource, the Language Change Database (LCD). By aggregating the rates of 44 individual changes, we provide a critical assessment of how well the theory of punctuated equilibria (Dixon 1997) fits with our results. More specifically, by comparing the rate of language change with major language-external events, such as the Norman Conquest and the Black Death, we provide the first corpus-based meta-analysis of whether these events, which had significant societal consequences, also had an impact on the rate of language change. Our results indicate that major changes in the rate of linguistic change in the late medieval period could indeed be connected to the social and cultural after-effects of the Norman Conquest. We also make a methodological contribution to the field of English historical linguistics: by re-using data from existing research, linguists can start to ask new, fundamental questions about the ways in which language change progresses.Peer reviewe

Ghent University Academic Bibliography

Helsingin yliopiston digitaalinen arkisto

Interactive Principal Component Analysis

Author: Nevalainen Terttu
Siirtola Harri
Säily Tanja
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 15/11/2019
Field of study

TamPub Julkaisuarkisto - TamPub Institutional Repository

Trepo - Institutional Repository of Tampere University