    Large scale analysis of changes in English vocabulary over recent time

    Stochastic model for the vocabulary growth in natural languages

    We propose a stochastic model for the number of different words in a given database which incorporates the dependence on the database size and historical changes. The main feature of our model is the existence of two different classes of words: (i) a finite number of core-words, which have higher frequency and do not affect the probability that a new word is used; and (ii) the remaining, virtually infinite number of noncore-words, which have lower frequency and, once used, reduce the probability that a new word is used in the future. Our model relies on a careful analysis of the Google n-gram database of books published in the last centuries, and its main consequence is the generalization of Zipf's and Heaps' laws to two scaling regimes. We confirm that these generalizations yield the best simple description of the data among generic descriptive models and that the two free parameters depend only on the language but not on the database. From the point of view of our model, the main change on historical time scales is the composition of the specific words included in the finite list of core-words, which we observe to decay exponentially in time with a rate of approximately 30 words per year for English. Comment: corrected typos and errors in reference list; 10 pages text, 15 pages supplemental material; to appear in Physical Review

    Changes in children’s cognitive development at the Start of School in England 2001–2008.

    Since 1997, England has seen massive changes in the Early Years, including the introduction of an early childhood curriculum, free pre-school education for three-year-olds and local programmes for disadvantaged communities. Many of these initiatives took time to introduce and become established. Beginning in 2001, and each year thereafter until 2008, the authors collected consistent data from thousands of children when they started school at the age of four on a range of variables, chosen because they are good predictors of later success. These included vocabulary, early reading and early mathematics. Children from the same set of 472 state primary schools in England were assessed each year. This paper contributes to the existing studies of educational trends over time by examining the extent to which children's scores on these measures changed over that period; in general, they were found to have remained stable.

    CLIWOC multilingual meteorological dictionary

    This dictionary is the first attempt to express the wealth of archaic logbook wind force terms in a form that is comprehensible to the modern-day reader. Oliver and Kington (1970) and Lamb (1982) have drawn attention to the importance of logbooks in climatic studies, and Lamb (1991) offered a conversion scale for early eighteenth century English wind force terms, but no studies have thus far pursued the matter to any greater depth. This text attempts to make good this deficiency, and is derived from the research undertaken by the CLIWOC project, in which British, Dutch, French and Spanish naval and merchant logbooks from the period 1750 to 1850 were used to derive a global database of climatic information. At an early stage in the project it was apparent that many of the logbook weather terms, whilst conforming to a conventional vocabulary, possessed meanings that were unclear to twenty-first century readers or had changed over time. This was particularly the case for the important element of wind force; but no special plea is entered for the evolution in nautical vocabulary, which often reflected more wide-ranging changes in the respective native languages. The key objective was to translate the archaic vocabulary of the late eighteenth and early nineteenth century mariner into expressions directly comparable with the Beaufort Scale (see Appendix I). Only then could the project's scientific programme be embarked upon. This dictionary is the result of the largest undertaking into logbook studies that has yet been carried out. Several thousand logbooks from British, Dutch, French and Spanish archives were examined, and the exercise offered a unique opportunity to explore the vocabulary of the one hundred year period beginning in 1750. The logbooks from which the raw data have been abstracted range widely across the North and South Atlantic and the Indian Oceans.
    Only the Pacific, largely in consequence of the paucity of regular naval activity in that area, is not well represented. The range of climates encountered in this otherwise wide geographic domain gives ample opportunity for the full range of the mariner's nautical weather vocabulary to be assessed, from the calms of the Equatorial regions, through the gales of the mid-latitude systems to the fearsome storms of the tropical latitudes. The Trade Winds belts, the Doldrums, the unsettled mid-latitudes, even the icy wastes of the high latitudes, are all embraced in this study. It is not here intended to pass any judgements on the climatological record of the logbooks; this text seeks only to provide a means of understanding archaic wind force terms and, other than to indicate those items that were not commonly used, no information is given on the frequency with which different terms appeared in the logbooks. Attention is, furthermore, confined to Dutch, English, French and Spanish because these once great imperial powers were the only nations able to support wide-ranging ocean-going fleets with their attendant collections of logbooks and documents over this long period of time. The work is offered to the wider academic community in the hope that it will prove to be of as much value to them as it has been to the CLIWOC team.

    Time to Act: An Agenda for Advancing Adolescent Literacy for College and Career Success

    Presents a vision for literacy instruction from fourth through twelfth grade; examines the challenges; outlines the elements of success, including professional development and use of data; and lays out a national agenda for change based on case studies.

    Mapping the Americanization of English in Space and Time

    As global political preeminence gradually shifted from the United Kingdom to the United States, so did the capacity to culturally influence the rest of the world. In this work, we analyze how the world-wide varieties of written English are evolving. We study both the spatial and temporal variations of vocabulary and spelling of English using a large corpus of geolocated tweets and the Google Books datasets corresponding to books published in the US and the UK. The advantage of our approach is that we can address both standard written language (Google Books) and the more colloquial forms of microblogging messages (Twitter). We find that American English is the dominant form of English outside the UK and that its influence is felt even within the UK borders. Finally, we analyze how this trend has evolved over time and the impact that some cultural events have had in shaping it. Comment: 16 pages, 6 figures, 2 tables. Published version

    Neologisms in Modern English: study of word-formation processes

    http://tartu.ester.ee/record=b2654513~S1*es