CORE
🇺🇦
make metadata, not war
Services
Services overview
Explore all CORE services
Access to raw data
API
Dataset
FastSync
Content discovery
Recommender
Discovery
OAI identifiers
OAI Resolver
Managing content
Dashboard
Bespoke contracts
Consultancy services
Support us
Support us
Membership
Sponsorship
Community governance
Advisory Board
Board of supporters
Research network
About
About us
Our mission
Team
Blog
FAQs
Contact us
Deviations in the Zipf and Heaps laws in natural languages
Authors
Bochkarev V.
Lerner E.
Shevlyakova A.
Publication date
1 March 2020
Publisher
Abstract
This paper is devoted to verifying of the empirical Zipf and Hips laws in natural languages using Google Books Ngram corpus data. The connection between the Zipf and Heaps law which predicts the power dependence of the vocabulary size on the text size is discussed. In fact, the Heaps exponent in this dependence varies with the increasing of the text corpus. To explain it, the obtained results are compared with the probability model of text generation. Quasi-periodic variations with characteristic time periods of 60-100 years were also found. © Published under licence by IOP Publishing Ltd
Similar works
Full text
Available Versions
National Open Repository Aggregator (NORA)
See this paper in CORE
Go to the repository landing page
Download from data provider
oai:rour.neicon.ru:rour/179211
Last time updated on 04/04/2020