Skip to main content
Article thumbnail
Location of Repository

Introduction to statistical methods in linguistics with Python

By Alexandre Roulois


International audienceStatistics is a quantitative approach to research. Its relationship with math and science makes it at best ignored by the humanities community, at worst avoided. Indeed, humanities are known to attract students who flee the hard sciences. Even sadly, quantitative data are sometimes wrongly accused to destroy the "magic" in literary texts. On the contrary, I intend to demonstrate in my presentation that statistics can contribute to linguistics.There is another false idea that grows in the mind of students in humanities I want to fight: programing is only for math people. As a matter of fact, programing has more to do with logic and linguistics than we would have thought. It is just like learning a new language and who is better at this than a student in humanities?With only a few tools, we will build a program in Python that analyse a multilingual corpus of various texts (fiction, essay) and gives quantitative data and graphs as well

Topics: Statistics, Linguistics computationnal, Python programmation, [ SHS.LANGUE ] Humanities and Social Sciences/Linguistics, [ SHS.INFO ] Humanities and Social Sciences/Library and information sciences
Publisher: HAL CCSD
Year: 2015
OAI identifier: oai:HAL:halshs-01264583v1
Provided by: Hal-Diderot
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • https://halshs.archives-ouvert... (external link)
  • https://halshs.archives-ouvert... (external link)
  • https://halshs.archives-ouvert... (external link)
  • Suggested articles

    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.