Introduction to statistical methods in linguistics with Python

Abstract

International audienceStatistics is a quantitative approach to research. Its relationship with math and science makes it at best ignored by the humanities community, at worst avoided. Indeed, humanities are known to attract students who flee the hard sciences. Even sadly, quantitative data are sometimes wrongly accused to destroy the "magic" in literary texts. On the contrary, I intend to demonstrate in my presentation that statistics can contribute to linguistics.There is another false idea that grows in the mind of students in humanities I want to fight: programing is only for math people. As a matter of fact, programing has more to do with logic and linguistics than we would have thought. It is just like learning a new language and who is better at this than a student in humanities?With only a few tools, we will build a program in Python that analyse a multilingual corpus of various texts (fiction, essay) and gives quantitative data and graphs as well

Similar works

Full text

thumbnail-image

Hal-Diderot

redirect
Last time updated on 08/11/2016

This paper was published in Hal-Diderot.

Having an issue?

Is data on this page outdated, violates copyrights or anything else? Report the problem now and we will take corresponding actions after reviewing your request.