Comparing expert and learner mathematical language: A corpus linguistics approach

Christopher J. Sangwin (7157666); Juan P. Mejia-Ramos (7157663); Kristen Lew (4163272); Lara Alcock (1384308); Matthew Inglis (1384290); Paolo Rago (4163266)

Comparing expert and learner mathematical language: A corpus linguistics approach

Authors: Christopher J. Sangwin (7157666)
Juan P. Mejia-Ramos (7157663)
Kristen Lew (4163272)
Lara Alcock (1384308)
Matthew Inglis (1384290)
Paolo Rago (4163266)
Publication date: 1 January 2017
Publisher

Abstract

Corpus linguists attempt to understand language by statistically analyzing large collections of text, known as corpora. We describe the creation of three corpora designed to enable the study of expert and learner mathematical language. Our corpora were formed by collecting and processing three different genres of mathematical texts: mathematical research papers, undergraduate-level textbooks, and undergraduate dissertations. We pay particular attention to the method by which our corpora were created, and present a mechanism by which LaTeX source files can be easily converted to a form suitable for use with corpus analysis software packages. We then compare these three different types of mathematical texts by analyzing their word frequency distributions. We find that undergraduate students write in remarkably similar ways to textbook authors, but that research papers are substantially different. These differences are discussed

Similar works

Full text

Open in the Core reader

Download PDF

Available Versions

Loughborough University Institutional Repository

oai:figshare.com:article/93726...

Last time updated on 26/03/2020