Location of Repository

Automated classification of unexpected uses of this and that in a learner corpus of English

By Thomas Gaillat, Pascale Sébillot and Nicolas BALLIER


International audienceThis paper deals with the way learners make use of the demonstratives this and that. NLP tools are applied to classify occurrences of native and non-native uses of the two forms. The objective of the two experiments is to automatically identify expected and unexpected uses. The textualenvironment of all the occurrences is explored at text and PoS level to uncover features which play a role in the selection of a particular form. Results of the first experiment show that the PoS features predeterminer and determiner, which are found in the close context of occurrences, helpidentify unexpected learner uses among many occurrences also including native uses. The second experiment shows evidence that the PoS features plural noun and coordinating conjunction influence the unexpected uses of the demonstratives by learners. This study shows that NLP toolscan be used to explore texts and uncover underlying grammatical categories that play a role in the selection of specific words

Topics: this, that, learner corpus of English, [SHS.LANGUE] Humanities and Social Sciences/Linguistics
Publisher: Rodopi
Year: 2014
OAI identifier: oai:HAL:hal-01239556v1
Provided by: Hal-Diderot

Suggested articles


To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.