Skip to main content
Article thumbnail
Location of Repository

The adaptation of an English spellchecker for Japanese writers

By Roger Mitton and T. Okada

Abstract

It has been pointed out that the spelling errors made by second-language writers writing in English have features that are to some extent characteristic of their first language, and the suggestion has been made that a spellchecker could be adapted to take account of these features. In the work reported here, a corpus of spelling errors made by Japanese writers writing in English was compared with a corpus of errors made by native speakers. While the great majority of errors were common to the two corpora, some distinctively Japanese error patterns were evident against this common background, notably a difficulty in deciding between the letters b and v, and the letters l and r, and a tendency to add syllables. A spellchecker that had been developed for native speakers of English was adapted to cope with these errors. A brief account is given of the spellchecker’s mode of operation to indicate how it lent itself to modifications of this kind. The native-speaker spellchecker and the Japanese-adapted version were run over the error corpora and the results show that these adaptations produced a modest but worthwhile improvement to the spellchecker’s performance in correcting Japanese-made errors

Topics: csis
Year: 2007
OAI identifier: oai:eprints.bbk.ac.uk.oai2:592

Suggested articles

Citations

  1. (2004). A corpus analysi s of spelli n g errors made by Japane s e EFL writer s .
  2. (2004). A corpus analysis of spelling errors made by Japanese EFL writers.
  3. (1970). Categor i e s of spelling difficulty in spea kers of English as a first and second languag e .
  4. (1970). Categories of spelling difficulty in speakers of English as a first and second language. doi
  5. (1988). Comput e r i z e d correction of phonographic errors. doi
  6. (1988). Computerized correction of phonographic errors. doi
  7. (1996). English spelling and the computer. L o n d o n : Longma n . (now availab l e electro n i c a l l y from eprints.bbk.ac.uk
  8. (1996). English spelling and the computer. London: Longman. (now available electronically from eprints.bbk.ac.uk)
  9. (1998). Learner English on computer. doi
  10. (1998). Learner English on computer. London: Longma n.
  11. (2000). Optimi s i n g measur e s of lexica l variat i o n in EFL learner corpora .
  12. (2000). Optimising measures of lexical variation in EFL learner corpora.
  13. (2005). Spelling errors ma de by Japanes e EFL writers : with referen c e to errors occurri n g at the word-in i t i a l and the word-fi n a l positio n . In
  14. (2005). Spelling errors made by Japanese EFL writers: with reference to errors occurring at the word-initial and the word-final position. In
  15. (1987). Statis t i c a l charact e r i s t i c s of English sentenc e s written by the Japanese and detec ting and correcting spelling-errors.
  16. (1987). Statistical characteristics of English sentences written by the Japanese and detecting and correcting spelling-errors.
  17. (1998). The comput e r learn e r co rpus: a versatile new source of data for SLA researc h .
  18. (1998). The computer learner corpus: a versatile new source of data for SLA research.
  19. (1970). The contrastive analysis hypothesis and spelling errors. doi
  20. (1974). The string-to-string correction problem. doi
  21. (1970). Ziahoss e i n y

To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.