Extracting Collocations from Specialised Corpora

By Michael Barlow and Ute Römer

Abstract

As the growing number of publications on the topic indicates (see Evert 2005), collocations, which can be defined as frequently occurring contiguous or noncontiguous combinations of words, are of central interest in linguistic analysis and description. Although corpora and concordance packages offer important insights into the co-selectional tendencies of words, there is as yet no definitive way to locate collocations in a corpus. However, a new generation of software tools is now becoming available that enables users to extract lists of candidate collocations from a corpus for inspection. One of these new-generation tools is Collocate (Barlow 2004). Collocate uses frequency information and statistical analyses (t-score, log likelihood, mutual information (MI)) to retrieve lists of: (a) collocations with a specified search word and within a set span (e.g. four words), (b) n-grams (lexical bundles) of different lengths, an
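
To make the span-based retrieval described in (a) concrete, here is a minimal Python sketch that counts the words occurring within a set span of a search word and scores them with MI and t-score. It is not Collocate's implementation, which is not described here. The function name `collocates`, the whitespace tokenisation in the usage example, and the expected-frequency estimate (node frequency × collocate frequency × window size ÷ corpus size) are assumptions chosen for illustration; other formulations of these measures exist.

```python
import math
from collections import Counter


def collocates(tokens, node, span=4):
    """Score words that co-occur with `node` within `span` tokens on either side.

    Hypothetical helper for illustration only; not taken from Collocate.
    """
    n = len(tokens)
    freq = Counter(tokens)   # frequency of every word in the corpus
    observed = Counter()     # co-occurrence counts with the node word

    for i, tok in enumerate(tokens):
        if tok != node:
            continue
        lo, hi = max(0, i - span), min(n, i + span + 1)
        for j in range(lo, hi):
            if j != i:
                observed[tokens[j]] += 1

    window = 2 * span        # number of positions inspected around each node
    scores = {}
    for word, o in observed.items():
        # Assumed estimate of the co-occurrence frequency expected by chance.
        e = freq[node] * freq[word] * window / n
        scores[word] = {
            "freq": o,
            "mi": math.log2(o / e),          # mutual information
            "t": (o - e) / math.sqrt(o),     # t-score
        }
    return scores


if __name__ == "__main__":
    text = "strong tea and strong coffee but weak tea"
    results = collocates(text.split(), "strong", span=1)
    # List candidate collocates, highest MI first, for manual inspection.
    for word, s in sorted(results.items(), key=lambda kv: -kv[1]["mi"]):
        print(word, s)
```

MI tends to favour rare, exclusive pairings, while the t-score favours frequent ones, which is one reason to inspect the candidate lists under more than one measure.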

Year: 2013
OAI identifier: oai:CiteSeerX.psu:10.1.1.309.4567
Provided by: CiteSeerX