    Linguistic Annotation in/for Corpus Linguistics

    This article surveys linguistic annotation in corpora and corpus linguistics. We first define the concept of ‘corpus’ as a radial category and then, in Sect. 2, discuss a variety of kinds of information for which corpora are annotated and that are exploited in contemporary corpus linguistics. Section 3 then exemplifies many current formats of annotation with an eye to highlighting both the diversity of formats currently available and the emergence of XML annotation as, for now, the most widespread form of annotation. Section 4 summarizes and concludes with desiderata for future developments.