2,150 research outputs found

    A database of semantic clusters of verb usages

    Get PDF
    We are presenting VPS-30-En, a small lexical resource that contains the following 30 English verbs: access, ally, arrive, breathe, claim, cool, crush, cry, deny, enlarge, enlist, forge, furnish, hail, halt, part, plough, plug, pour, say, smash, smell, steer, submit, swell, tell, throw, trouble, wake and yield. We have created and have been using VPS-30-En to explore the interannotator agreement potential of the Corpus Pattern Analysis. VPS-30-En is a small snapshot of the Pattern Dictionary of English Verbs (Hanks and Pustejovsky, 2005), which we revised (both the entries and the annotated concordances) and enhanced with additional annotations. It is freely available at http://ufal.mff.cuni.cz/spr. In this paper, we compare the annotation scheme of VPS-30-En with the original PDEV. We also describe the adjustments we have made and their motivation, as well as the most pervasive causes of interannotator disagreements

    Vector Space Models and the usage patterns of Indonesian denominal verbs: A case study of verbs with meN-, meN-/-kan, and meN-/-i affixes

    Get PDF

    Weaving creativity into the Semantic Web: a language-processing approach

    Get PDF
    This paper describes a novel language processing ap- proach to the analysis of creativity and the development of a machine-readable ontology of creativity. The ontol- ogy provides a conceptualisation of creativity in terms of a set of fourteen key components or building blocks and has application to research into the nature of cre- ativity in general and to the evaluation of creative prac- tice, in particular. We further argue that the provision of a machine readable conceptualisation of creativity pro- vides a small, but important step towards addressing the problem of automated evaluation, ā€™the Achillesā€™ heel of AI research on creativityā€™ (Boden 1999)

    Learning constructions from bilingual exposure:Computational studies of argument structure acquisition

    Get PDF

    Formulaic Language and Adjective Categories in Eight Centuries of the Spanish Expression of \u27Becoming\u27 /quedar(se)/ + ADJ

    Get PDF
    The purpose of this study is to track the diachronic development of exemplar clusters formed by the adjectives in the Spanish expression of becoming quedar(se) + ADJ (e.g. quedar(se) solo to be left alone\u27, quedar(se) espantado \u27to get scared\u27). This approach applies the same system of analysis used by Bybee & Eddington (2006) in their synchronic study of four verb + adjective combinations used to denote a change of state (ponerse + ADJ, hacerse + ADJ, quedarse + ADJ, and volverse + ADJ). Although there are diachronic studies that follow the development of constructions employing usage-based approaches (Israel 1996), and there are synchronic studies that apply the exemplar model to the analysis of constructions in Spanish (Bybee & Eddington 1996) this thesis is the first in-depth diachronic study known to the author that uses the exemplar model to account for the development of a construction over several centuries. It represents the continuation of preliminary studies by the author (Wilson 2006, 2009) but is much more in-depth, analyzing data from the 13th to the 19th centuries. A total of 1,374 tokens were analyzed in the time span indicated and were extracted from a corpus of 5,394,606 words compiled from 119 narrative (or narrative-like) written works. This study provides evidence that, (a) frequently occurring verb + adjective combinations, such as prefabs, serve as the central members of exemplar categories, (b) formulaic combinations, such as prefabs, have longevity, (c) the categories to which formulaic sequences belong have longevity, (d) categories mutate over time by becoming more centralized, changing central members, expanding to new types or by contracting as types are lost, and (e) there is a correlation between the token frequency of the central member of a category and the type frequency of that category. The data studied in this thesis suggest that the rise in overall standardized frequency of quedar(se) + ADJ has to do with the decrease in frequency of a similar expression of \u27becoming\u27, fincar(se) + ADJ, as many adjectives associated with the latter come to be used with the former. In tracking the continuum of adjective categories over time, this investigation provides insight into diachronic trends of formulaic language showing trends of emergence as categories expand, contraction as forms fall into disuse, and stability as some categories change very little over time
    • ā€¦
    corecore