Search CORE

885 research outputs found

Disambiguating Nouns, Verbs, and Adjectives Using Automatically Acquired Selectional Preferences

Author: Briscoe Ted
Diana McCarthy
John Carroll
Li Hang
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2003
Field of study

Selectional preferences have been used by word sense disambiguation (WSD) systems as one source of disambiguating information. We evaluate WSD using selectional preferences acquired for English adjective—noun, subject, and direct object grammatical relationships with respect to a standard test corpus. The selectional preferences are specific to verb or adjective classes, rather than individual word forms, so they can be used to disambiguate the co-occurring adjectives and verbs, rather than just the nominal argument heads. We also investigate use of the one-senseper-discourse heuristic to propagate a sense tag for a word to other occurrences of the same word within the current document in order to increase coverage. Although the preferences perform well in comparison with other unsupervised WSD systems on the same corpus, the results show that for many applications, further knowledge sources would be required to achieve an adequate level of accuracy and coverage. In addition to quantifying performance, we analyze the results to investigate the situations in which the selectional preferences achieve the best precision and in which the one-sense-per-discourse heuristic increases performance

CiteSeerX

Crossref

Sussex Research Online

The interaction of knowledge sources in word sense disambiguation

Author: Brill Eric
Daelemans Walter
Daelemans Walter
Ide Nancy
Kilgarriff Adam
Marcus Mitchell
Mark Stevenson
Masterman Margaret
McRoy Susan
Yorick Wilks
Publication venue: 'MIT Press - Journals'
Publication date: 01/01/2001
Field of study

Word sense disambiguation (WSD) is a computational linguistics task likely to benefit from the tradition of combining different knowledge sources in artificial in telligence research. An important step in the exploration of this hypothesis is to determine which linguistic knowledge sources are most useful and whether their combination leads to improved results. We present a sense tagger which uses several knowledge sources. Tested accuracy exceeds 94% on our evaluation corpus.Our system attempts to disambiguate all content words in running text rather than limiting itself to treating a restricted vocabulary of words. It is argued that this approach is more likely to assist the creation of practical systems

CiteSeerX

Crossref

White Rose Research Online

Metaphor Identification in Large Texts Corpora

Author: Argamon Shlomo
Assaf Dan
Cohen Yohai
Frieder Ophir
Howard Newton
Last Mark
Neuman Yair
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/12/2012
Field of study

Identifying metaphorical language-use (e.g., sweet child) is one of the challenges facing natural language processing. This paper describes three novel algorithms for automatic metaphor identification. The algorithms are variations of the same core algorithm. We evaluate the algorithms on two corpora of Reuters and the New York Times articles. The paper presents the most comprehensive study of metaphor identification in terms of scope of metaphorical phrases and annotated corpora size. Algorithms’ performance in identifying linguistic phrases as metaphorical or literal has been compared to human judgment. Overall, the algorithms outperform the state-of-the-art algorithm with 71% precision and 27% averaged improvement in prediction over the base-rate of metaphors in the corpus.United States. Intelligence Advanced Research Projects Activity (IARPA)United States. Dept. of Defense (U.S. Army Research Laboratory Contract W911NF-12-C-0021

Public Library of Science (PLOS)

DSpace@MIT

Crossref

Directory of Open Access Journals

PubMed Central

The Touro College and University System

Opposition theory and computational semiotics

Author: Assaf Dan
Cohen Yochai
Danesi Marcel
Neuman Yair
Publication venue: 'University of Tartu'
Publication date: 30/11/2015
Field of study

Opposition theory suggests that binary oppositions (e.g., high vs. low) underlie basic cognitive and linguistic processes. However, opposition theory has never been implemented in a computational cognitive-semiotics model. In this paper, we present a simple model of metaphor identification that relies on opposition theory. An algorithm instantiating the model has been tested on a data set of 100 phrases comprising adjective-noun pairs in which approximately a half represent metaphorical language-use (e.g., dark thoughts) and the rest literal language-use (e.g., dark hair). The algorithm achieved 89% accuracy in metaphor identification and illustrates the relevance of opposition theory for modelling metaphor processing

Journals from University of Tartu

Crossref

Estimating linear models for compositional distributional semantics

Author: Fallucchi F
Korkontzelos I
Manandhar S
Zanzotto Fm
Publication venue
Publication date: 01/01/2010
Field of study

ART

Acquisition and modeling of lexical knowledge: a corpus-based investigation of systematic polysemy

Author: Lapata Maria
Publication venue: The University of Edinburgh
Publication date: 01/01/2000
Field of study

Edinburgh Research Archive

Distributional Tensor Space Model of Natural Language Semantics

Author: Giesbrecht Eugenie
Publication venue: KIT-Bibliothek, Karlsruhe
Publication date: 01/01/2014
Field of study

We propose a novel Distributional Tensor Space Model of natural language semantics employing 3d order tensors that accounts for order dependent word contexts and assigns to words characteristic matrices such that semantic composition can be realized in a linguistically and cognitively plausible way. The proposed model achieves state-of-the-art results for important tasks of linguistic semantics by using a relatively small text corpus and without any sophisticated preprocessing

KITopen

A distributional investigation of German verbs

Author: Roberts William
Publication venue: Humboldt-Universität zu Berlin
Publication date: 14/06/2023
Field of study

Diese Dissertation bietet eine empirische Untersuchung deutscher Verben auf der Grundlage statistischer Beschreibungen, die aus einem großen deutschen Textkorpus gewonnen wurden. In einem kurzen Überblick über linguistische Theorien zur lexikalischen Semantik von Verben skizziere ich die Idee, dass die Verbbedeutung wesentlich von seiner Argumentstruktur (der Anzahl und Art der Argumente, die zusammen mit dem Verb auftreten) und seiner Aspektstruktur (Eigenschaften, die den zeitlichen Ablauf des vom Verb denotierten Ereignisses bestimmen) abhängt. Anschließend erstelle ich statistische Beschreibungen von Verben, die auf diesen beiden unterschiedlichen Bedeutungsfacetten basieren. Insbesondere untersuche ich verbale Subkategorisierung, Selektionspräferenzen und Aspekt. Alle diese Modellierungsstrategien werden anhand einer gemeinsamen Aufgabe, der Verbklassifikation, bewertet. Ich zeige, dass im Rahmen von maschinellem Lernen erworbene Merkmale, die verbale lexikalische Aspekte erfassen, für eine Anwendung von Vorteil sind, die Argumentstrukturen betrifft, nämlich semantische Rollenkennzeichnung. Darüber hinaus zeige ich, dass Merkmale, die die verbale Argumentstruktur erfassen, bei der Aufgabe, ein Verb nach seiner Aspektklasse zu klassifizieren, gut funktionieren. Diese Ergebnisse bestätigen, dass diese beiden Facetten der Verbbedeutung auf grundsätzliche Weise zusammenhängen.This dissertation provides an empirical investigation of German verbs conducted on the basis of statistical descriptions acquired from a large corpus of German text. In a brief overview of the linguistic theory pertaining to the lexical semantics of verbs, I outline the idea that verb meaning is composed of argument structure (the number and types of arguments that co-occur with a verb) and aspectual structure (properties describing the temporal progression of an event referenced by the verb). I then produce statistical descriptions of verbs according to these two distinct facets of meaning: In particular, I examine verbal subcategorisation, selectional preferences, and aspectual type. All three of these modelling strategies are evaluated on a common task, automatic verb classification. I demonstrate that automatically acquired features capturing verbal lexical aspect are beneficial for an application that concerns argument structure, namely semantic role labelling. Furthermore, I demonstrate that features capturing verbal argument structure perform well on the task of classifying a verb for its aspectual type. These findings suggest that these two facets of verb meaning are related in an underlying way

Dokumenten-Publikationsserver der Humboldt-Universität zu Berlin