Search CORE

597 research outputs found

A Mention-Ranking Model for Abstract Anaphora Resolution

Author: Born Leo
Frank Anette
Marasović Ana
Opitz Juri
Publication venue
Publication date: 01/01/2017
Field of study

Resolving abstract anaphora is an important, but difficult task for text understanding. Yet, with recent advances in representation learning this task becomes a more tangible aim. A central property of abstract anaphora is that it establishes a relation between the anaphor embedded in the anaphoric sentence and its (typically non-nominal) antecedent. We propose a mention-ranking model that learns how abstract anaphors relate to their antecedents with an LSTM-Siamese Net. We overcome the lack of training data by generating artificial anaphoric sentence--antecedent pairs. Our model outperforms state-of-the-art results on shell noun resolution. We also report first benchmark results on an abstract anaphora subset of the ARRAU corpus. This corpus presents a greater challenge due to a mixture of nominal and pronominal anaphors and a greater range of confounders. We found model variants that outperform the baselines for nominal anaphors, without training on individual anaphor data, but still lag behind for pronominal anaphors. Our model selects syntactically plausible candidates and -- if disregarding syntax -- discriminates candidates using deeper features.Comment: In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP). Copenhagen, Denmar

arXiv.org e-Print Archive

TUbiblio

Crossref

Anaphora and Discourse Structure

Author: Joshi Aravind
Knott Alistair
Stone Matthew
Webber Bonnie
Publication venue
Publication date: 13/09/2002
Field of study

We argue in this paper that many common adverbial phrases generally taken to signal a discourse relation between syntactically connected units within discourse structure, instead work anaphorically to contribute relational meaning, with only indirect dependence on discourse structure. This allows a simpler discourse structure to provide scaffolding for compositional semantics, and reveals multiple ways in which the relational meaning conveyed by adverbial connectives can interact with that associated with discourse structure. We conclude by sketching out a lexicalised grammar for discourse that facilitates discourse interpretation as a product of compositional rules, anaphor resolution and inference.Comment: 45 pages, 17 figures. Revised resubmission to Computational Linguistic

arXiv.org e-Print Archive

CiteSeerX

Deep Learning With Sentiment Inference For Discourse-Oriented Opinion Analysis

Author: Marasovic Ana
Publication venue
Publication date: 01/01/2020
Field of study

Opinions are omnipresent in written and spoken text ranging from editorials, reviews, blogs, guides, and informal conversations to written and broadcast news. However, past research in NLP has mainly addressed explicit opinion expressions, ignoring implicit opinions. As a result, research in opinion analysis has plateaued at a somewhat superficial level, providing methods that only recognize what is explicitly said and do not understand what is implied. In this dissertation, we develop machine learning models for two tasks that presumably support propagation of sentiment in discourse, beyond one sentence. The first task we address is opinion role labeling, i.e.\ the task of detecting who expressed a given attitude toward what or who. The second task is abstract anaphora resolution, i.e.\ the task of finding a (typically) non-nominal antecedent of pronouns and noun phrases that refer to abstract objects like facts, events, actions, or situations in the preceding discourse. We propose a neural model for labeling of opinion holders and targets and circumvent the problems that arise from the limited labeled data. In particular, we extend the baseline model with different multi-task learning frameworks. We obtain clear performance improvements using semantic role labeling as the auxiliary task. We conduct a thorough analysis to demonstrate how multi-task learning helps, what has been solved for the task, and what is next. We show that future developments should improve the ability of the models to capture long-range dependencies and consider other auxiliary tasks such as dependency parsing or recognizing textual entailment. We emphasize that future improvements can be measured more reliably if opinion expressions with missing roles are curated and if the evaluation considers all mentions in opinion role coreference chains as well as discontinuous roles. To the best of our knowledge, we propose the first abstract anaphora resolution model that handles the unrestricted phenomenon in a realistic setting. We cast abstract anaphora resolution as the task of learning attributes of the relation that holds between the sentence with the abstract anaphor and its antecedent. We propose a Mention-Ranking siamese-LSTM model (MR-LSTM) for learning what characterizes the mentioned relation in a data-driven fashion. The current resources for abstract anaphora resolution are quite limited. However, we can train our models without conventional data for abstract anaphora resolution. In particular, we can train our models on many instances of antecedent-anaphoric sentence pairs. Such pairs can be automatically extracted from parsed corpora by searching for a common construction which consists of a verb with an embedded sentence (complement or adverbial), applying a simple transformation that replaces the embedded sentence with an abstract anaphor, and using the cut-off embedded sentence as the antecedent. We refer to the extracted data as silver data. We evaluate our MR-LSTM models in a realistic task setup in which models need to rank embedded sentences and verb phrases from the sentence with the anaphor as well as a few preceding sentences. We report the first benchmark results on an abstract anaphora subset of the ARRAU corpus \citep{uryupina_et_al_2016} which presents a greater challenge due to a mixture of nominal and pronominal anaphors as well as a greater range of confounders. We also use two additional evaluation datasets: a subset of the CoNLL-12 shared task dataset \citep{pradhan_et_al_2012} and a subset of the ASN corpus \citep{kolhatkar_et_al_2013_crowdsourcing}. We show that our MR-LSTM models outperform the baselines in all evaluation datasets, except for events in the CoNLL-12 dataset. We conclude that training on the small-scale gold data works well if we encounter the same type of anaphors at the evaluation time. However, the gold training data contains only six shell nouns and events and thus resolution of anaphors in the ARRAU corpus that covers a variety of anaphor types benefits from the silver data. Our MR-LSTM models for resolution of abstract anaphors outperform the prior work for shell noun resolution \citep{kolhatkar_et_al_2013} in their restricted task setup. Finally, we try to get the best out of the gold and silver training data by mixing them. Moreover, we speculate that we could improve the training on a mixture if we: (i) handle artifacts in the silver data with adversarial training and (ii) use multi-task learning to enable our models to make ranking decisions dependent on the type of anaphor. These proposals give us mixed results and hence a robust mixed training strategy remains a challenge

Heidelberger Dokumentenserver

Discourse Anaphora and Anaphora Resolution in a Natural Language Interface to a Database Question Answering System

Author: Sommerville Stephen
Zahri Nor Aliah bt. Mohd
Publication venue: The Logico-Linguistic Society of Japan
Publication date: 01/01/1995
Field of study

Waseda University Repository

Incremental Interpretation: Applications, Theory, and Relationship to Dynamic Semantics

Author: Cooper Robin
Milward David
Publication venue
Publication date: 01/01/1994
Field of study

Why should computers interpret language incrementally? In recent years psycholinguistic evidence for incremental interpretation has become more and more compelling, suggesting that humans perform semantic interpretation before constituent boundaries, possibly word by word. However, possible computational applications have received less attention. In this paper we consider various potential applications, in particular graphical interaction and dialogue. We then review the theoretical and computational tools available for mapping from fragments of sentences to fully scoped semantic representations. Finally, we tease apart the relationship between dynamic semantics and incremental interpretation.Comment: Procs. of COLING 94, LaTeX (2.09 preferred), 8 page

arXiv.org e-Print Archive

CiteSeerX

Anaphora and Discourse Semantics

Author: Joshi Aravind
Knott Alistair
Stone Matthew
Webber Bonnie L
Publication venue: ScholarlyCommons
Publication date: 01/01/2001
Field of study

We argue in this paper that many common adverbial phrases generally taken to be discourse connectives signalling discourse relations between adjacent discourse units are instead anaphors. We do this by (i) demonstrating their behavioral similarity with more common anaphors (pronouns and definite NPs); (ii) presenting a general framework for understanding anaphora into which they nicely fit; (iii) showing the interpretational benefits of understanding discourse adverbials as anaphors; and (iv) sketching out a lexicalised grammar that facilitates discourse interpretation as a product of compositional rules, anaphor resolution and inference

ScholarlyCommons@Penn

Identifying Co-reference of Zibun and Caki: The Case of Reflexives in Japanese and Korean

Author: Juffs Alan
Li Noriyasu
Publication venue: Ohio State University. Libraries
Publication date: 01/10/2018
Field of study

This study examines the properties of co-reference in DPs and the Japanese reflexive zibun, and the Korean reflexive caki. We posit that the resolution of local and long distance binding ambiguity in Japanese and Korean is influenced by the case particles that mark the reflexives. Results from a truth-value judgment task showed that Japanese and Koreans not only have different binding patterns but local and long distance binding varies based on case-marked reflexives. Bonferroni post-hoc tests revealed that Japanese prefer local binding when zibun is marked by the nominative case and long distance binding for the dative and accusative cases, while the Koreans prefer long distance binding when caki is marked by the genitive, dative, and accusative cases. Overall, our results show that further studies of reflexives should closely examine the role of case markers in ambiguity resolution and also examine how native speakers parse and process ambiguous sentences

KnowledgeBank at OSU

Presupposition projection as proof construction

Author: A Ranta
D Beaver
D Beaver
D Milward
E Krahmer
E Krahmer
E Krahmer
H Clark
H Kamp
H Zeevat
HB Curry
HC Bunt
HP Barendregt
J Bos
JR Hobbs
N Asher
P Krause
P Martin-Löf
P Piwek
P Piwek
P Piwek
R Ahn
R Ahn
R Sandt van der
R Stalnaker
RJ Beun
W Saurer
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/1999
Field of study

Even though Van der Sandt's presuppositions as anaphora approach is empirically successful, it fails to give a formal account of the interaction between world-knowledge and presuppositions. In this paper, an algorithm is sketched which is based on the idea of presuppositions as anaphora. It improves on this approach by employing a deductive system, Constructive Type Theory (CTT), to get a formal handle on the way world-knowledge influences presupposition projection. In CTT, proofs for expressions are explicitly represented as objects. These objects can be seen as a generalization of DRT's discourse markers. They are useful in dealing with presuppositional phenomena which require world-knowledge, such as Clark's bridging examples and Beaver's conditional presuppositions

Repository TU/e

Crossref

Pure OAI Repository

Open Research Online (The Open University)

Tilburg University Repository

Structured Access in Sentence Comprehension

Author: Dillon Brian William
Publication venue
Publication date: 01/01/2011
Field of study

This thesis is concerned with the nature of memory access during the construction of long-distance dependencies in online sentence comprehension. In recent years, an intense focus on the computational challenges posed by long-distance dependencies has proven to be illuminating with respect to the characteristics of the architecture of the human sentence processor, suggesting a tight link between general memory access procedures and sentence processing routines (Lewis & Vasishth 2005; Lewis, Vasishth, & Van Dyke 2006; Wagers, Lau & Phillips 2009). The present thesis builds upon this line of research, and its primary aim is to motivate and defend the hypothesis that the parser accesses linguistic memory in an essentially structured fashion for certain long-distance dependencies. In order to make this case, I focus on the processing of reflexive and agreement dependencies, and ask whether or not non-structural information such as morphological features are used to gate memory access during syntactic comprehension. Evidence from eight experiments in a range of methodologies in English and Chinese is brought to bear on this question, providing arguments from interference effects and time-course effects that primarily syntactic information is used to access linguistic memory in the construction of certain long-distance dependencies. The experimental evidence for structured access is compatible with a variety of architectural assumptions about the parser, and I present one implementation of this idea in a parser based on the ACT-R memory architecture. In the context of such a content-addressable model of memory, the claim of structured access is equivalent to the claim that only syntactic cues are used to query memory. I argue that structured access reflects an optimal parsing strategy in the context of a noisy, interference-prone cognitive architecture: abstract structural cues are favored over lexical feature cues for certain structural dependencies in order to minimize memory interference in online processing

Digital Repository at the University of Maryland

Between anaphora and deixis...the resolution of the demonstrative noun-phrase ‘that N’

Author: Cowles H Wind
Fossard Marion
Garnham Alan
Publication venue: 'Informa UK Limited'
Publication date: 02/11/2011
Field of study

Three experiments examined the hypothesis that the demonstrative noun phrase (NP) that N, as an anadeictic expression, preferentially refers to the less salient referent in a discourse representation when used anaphorically, whereas the anaphoric pronoun he or she preferentially refers to the highly-focused referent. The findings, from a sentence completion task and two reading time experiments that used gender to create ambiguous and unambiguous coreference, reveal that the demonstrative NP specifically orients processing toward a less salient referent when there is no gender cue discriminating between different possible referents. These findings show the importance of taking into account the discourse function of the anaphor itself and its influence on the process of searching for the referent

RERO DOC Digital Library

Sussex Research Online