Search CORE

21,599 research outputs found

Combination Strategies for Semantic Role Labeling

Author: Carreras X.
Comas P. R.
Marquez L.
Surdeanu M.
Publication venue: 'AI Access Foundation'
Publication date: 04/10/2011
Field of study

This paper introduces and analyzes a battery of inference models for the problem of semantic role labeling: one based on constraint satisfaction, and several strategies that model the inference as a meta-learning problem using discriminative classifiers. These classifiers are developed with a rich set of novel features that encode proposition and sentence-level information. To our knowledge, this is the first work that: (a) performs a thorough analysis of learning-based inference models for semantic role labeling, and (b) compares several inference strategies in this context. We evaluate the proposed inference strategies in the framework of the CoNLL-2005 shared task using only automatically-generated syntactic information. The extensive experimental evaluation and analysis indicates that all the proposed inference strategies are successful -they all outperform the current best results reported in the CoNLL-2005 evaluation exercise- but each of the proposed approaches has its advantages and disadvantages. Several important traits of a state-of-the-art SRL combination strategy emerge from this analysis: (i) individual models should be combined at the granularity of candidate arguments rather than at the granularity of complete solutions; (ii) the best combination strategy uses an inference model based in learning; and (iii) the learning-based inference benefits from max-margin classifiers and global feedback

arXiv.org e-Print Archive

Crossref

Deep Semantic Role Labeling with Self-Attention

Author: Chen Yidong
Shi Xiaodong
Tan Zhixing
Wang Mingxuan
Xie Jun
Publication venue
Publication date: 05/12/2017
Field of study

Semantic Role Labeling (SRL) is believed to be a crucial step towards natural language understanding and has been widely studied. Recent years, end-to-end SRL with recurrent neural networks (RNN) has gained increasing attention. However, it remains a major challenge for RNNs to handle structural information and long range dependencies. In this paper, we present a simple and effective architecture for SRL which aims to address these problems. Our model is based on self-attention which can directly capture the relationships between two tokens regardless of their distance. Our single model achieves F

_1=83.4

on the CoNLL-2005 shared task dataset and F

_1=82.7

on the CoNLL-2012 shared task dataset, which outperforms the previous state-of-the-art results by

1.8

and

1.0

_1

score respectively. Besides, our model is computationally efficient, and the parsing speed is 50K tokens per second on a single Titan X GPU.Comment: Accepted by AAAI-201

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications