Towards Semi-Automated Annotation for Prepositional Phrase Attachment

Abstract

This paper investigates whether high-quality annotations for tasks involving semantic disambiguation can be obtained without a major investment in time or expense. We examine the use of untrained human volunteers from Amazon’s Mechanical Turk in disambiguating prepositional phrase (PP) attachment over sentences drawn from the Wall Street Journal corpus. Our goal is to compare the performance of these crowdsourced judgments against the annotations supplied by trained linguists for the Penn Treebank project, in order to assess the viability of this approach for annotation projects that involve contextual disambiguation. The results of our experiments show that invoking majority agreement between multiple human workers can yield PP attachments with fairly high precision, confirming that this crowdsourcing approach to syntactic annotation holds promise for the generation of training corpora in new domains and genres.
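The aggregation step described above can be made concrete with a short sketch. The following Python fragment is illustrative only and is not drawn from the paper: it assumes each sentence receives a small list of worker labels ("V" for verb attachment, "N" for noun attachment), takes a majority vote that abstains when no label reaches a minimum agreement threshold, and scores the decided items against hypothetical gold attachments (here standing in for the Penn Treebank annotations). All function and variable names are invented for the example.

    from collections import Counter

    def majority_vote(labels, min_agreement=2):
        """Return the majority label among worker judgments, or None
        (abstain) if no label reaches the agreement threshold."""
        if not labels:
            return None
        label, count = Counter(labels).most_common(1)[0]
        return label if count >= min_agreement else None

    def precision_on_agreed(worker_labels, gold_labels):
        """Precision over items where the crowd reached a majority:
        correct majority decisions / all majority decisions."""
        decided = correct = 0
        for item_id, labels in worker_labels.items():
            decision = majority_vote(labels)
            if decision is None:
                continue  # no majority: item is left unannotated
            decided += 1
            if decision == gold_labels[item_id]:
                correct += 1
        return correct / decided if decided else 0.0

    # Hypothetical example: three Turk workers judge each PP attachment.
    workers = {"s1": ["V", "V", "N"], "s2": ["N", "N", "N"], "s3": ["V", "N"]}
    gold = {"s1": "V", "s2": "N", "s3": "N"}
    print(precision_on_agreed(workers, gold))  # 1.0; s3 is abstained on

Abstaining on ties is what lets majority agreement trade coverage for precision: the crowd annotates only the items on which it agrees, which is consistent with the "fairly high precision" result the abstract reports.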
