Location of Repository

A simple DOP model for constituency parsing of Italian sentences

By Federico Sangati

Abstract

Abstract. We present a simplified Data-Oriented Parsing (DOP) formalism for learning the constituency structure of Italian sentences. In our approach we try to simplify the original DOP methodology by constraining the number and type of fragments we extract from the training corpus. We provide some examples of the types of constructions that occur more often in the treebank, and quantify the performance of our grammar on the constituency parsing task. Keywords: Data-Oriented Parsing, Tree substitution grammar, statistical model, fragments, kernel methods

Year: 2010
OAI identifier: oai:CiteSeerX.psu:10.1.1.163.4048
Provided by: CiteSeerX
Download PDF:
Sorry, we are unable to provide the full text but you may find it at the following location(s):
  • http://citeseerx.ist.psu.edu/v... (external link)
  • http://staff.science.uva.nl/~f... (external link)
  • Suggested articles


    To submit an update or takedown request for this paper, please submit an Update/Correction/Removal Request.