Abstract. We present a simplified Data-Oriented Parsing (DOP) formalism for learning the constituency structure of Italian sentences. Our approach simplifies the original DOP methodology by constraining the number and type of fragments extracted from the training corpus. We give examples of the constructions that occur most frequently in the treebank and quantify the performance of our grammar on the constituency parsing task.

Keywords: Data-Oriented Parsing, tree substitution grammar, statistical model, fragments, kernel methods