30 research outputs found

Span-Based LCFRS-2 Parsing

Author: Stanojević Miloš
Steedman Mark
Publication date: 01/01/2020

Crossref

Edinburgh Research Explorer

Discontinuous Data-Oriented Parsing: A mildly context-sensitive all-fragments grammar

Author: Andreas van Cranenburgh
Remko Scha
Sangati Federico
Publication date: 01/01/2011

Recent advances in parsing technology have made treebank parsing with discontinuous constituents possible, with parser output of competitive quality (Kallmeyer and Maier, 2010). We apply Data-Oriented Parsing (DOP) to a grammar formalism that allows for discontinuous trees (LCFRS). Decisions during parsing are conditioned on all possible fragments, resulting in improved performance. Despite the fact that both DOP and discontinuity present formidable challenges in terms of computational complexity, the model is reasonably efficient, and surpasses the state of the art in discontinuous parsing.

CiteSeerX

Archivio della ricerca - Fondazione Bruno Kessler

ARCHIVIO ISTITUZIONALE DELLA RICERCA-UNIVERSITA' DEGLI STUDI DI NAPOLI "L'ORIENTALE"

Università degli Studi di Napoli L'Orientale: CINECA IRIS

International Migration, Integration and Social Cohesion online publications

UvA-DARE

Parsing as Reduction

Author: Fernández-González Daniel
Martins André F. T.
Publication date: 01/01/2015

We reduce phrase-representation parsing to dependency parsing. Our reduction is grounded on a new intermediate representation, "head-ordered dependency trees", shown to be isomorphic to constituent trees. By encoding order information in the dependency labels, we show that any off-the-shelf, trainable dependency parser can be used to produce constituents. When this parser is non-projective, we can perform discontinuous parsing in a very natural manner. Despite the simplicity of our approach, experiments show that the resulting parsers are on par with strong baselines, such as the Berkeley parser for English and the best single system in the SPMRL-2014 shared task. Results are particularly striking for discontinuous parsing of German, where we surpass the current state of the art by a wide margin.

arXiv.org e-Print Archive

Crossref

Discontinuous Data-Oriented Parsing: A mildly context-sensitive all-fragments grammar

Author: Sangati F.
Scha R.
van Cranenburgh A.
Publication venue: The Association for Computational Linguistics
Publication date: 01/01/2011

International Migration, Integration and Social Cohesion online publications

Synchronous Context-Free Grammars and Optimal Linear Parsing Strategies

Author: Crescenzi Pierluigi
Gildea Daniel
Marino Andrea
Rossi Gianluca
Satta Giorgio
Publication date: 25/11/2013

Synchronous Context-Free Grammars (SCFGs), also known as syntax-directed translation schemata, are unlike context-free grammars in that they do not have a binary normal form. In general, parsing with SCFGs takes space and time polynomial in the length of the input strings, but with the degree of the polynomial depending on the permutations of the SCFG rules. We consider linear parsing strategies, which add one nonterminal at a time. We show that for a given input permutation, the problems of finding the linear parsing strategy with the minimum space and time complexity are both NP-hard.

arXiv.org e-Print Archive

CiteSeerX

A declarative characterization of different types of multicomponent tree adjoining grammars

Author: Kallmeyer Laura
Publication date: 01/01/2009

Multicomponent Tree Adjoining Grammars (MCTAGs) are a formalism that has been shown to be useful for many natural language applications. The definition of non-local MCTAG, however, is problematic since it refers to the process of the derivation itself: a simultaneity constraint must be respected concerning the way the members of the elementary tree sets are added. Looking only at the result of a derivation (i.e., the derived tree and the derivation tree), this simultaneity is no longer visible and therefore cannot be checked. That is, this way of characterizing MCTAG does not allow one to abstract away from the concrete order of derivation. In this paper, we propose an alternative definition of MCTAG that characterizes the trees in the tree language of an MCTAG via the properties of the derivation trees (in the underlying TAG) the MCTAG licenses. We provide similar characterizations for various types of MCTAG. These characterizations give a better understanding of the formalisms, they allow a more systematic comparison of different types of MCTAG, and, furthermore, they can be exploited for parsing.

Hochschulschriftenserver - Universität Frankfurt am Main

Neural Combinatory Constituency Parsing

Author: CHEN Zhousi
チンチュウシ
陳宙斯
Publication date: 25/03/2023

Tokyo Metropolitan University, Doctorate (Information Science), doctoral thesis

Tokyo Metropolitan University Institutional Repository Miyako-Dori / 首都大学東京機関リポジトリ

A derivational model of discontinuous parsing

Author: D Hays
F Jelinek
H Gaifman
J Nivre
M Collins
M Kuhlmann
V Sornlertlamvanich
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

The notion of latent-variable probabilistic context-free derivation of syntactic structures is enhanced to allow heads and unrestricted discontinuities. The chosen formalization covers both constituent parsing and dependency parsing. The derivational model is accompanied by an equivalent probabilistic automaton model. By the new framework, one obtains a probability distribution over the space of all discontinuous parses. This lends itself to intrinsic evaluation in terms of perplexity, as shown in experiments.

Crossref

University of St. Andrews - Pure

St Andrews Research Repository

Two characterisation results of multiple context-free grammars and their application to parsing

Author: Denkinger Tobias
Publication date: 20/02/2020

In the first part of this thesis, a Chomsky-Schützenberger characterisation and an automaton characterisation of multiple context-free grammars are proved. Furthermore, a framework for approximation of automata with storage is described. The second part develops each of the three theoretical results into a parsing algorithm.

Technische Universität Dresden: Qucosa

Parsing TAG with Abstract Categorial Grammar.

Author: Salvati Sylvain
Publication venue: HAL CCSD
Publication date: 01/01/2006

This paper informally presents an Earley algorithm for TAG which behaves as the algorithm given by [SJ88]. This algorithm is a specialization to TAG of a more general algorithm dedicated to second-order ACGs. As second-order ACGs can encode Linear Context-Free Rewriting Systems (LCFRS) [dGP04], the main purpose of this paper is to give a rough presentation of formal tools which can be used to design efficient algorithms for LCFRS.

INRIA a CCSD electronic archive server