Search CORE

1,841 research outputs found

A Variant of Earley Parsing

Author: Nederhof Mark-Jan
Satta Giorgio
Publication venue
Publication date: 01/01/1997
Field of study

The Earley algorithm is a widely used parsing method in natural language processing applications. We introduce a variant of Earley parsing that is based on a ``delayed'' recognition of constituents. This allows us to start the recognition of a constituent only in cases in which all of its subconstituents have been found within the input string. This is particularly advantageous in several cases in which partial analysis of a constituent cannot be completed and in general in all cases of productions sharing some suffix of their right-hand sides (even for different left-hand side nonterminals). Although the two algorithms result in the same asymptotic time and space complexity, from a practical perspective our algorithm improves the time and space requirements of the original method, as shown by reported experimental results.Comment: 12 pages, 1 Postscript figure, uses psfig.tex and llncs.st

arXiv.org e-Print Archive

CiteSeerX

University of Groningen Digital Archive

Archivio istituzionale della ricerca - Università di Padova

A corpus-based study of Italian idiomatic phrases: from citation forms to 'real-life' occurrences

Author: CIGNONI L
COFFEY S
Publication venue: English & Dutch Departments, University of Liège
Publication date: 01/01/1998
Field of study

In the current paper we present a typological description of the alternative forms and variations which Italian idiomatic phrases were found to take in a corpus of contemporary written Italian. We were concerned above all with lexical variation, both regularly occurring and occasional, not with the syntactic flexibility of idioms. Flexible search parameters were used in order to locate as many variations as possible, including alternative lexical components, shortened forms of idioms, adaptation of underlying metaphors, and alternative syntactic realizations. We relate our findings to lexicographical description

Archivio della Ricerca - Università di Pisa

Representation and parsing of multiword expressions

Author
Publication venue: Language Science Press
Publication date: 01/04/2020
Field of study

This book consists of contributions related to the definition, representation and parsing of MWEs. These reflect current trends in the representation and processing of MWEs. They cover various categories of MWEs such as verbal, adverbial and nominal MWEs, various linguistic frameworks (e.g. tree-based and unification-based grammars), various languages including English, French, Modern Greek, Hebrew, Norwegian), and various applications (namely MWE detection, parsing, automatic translation) using both symbolic and statistical approaches

Directory of Open Access Books (DOAB)

Current trends

Author
Publication venue
Publication date: 01/01/2019
Field of study

Deep parsing is the fundamental process aiming at the representation of the syntactic structure of phrases and sentences. In the traditional methodology this process is based on lexicons and grammars representing roughly properties of words and interactions of words and structures in sentences. Several linguistic frameworks, such as Headdriven Phrase Structure Grammar (HPSG), Lexical Functional Grammar (LFG), Tree Adjoining Grammar (TAG), Combinatory Categorial Grammar (CCG), etc., offer different structures and combining operations for building grammar rules. These already contain mechanisms for expressing properties of Multiword Expressions (MWE), which, however, need improvement in how they account for idiosyncrasies of MWEs on the one hand and their similarities to regular structures on the other hand. This collaborative book constitutes a survey on various attempts at representing and parsing MWEs in the context of linguistic theories and applications

Institutional Repository of the Freie Universität Berlin

Idioms, non-literal language and knowledge representation

Author: van der Linden H.J.B.M.
Publication venue: Institute for Language Technology and Artifical IntelIigence, Tilburg University
Publication date: 01/01/1992
Field of study

Tilburg University Repository

Building a Treebank for Italian: a Data-driven Annotation Schema.

Author: Bosco Cristina
Daniela Vassallo
Lesmo Leonardo
Lombardo Vincenzo
Publication venue: Universitas Imelda Medan
Publication date: 01/01/2000
Field of study

Institutional Research Information System University of Turin

Multiword expressions at length and in depth

Author
Publication venue: Language Science Press
Publication date: 01/04/2020
Field of study

The annual workshop on multiword expressions takes place since 2001 in conjunction with major computational linguistics conferences and attracts the attention of an ever-growing community working on a variety of languages, linguistic phenomena and related computational processing issues. MWE 2017 took place in Valencia, Spain, and represented a vibrant panorama of the current research landscape on the computational treatment of multiword expressions, featuring many high-quality submissions. Furthermore, MWE 2017 included the first shared task on multilingual identification of verbal multiword expressions. The shared task, with extended communal work, has developed important multilingual resources and mobilised several research groups in computational linguistics worldwide. This book contains extended versions of selected papers from the workshop. Authors worked hard to include detailed explanations, broader and deeper analyses, and new exciting results, which were thoroughly reviewed by an internationally renowned committee. We hope that this distinctly joint effort will provide a meaningful and useful snapshot of the multilingual state of the art in multiword expressions modelling and processing, and will be a point point of reference for future work

Directory of Open Access Books (DOAB)

A Variant of Earley Parsing

Author: Nederhof M. J.
Satta G.
Publication venue: Springer
Publication date: 01/01/1997
Field of study

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Idioms, non-literal language and knowledge representation

Author: van der Linden H.J.B.M.
Publication venue: Institute for Language Technology and Artifical IntelIigence, Tilburg University
Publication date: 01/01/1992
Field of study

Tilburg University Repository