Wrapper syntax for example-based machine translation
TransBooster is a wrapper technology designed to improve the performance of wide-coverage machine translation
systems. Using linguistically motivated syntactic information, it automatically decomposes source language sentences into shorter and syntactically simpler chunks, and recomposes their translation to form target language sentences. This generally improves both the word order
and lexical selection of the translation. To date, TransBooster has been successfully applied to rule-based MT, statistical MT, and multi-engine MT. This paper presents
the application of TransBooster to Example-Based Machine Translation. In an experiment conducted on test sets
extracted from Europarl and the Penn II Treebank, we show that our method can raise the BLEU score by up to 3.8% relative
to the EBMT baseline. We also conduct a manual evaluation, showing that TransBooster-enhanced EBMT produces
better output than the baseline EBMT in terms of fluency in 55% of the cases and in terms of accuracy in 53% of the
cases.
Automatic Extraction of Subcategorization from Corpora
We describe a novel technique and implemented system for constructing a
subcategorization dictionary from textual corpora. Each dictionary entry
encodes the relative frequency of occurrence of a comprehensive set of
subcategorization classes for English. An initial experiment, on a sample of 14
verbs which exhibit multiple complementation patterns, demonstrates that the
technique achieves accuracy comparable to previous approaches, which are all
limited to a highly restricted set of subcategorization classes. We also
demonstrate that a subcategorization dictionary built with the system improves
the accuracy of a parser by an appreciable amount.
Comment: 8 pages; requires aclap.sty. To appear in ANLP-9
Verbal valency in Biblical Hebrew: an analysis of the valency of `br
A Frobenius Algebraic Analysis for Parasitic Gaps
The interpretation of parasitic gaps is an ostensible case of non-linearity
in natural language composition. Existing categorial analyses, both in the
typelogical and in the combinatory traditions, rely on explicit forms of
syntactic copying. We identify two types of parasitic gapping where the
duplication of semantic content can be confined to the lexicon. Parasitic gaps
in adjuncts are analysed as forms of generalized coordination with a
polymorphic type schema for the head of the adjunct phrase. For parasitic gaps
affecting arguments of the same predicate, the polymorphism is associated with
the lexical item that introduces the primary gap. Our analysis is formulated in
terms of Lambek calculus extended with structural control modalities. A
compositional translation relates syntactic types and derivations to the
interpreting compact closed category of finite dimensional vector spaces and
linear maps with Frobenius algebras over it. When interpreted over the
necessary semantic spaces, the Frobenius algebras provide the tools to model
the proposed instances of lexical polymorphism.
Comment: SemSpace 2019, to appear in Journal of Applied Logic
Large and noisy vs small and reliable: combining 2 types of corpora for adjective valence extraction
This work investigates the possibility of combining two different types of corpora to build a valence lexicon for French adjectives. We complete adjectival frames extracted from a treebank with statistical cues computed from a large, automatically parsed corpus. This experiment shows how linguistic knowledge and a large amount of annotated data can be used in a complementary manner.
Part-time Faculty in Higher Education: A Selected Annotated Bibliography
At this writing (Fall 2011), two-thirds of the faculty in higher education are contingent, whether part-time or full-time. Only one-third of the faculty is tenured or on the tenure track. This selected, annotated bibliography is organized by year of publication, from 1977 to 2010. (An earlier version was published in 2008.) The purpose of this publication is to facilitate understanding of the meaning and implications of this major change in the structure of higher education. The annotations in the bibliography were written from the perspective of a part-time faculty member, unlike most of the literature, which is written from a management perspective.