Search CORE

1,648 research outputs found

How flexible is constituent order in the midfield of German subordinate clauses?: A corpus study revealing unexpected rigidity

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2004
Field of study

Comparing linguistic judgments and corpus frequencies as windows on grammatical competence: A study of argument linearization in German clauses

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2008
Field of study

We present an overview of several corpus studies we carried out into the frequencies of argument NP orderings in the midﬁeld of subordinate and main clauses of German. Comparing the corpus frequencies with grammaticality ratings published by Keller’s (2000), we observe a “grammaticality–frequency gap”: Quite a few argument orderings with zero corpus frequency are nevertheless assigned medium–range grammaticality ratings. We propose an explanation in terms of a two-factor theory. First, we hypothesize that the grammatical induction component needs a sufficient number of exposures to a syntactic pattern to incorporate it into its repertoire of more or less stable rules of grammar. Moderately to highly frequent argument NP orderings are likely have attained this status, but not their zero-frequency counterparts. This is why the latter argument sequences cannot be produced by the grammatical encoder and are absent from the corpora. Secondly, we assumed that an extraneous (nonlinguistic) judgment process biases the ratings of moderately grammatical linear order patterns: Confronted with such structures, the informants produce their own “ideal delivery” variant of the to-be-rated target sentence and evaluate the similarity between the two versions. A high similarity score yielded by this judgment then exerts a positive bias on the grammaticality rating—a score that should not be mistaken for an authentic grammaticality rating. We conclude that, at least in the linearization domain studied here, the goal of gaining a clear view of the internal grammar of language users is best served by a combined strategy in which grammar rules are founded on structures that elicit moderate to high grammaticality ratings and attain at least moderate usage frequencies

MPG.PuRe

A generation-oriented workbench for performance grammar: Capturing linear order variability in German and Dutch

Author: Harbusch K.
Kempen G.
Koch U.
Van Breugel C.
Publication venue
Publication date: 01/01/2006
Field of study

We describe a generation-oriented workbench for the Performance Grammar (PG) formalism, highlighting the treatment of certain word order and movement constraints in Dutch and German. PG enables a simple and uniform treatment of a heterogeneous collection of linear order phenomena in the domain of verb constructions (variably known as Cross-serial Dependencies, Verb Raising, Clause Union, Extraposition, Third Construction, Particle Hopping, etc.). The central data structures enabling this feature are clausal “topologies”: one-dimensional arrays associated with clauses, whose cells (“slots”) provide landing sites for the constituents of the clause. Movement operations are enabled by unification of lateral slots of topologies at adjacent levels of the clause hierarchy. The PGW generator assists the grammar developer in testing whether the implemented syntactic knowledge allows all and only the well-formed permutations of constituents

MPG.PuRe

A generation-oriented workbench for performance grammar: Capturing linear order variability in German and Dutch

Author: Harbusch K.
Kempen G.
Koch U.
Van Breugel C.
Publication venue
Publication date: 01/01/2006
Field of study

MPG.PuRe

Dust, Ice, and Gas In Time (DIGIT) Herschel program first results: A full PACS-SED scan of the gas line emission in protostar DK Chamaeleontis

Author: Blake G. A.
Pontoppidan K. M.
van Kempen T. A.
Publication venue: 'EDP Sciences'
Publication date: 01/07/2010
Field of study

Aims. We aim to study the composition and energetics of the circumstellar material of DK Cha, an intermediate-mass star in transition from an embedded configuration to a star plus disk stage, during this pivotal stage of its evolution. Methods. Using the range scan mode of PACS on the Herschel Space Observatory, we obtained a spectrum of DK Cha from 55 to 210 μm as part of the DIGIT key program. Results. Almost 50 molecular and atomic lines were detected, many more than the 7 lines detected in ISO-LWS. Nearly the entire ladder of CO from J = 14–13 to 38–37 (E_u/k = 4080 K), water from levels as excited as J_(K−1 K+1) = 7_(07) (E_u/k = 843 K), and OH lines up to E_u/k = 290 K were detected. Conclusions. The continuum emission in our PACS SED scan matches the flux expected by a model consisting of a star, a surrounding disk of 0.03 M_⊙, and an envelope of a similar mass, supporting the suggestion that the object is emerging from its main accretion stage. Molecular, atomic, and ionic emission lines in the far-infrared reveal the outflow’s influence on the envelope. The inferred hot gas may be photon-heated, but some emission may be caused by C-shocks in the walls of the outflow cavity

Caltech Authors

A corpus study into word order variation in German subordinate clauses: Animacy affects linearization independently of function assignment

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2003
Field of study

MPG.PuRe

ELLEIPO: A module that computes coordinative ellipsis for language generators that don't

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2006
Field of study

Many current sentence generators lack the ability to compute elliptical versions of coordinated clauses in accordance with the rules for Gapping, Forward and Backward Conjunction Reduction, and SGF (Subject Gap in clauses with Finite/ Fronted verb). We describe a module (implemented in JAVA, with German and Dutch as target languages) that takes non-elliptical coordinated clauses as input and returns all reduced versions licensed by coordinative ellipsis. It is loosely based on a new psycholinguistic theory of coordinative ellipsis proposed by Kempen. In this theory, coordinative ellipsis is not supposed to result from the application of declarative grammar rules for clause formation but from a procedural component that interacts with the sentence generator and may block the overt expression of certain constituents

MPG.PuRe

How flexible is constituent order in the midfield of German subordinate clauses? A corpus study revealing unexpected rigidity

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2004
Field of study

MPG.PuRe

Automatic online writing support for L2 learners of German through output monitoring by a natural-language paraphrase generator

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2011
Field of study

Students who are learning to write in a foreign language, often want feedback on the grammatical quality of the sentences they produce. The usual NLP approach to this problem is based on parsing student-generated text. Here, we propose a generation-based ap- proach aiming at preventing errors ("scaffolding"). In our ICALL system, the student constructs sentences by composing syntactic trees out of lexically anchored "treelets" via a graphical drag & drop user interface. A natural-language generator computes all possible grammatically well-formed sentences entailed by the student-composed tree. It provides positive feedback if the student-composed tree belongs to the well-formed set, and negative feedback otherwise. If so requested by the student, it can substantiate the positive or negative feedback based on a comparison between the student-composed tree and its own trees (informative feedback on demand). In case of negative feedback, the system refuses to build the structure attempted by the student. Frequently occurring errors are handled in terms of "malrules." The system we describe is a prototype (implemented in JAVA and C++) which can be parameterized with respect to L1 and L2, the size of the lexicon, and the level of detail of the visually presented grammatical structures

MPG.PuRe

Word order scrambling as a consequence of incremental sentence production

Author: Harbusch K.
Kempen G.
Publication venue
Publication date: 01/01/2003
Field of study

MPG.PuRe