28,035 research outputs found
Syntactic Topic Models
The syntactic topic model (STM) is a Bayesian nonparametric model of language
that discovers latent distributions of words (topics) that are both
semantically and syntactically coherent. The STM models dependency parsed
corpora where sentences are grouped into documents. It assumes that each word
is drawn from a latent topic chosen by combining document-level features and
the local syntactic context. Each document has a distribution over latent
topics, as in topic models, which provides the semantic consistency. Each
element in the dependency parse tree also has a distribution over the topics of
its children, as in latent-state syntax models, which provides the syntactic
consistency. These distributions are convolved so that the topic of each word
is likely under both its document and syntactic context. We derive a fast
posterior inference algorithm based on variational methods. We report
qualitative and quantitative studies on both synthetic data and hand-parsed
documents. We show that the STM is a more predictive model of language than
current models based only on syntax or only on topics
Papers on predicative constructions : Proceedings of the workshop on secundary predication, October 16-17, 2000, Berlin
This volume presents a collection of papers touching on various issues concerning the syntax and semantics of predicative constructions.
A hot topic in the study of predicative copula constructions, with direct implications for the treatment of he (how many he's do we need?), and wider implications for the theories of predication, event-based semantics and aspect, is the nature and source of the situation argument. Closer examination of copula-less predications is becoming increasingly relevant to all these issues, as is clearly illustrated by the present collection
On past participle agreement in transitive clauses in French
This paper provides a Minimalist analysis of past participle agreement in French in transitive
clauses. Our account posits that the head v of vP in such structures carries an (accusativeassigning) structural case feature which may apply (with or without concomitant agreement)
to case-mark a clause-mate object, the subject of a defective complement clause, or an
intermediate copy of a preposed subject in spec-CP. In structures where a goal is extracted
from vP (e.g. via wh-movement) v also carries an edge feature, and may also carry a
specificity feature and a set of (number and gender) agreement features. We show how these
assumptions account for agreement of a participle with a preposed specific clausemate object
or defective-clause subject, and for the absence of agreement with an embedded object, with
the complement of an impersonal verb, and with the subject of an embedded (finite or nonfinite) CP complement. We also argue that the absence of agreement marking (in expected
contexts) on the participles faitmade and laissélet in infinitive structures is essentially viral in
nature. Finally, we claim that obligatory participle agreement with reflexive and reciprocal
objects arises because the derivation of reflexives involves A-movement and concomitant
agreement
Neural connectivity in syntactic movement processing
Linguistic theory suggests non-canonical sentences subvert the dominant agent-verb-theme order in English via displacement of sentence constituents to argument (NP-movement) or non-argument positions (wh-movement). Both processes have been associated with the left inferior frontal gyrus and posterior superior temporal gyrus, but differences in neural activity and connectivity between movement types have not been investigated. In the current study, functional magnetic resonance imaging data were acquired from 21 adult participants during an auditory sentence-picture verification task using passive and active sentences contrasted to isolate NP-movement, and object- and subject-cleft sentences contrasted to isolate wh-movement. Then, functional magnetic resonance imaging data from regions common to both movement types were entered into a dynamic causal modeling analysis to examine effective connectivity for wh-movement and NP-movement. Results showed greater left inferior frontal gyrus activation for Wh > NP-movement, but no activation for NP > Wh-movement. Both types of movement elicited activity in the opercular part of the left inferior frontal gyrus, left posterior superior temporal gyrus, and left medial superior frontal gyrus. The dynamic causal modeling analyses indicated that neither movement type significantly modulated the connection from the left inferior frontal gyrus to the left posterior superior temporal gyrus, nor vice-versa, suggesting no connectivity differences between wh- and NP-movement. These findings support the idea that increased complexity of wh-structures, compared to sentences with NP-movement, requires greater engagement of cognitive resources via increased neural activity in the left inferior frontal gyrus, but both movement types engage similar neural networks.This work was supported by the NIH-NIDCD, Clinical Research Center Grant, P50DC012283 (PI: CT), and the Graduate Research Grant and School of Communication Graduate Ignition Grant from Northwestern University (awarded to EE). (P50DC012283 - NIH-NIDCD, Clinical Research Center Grant; Graduate Research Grant and School of Communication Graduate Ignition Grant from Northwestern University)Published versio
Input effects on the acquisition of a novel phrasal construction in five year olds
The present experiments demonstrate that children as young as five years old (M = 5;2) generalize beyond their input on the basis of minimal exposure to a novel argument structure construction. The novel construction that was used involved a non-English phrasal pattern: VN1N2, paired with a novel abstract meaning: N2 approaches N1. At the same time, we find that children are keenly sensitive to the input: they show knowledge of the construction after a single day of exposure but this grows stronger after three days; also, children generalize more readily to new verbs when the input contains more than one verb
Learning Language from a Large (Unannotated) Corpus
A novel approach to the fully automated, unsupervised extraction of
dependency grammars and associated syntax-to-semantic-relationship mappings
from large text corpora is described. The suggested approach builds on the
authors' prior work with the Link Grammar, RelEx and OpenCog systems, as well
as on a number of prior papers and approaches from the statistical language
learning literature. If successful, this approach would enable the mining of
all the information needed to power a natural language comprehension and
generation system, directly from a large, unannotated corpus.Comment: 29 pages, 5 figures, research proposa
The Composite Nature of Interlanguage as a Developing System
This paper explores the nature of interlanguage (IL) as a developing system with a focus on the abstract lexical structure underlying IL construction. The developing system of IL is assumed to be ‘composite’ in that in second language acquisition (SLA) several linguistic systems are in contact, each of which may contribute different amounts to the developing system. The lexical structure is assumed to be ‘abstract’ in that the mental lexicon contains abstract elements called ‘lemmas’, which contain information about individual lexemes, and lemmas in the bilingual mental lexicon are language-specific and are in contact in IL production. Based on the research findings, it concludes that language transfer in IL production should be understood as lemma transfer of the learner's first language (L1) lexical structure at three abstract levels: lexical-conceptual structure, predicate-argument structure, and morphological realization patterns, and IL construction is driven by an incompletely acquired abstract lexical structure of a target language (TL) item
Parameterising Germanic Ditransitive Variation: A Historical-Comparative Study
This dissertation investigates the interplay of morphology and syntax in generating surface complexity and the universality of argument structure by analysing recipient ditransitives in Germanic. The main claim of the dissertation is that all recipients in Germanic are introduced as dative PPs in the specifier of an applicative phrase. This conclusion supports a strong version of Baker’s UTAH hypothesis, namely that there is no variation between natural languages in argument structure and that all surface variation is derived from transformations on a uniform underlying structure.
In addition to arguing for the base generated structure of recipient ditransitives, this dissertation also explores transformations that apply to the base structure and show how these transformations are able to account for the surface variation seen both synchronically and diachronically in Germanic. Morphological variation in the form of allomorphy in the realisation of the dative P head is argued to cause the variation seen in Dative Shift (e.g. “John gave Mary the book” vs “John gave the book to Mary”). In addition to the morphological variation, languages also varied as to the availability of different syntactic transformations. For active sentence, the main syntactic transformation is VP-internal scrambling, which moves the theme over the recipient to generate theme–recipient word orders (e.g., “John gave the book to Mary”). Also, pronominal cliticisation can effect the morphological realisation of dative case. In the passive, P-incorporation is argued to license dative-to-nominative recipient subject raising. Theme passivisation is argued to be licensed by a number of different syntactic methods, including relativised minimality with respect to the PP/DP distinction.
The main original contributions of this dissertation are: the complete syntactic framework presented here, a collation of typological data from across the Germanic languages, and novel data and methods in historical syntax
- …