
    Identification of context markers for Russian nouns

    Proceedings of the 18th Nordic Conference of Computational Linguistics NODALIDA 2011. Editors: Bolette Sandford Pedersen, Gunta Nešpore and Inguna Skadiņa. NEALT Proceedings Series, Vol. 11 (2011), 344-347. © 2011 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/1695

    Disambiguation of Taxonomy Markers in Context: Russian Nouns

    Proceedings of the 17th Nordic Conference of Computational Linguistics NODALIDA 2009. Editors: Kristiina Jokinen and Eckhard Bick. NEALT Proceedings Series, Vol. 4 (2009), 111-117. © 2009 The editors and contributors. Published by Northern European Association for Language Technology (NEALT) http://omilia.uio.no/nealt . Electronically published at Tartu University Library (Estonia) http://hdl.handle.net/10062/9206

    The linguistics of gender

    This chapter explores grammatical gender as a linguistic phenomenon. First, I define gender in terms of agreement, and look at the parts of speech that can take gender agreement. Because it relates to assumptions underlying much psycholinguistic gender research, I also examine the reasons why gender systems are thought to emerge, change, and disappear. Then, I describe the gender system of Dutch. The frequent confusion about the number of genders in Dutch will be resolved by looking at the history of the system, and the role of pronominal reference therein. In addition, I report on three lexical-statistical analyses of the distribution of genders in the language. After having dealt with Dutch, I look at whether the genders of Dutch and other languages are more or less randomly assigned, or whether there is some system to it. In contrast to what many people think, regularities do indeed exist. Native speakers could in principle exploit such regularities to compute rather than memorize gender, at least in part. Although this should be taken into account as a possibility, I will also argue that it is by no means a necessary implication.

    First verbs : On the way to mini-paradigms

    This 18th issue of ZAS-Papers in Linguistics consists of papers on the development of verb acquisition in 9 languages from the very early stages up to the onset of paradigm construction. Each of the 10 papers deals with first-language developmental processes in one or two children studied via longitudinal data. The languages involved are French, Spanish, Russian, Croatian, Lithuanian, Finnish, English and German. For German, two different varieties are examined, one from Berlin and one from Vienna. All papers are based on presentations at the workshop 'Early verbs: On the way to mini-paradigms' held at the ZAS (Berlin) on the 30./31. of September 2000. This workshop brought to a close the first phase of cooperation between two projects on language acquisition, which started in October 1999: a) the project on "Syntaktische Konsequenzen des Morphologieerwerbs" at the ZAS (Berlin), headed by Juergen Weissenborn and Ewald Lang and financially supported by the Deutsche Forschungsgemeinschaft, and b) the international "Crosslinguistic Project on Pre- and Protomorphology in Language Acquisition", coordinated by Wolfgang U. Dressler on behalf of the Austrian Academy of Sciences.

    'Most' vs. 'the most' in languages where 'the more' means 'most'

    This paper focuses on languages in which a superlative interpretation is typically indicated merely by a combination of a definiteness marker with a comparative marker, including French, Spanish, Italian, Romanian, and Greek ('DEF+CMP languages'). Despite ostensibly using definiteness markers to form the superlative, superlatives are not always definite-marked in these languages, and the distribution of definiteness-marking varies from language to language. To account for the cross-linguistic variation, we identify conflicting pressures that all of the languages in consideration may be subject to, and suggest that different languages prioritize differently in the resolution of these conflicts. What these languages have in common, we suggest, is a mechanism of Definite Null Instantiation for the degree-type standard argument of the comparative. Among the parameters along which languages are proposed to differ is the relative importance of marking uniqueness vs. avoiding determiners with predicates of entities that are not individuals. Accepted manuscript: http://eecoppock.info/CoppockStrand-DAL.pdf

    The development of sentence-interpretation strategies in monolingual German-learning children with and without specific language impairment

    Previous research on sentence comprehension conducted with German-learning children has concentrated on the role of case marking and word order in typically developing children. This paper compares the performance of German-learning children with language impairment (aged 4-6 years) and without language impairment (aged 2-6, 8-9 years) in two experiments that systematically vary the cues animacy, case marking, word order, and subject-verb agreement. The two experiments differ with regard to the choice of case marking: in the first it is distinct but in the second it is neutralized. The theoretical framework is the competition model developed by Bates and MacWhinney and their collaborators, a variant of the parallel distributed processing models. It is hypothesized that children of either population first appreciate the cue animacy that can be processed locally, that is, "on the spot," before they turn to more distributed cues leading ultimately up to subject-verb agreement, which presupposes the comparison of various constituents before an interpretation can be established. Thus, agreement is more "costly" in processing than animacy or the (more) local cue initial NP. In experiment I with unambiguous case markers it is shown that the typically developing children proceed from animacy to the nominative (predominantly in coalition with the initial NP) to agreement, while in the second experiment with ambiguous case markers these children turn from animacy to the initial NP and then to agreement. The impaired children also progress from local to distributed cues. Yet, in contrast to the control group, they do not acknowledge the nominative in coalition with the initial NP in the first experiment but only in support of agreement. However, although they do not seem to appreciate distinct case markers to any large extent in the first experiment, they are irritated if such distinctions are lacking: in experiment II all impaired children turn to animacy (some in coalition with the initial NP and/or particular word orders). In the discussion, the relationship between short-term memory and processing as well as the relationship between production and comprehension of case markers and agreement are addressed. Further research is needed to explore in more detail "cue costs" in sentence comprehension.

    Lexical typology : a programmatic sketch

    The present paper is an attempt to lay the foundation for Lexical Typology as a new kind of linguistic typology. The goal of Lexical Typology is to investigate crosslinguistically significant patterns of interaction between lexicon and grammar.

    Token-based typology and word order entropy: A study based on universal dependencies

    The present paper discusses the benefits and challenges of token-based typology, which takes into account the frequencies of words and constructions in language use. This approach makes it possible to introduce new criteria for language classification, which would be difficult or impossible to achieve with the traditional, type-based approach. This point is illustrated by several quantitative studies of word order variation, which can be measured as entropy at different levels of granularity. I argue that this variation can be explained by general functional mechanisms and pressures, which manifest themselves in language use, such as optimization of processing (including avoidance of ambiguity) and grammaticalization of predictable units occurring in chunks. The case studies are based on multilingual corpora, which have been parsed using the Universal Dependencies annotation scheme.