153 research outputs found

    Coordination, rather than pragmatics, shapes colexification when the pressure forefficiency is low.

    Get PDF
    We investigate the phenomenon of colexification, where a single word form is associated with multiple meanings. Previous research on colexification has primarily focused on empirical studies of different properties of the meanings that determine colexification, such as semantic similarity or meaning frequency. Meanwhile, little attention was paid to the word-forms’ properties, despite being the original approach advocated by Zipf. Our preregistered study examines whether word length influences word choice for colexification using a noveldyadic communication game (N = 64) and a computational model grounded in the Rational Speech Act (RSA) framework.Contrary to initial predictions, participants did not exhibit a strong preference for efficient colexification (namely colexifying multiple concepts using short words, when long alternatives are available). The results align more closely with a simpler coordination model, where dyads align on a functioning lexical convention with relatively little influence from the efficiency of that convention. Our study highlights the possibility that colexification choices are strongly determined by the pressure for coordination, with weaker influences from semantic similarity or meaning frequency. This is most likely explained by weak pressure for efficiency in our experimental design

    Phrasal prosody constrains syntactic analysis in toddlers

    Get PDF
    International audienceThis study examined whether phrasal prosody can impact toddlers' syntactic analysis. French noun-verb homophones were used to create locally ambiguous test sentences (e.g., using the homophone as a noun: [le bébé souris] [a bien mangé]-[the baby mouse] [ate well] or using it as a verb: [le bébé] [sourit à sa maman]-[the baby] [smiles to his mother], where brackets indicate prosodic phrase boundaries). Although both sentences start with the same words (le-bebe-/suʁi/), they can be disambiguated by the prosodic boundary that either directly precedes the critical word /suʁi/ when it is a verb, or directly follows it when it is a noun. Across two experiments using an intermodal preferential looking procedure, 28-month-olds (Exp. 1 and 2) and 20-month-olds (Exp. 2) listened to the beginnings of these test sentences while watching two images displayed side-by-side on a TV-screen: one associated with the noun interpretation of the ambiguous word (e.g., a mouse) and the other with the verb interpretation (e.g., a baby smiling). The results show that upon hearing the first words of these sentences, toddlers were able to correctly exploit prosodic information to access the syntactic structure of sentences, which in turn helped them to determine the syntactic category of the ambiguous word and to correctly identify its intended meaning: participants switched their eye-gaze toward the correct image based on the prosodic condition in which they heard the ambiguous target word. This provides evidence that during the first steps of language acquisition, toddlers are already able to exploit the prosodic structure of sentences to recover their syntactic structure and predict the syntactic category of upcoming words, an ability which would be extremely useful to discover the meaning of novel words

    How efficiency shapes human language

    Get PDF
    We review recent research on the burgeoning topic of how language structure is shaped by principles of efficiency for communication and learning.Work in this area has infused long-standing ideas in linguistics and psychology with new precision and methodological rigor by bringing together information theory, newly available datasets, controlled experimentation, and computational modeling.We review a number of studies that focus on phenomena ranging from the lexicon through syntactic processes, and which deploy formal tools from information theory and probability theory to understand how and why language works the way that it does.These studies show how a pervasive pressure for efficient usage guides the form of natural language and suggest a rich future for language research in connecting linguistics to cognitive psychology and mathematical theories of communication

    Words cluster phonetically beyond phonotactic regularities

    Get PDF
    Recent evidence suggests that cognitive pressures associated with language acquisition and use could affect the organization of the lexicon. On one hand, consistent with noisy channel models of language (e.g., Levy, 2008), the phonological distance between wordforms should be maximized to avoid perceptual confusability (a pressure for dispersion). On the other hand, a lexicon with high phonological regularity would be simpler to learn, remember and produce (e.g., Monaghan et al., 2011) (a pressure for clumpiness). Here we investigate wordform similarity in the lexicon, using measures of word distance (e.g., phonological neighborhood density) to ask whether there is evidence for dispersion or clumpiness of wordforms in the lexicon. We develop a novel method to compare lexicons to phonotactically-controlled baselines that provide a null hypothesis for how clumpy or sparse wordforms would be as the result of only phonotactics. Results for four languages, Dutch, English, German and French, show that the space of monomorphemic wordforms is clumpier than what would be expected by the best chance model according to a wide variety of measures: minimal pairs, average Levenshtein distance and several network properties. This suggests a fundamental drive for regularity in the lexicon that conflicts with the pressure for words to be as phonologically distinct as possible. Keywords: Linguistics; Lexical design; Communication; Phonotactic

    Word forms are structured for efficient use

    Get PDF
    Zipf famously stated that, if natural language lexicons are structured for efficient communication, the words that are used the most frequently should require the least effort. This observation explains the famous finding that the most frequent words in a language tend to be short. A related prediction is that, even within words of the same length, the most frequent word forms should be the ones that are easiest to produce and understand. Using orthographics as a proxy for phonetics, we test this hypothesis using corpora of 96 languages from Wikipedia. We find that, across a variety of languages and language families and controlling for length, the most frequent forms in a language tend to be more orthographically well‐formed and have more orthographic neighbors than less frequent forms. We interpret this result as evidence that lexicons are structured by language usage pressures to facilitate efficient communication. Keywords: Lexicon; Word frequency; Phonology; Communication; EfficiencyNational Science Foundation (Grant ES/N0174041/1
    corecore