Search CORE

eScholarship - University of California

Recommended from our members

How well do models of cross-situational word learning account for the learning of ambiguous words?

Author: Dautriche Isabelle
Regan Sophie
Srinivasan Mahesh
Publication venue: eScholarship, University of California
Publication date: 01/01/2025
Field of study

Existing theories of word learning largely focus on a learner's ability to learn a single meaning for a word despite the fact that many words have multiple meanings. Several computational models of cross-situational word learning have been proposed to explain how words are learned, but it is unknown to what extent they can learn ambiguous words with multiple meanings. Here, we present an experiment showing that adult learners are able to learn multiple meanings of novel ambiguous words in a cross-situational word learning paradigm, and are especially good at doing so when the meanings of the words are related (polysemous) rather than unrelated (homophonous). We evaluated the ability of ten different computational models of cross-situational word learning to explain the empirical data, and none were able to learn the ambiguous words as successfully as the adult learners. Moreover, because these computational models do not represent any semantic information, they are in principle unable to replicate the key difference between polysemous and homophonous word learning found in the study

Recommended from our members

Subjective confidence influences word learning in a cross-situational statistical learning task

Author: Dautriche Isabelle
Rabagliati Hugh
Smith Kenny
Publication venue: 'Elsevier BV'
Publication date: 01/01/2021
Field of study

International audienceLearning is often accompanied by a subjective sense of confidence in one’s knowledge, a feeling of knowing what you know and how well you know it. Subjective confidence has been shown to guide learning in other domains, but has received little attention so far in the word learning literature. Across three word learning experiments, we investigated whether and how a sense of confidence in having acquired a word meaning influences the word learning process itself. First, we show evidence for a confirmation bias during word learning in a cross-situational statistical learning task: Learners who are highly confident they know the meaning of a word are more likely to persist in their belief than learners who are not, even after observing objective evidence disconfirming their belief. Second, we show that subjective confidence in a word meaning modulates inferential processes based on that word, affecting learning over the whole lexicon: Learners who hold high confidence in a word meaning are more likely to use that word to make mutual exclusivity inferences about the meaning of other words. We conclude that confidence influences word learning by modulating both information selection processes and inferential processes and discuss the implications of these results for word learning models

Coordination, rather than pragmatics, shapes colexification when the pressure forefficiency is low.

Author: Dautriche Isabelle
Koshevoy Alexey
Morin Olivier
Smith Kenny
Publication venue: eScholarship University of California
Publication date: 01/01/2024
Field of study

We investigate the phenomenon of colexification, where a single word form is associated with multiple meanings. Previous research on colexification has primarily focused on empirical studies of different properties of the meanings that determine colexification, such as semantic similarity or meaning frequency. Meanwhile, little attention was paid to the word-forms’ properties, despite being the original approach advocated by Zipf. Our preregistered study examines whether word length influences word choice for colexification using a noveldyadic communication game (N = 64) and a computational model grounded in the Rational Speech Act (RSA) framework.Contrary to initial predictions, participants did not exhibit a strong preference for efficient colexification (namely colexifying multiple concepts using short words, when long alternatives are available). The results align more closely with a simpler coordination model, where dyads align on a functioning lexical convention with relatively little influence from the efficiency of that convention. Our study highlights the possibility that colexification choices are strongly determined by the pressure for coordination, with weaker influences from semantic similarity or meaning frequency. This is most likely explained by weak pressure for efficiency in our experimental design

Phrasal prosody constrains syntactic analysis in toddlers

Author: Christophe Anne
Dautriche Isabelle
de Carvalho Alex
Lin Isabelle
Publication venue
Publication date: 01/06/2017
Field of study

International audienceThis study examined whether phrasal prosody can impact toddlers' syntactic analysis. French noun-verb homophones were used to create locally ambiguous test sentences (e.g., using the homophone as a noun: [le bébé souris] [a bien mangé]-[the baby mouse] [ate well] or using it as a verb: [le bébé] [sourit à sa maman]-[the baby] [smiles to his mother], where brackets indicate prosodic phrase boundaries). Although both sentences start with the same words (le-bebe-/suʁi/), they can be disambiguated by the prosodic boundary that either directly precedes the critical word /suʁi/ when it is a verb, or directly follows it when it is a noun. Across two experiments using an intermodal preferential looking procedure, 28-month-olds (Exp. 1 and 2) and 20-month-olds (Exp. 2) listened to the beginnings of these test sentences while watching two images displayed side-by-side on a TV-screen: one associated with the noun interpretation of the ambiguous word (e.g., a mouse) and the other with the verb interpretation (e.g., a baby smiling). The results show that upon hearing the first words of these sentences, toddlers were able to correctly exploit prosodic information to access the syntactic structure of sentences, which in turn helped them to determine the syntactic category of the ambiguous word and to correctly identify its intended meaning: participants switched their eye-gaze toward the correct image based on the prosodic condition in which they heard the ambiguous target word. This provides evidence that during the first steps of language acquisition, toddlers are already able to exploit the prosodic structure of sentences to recover their syntactic structure and predict the syntactic category of upcoming words, an ability which would be extremely useful to discover the meaning of novel words

Portail HAL EHESS (École des hautes études en sciences sociales)

Toddlers’ eye-movements reflect (un)certainty about their knowledge of a word’s meaning

Author: Dautriche Isabelle
Goupil Louise
Rabagliati Hugh
Smith Kenneth
Publication venue
Publication date: 01/01/2018
Field of study

How efficiency shapes human language

Author: Bergen Leon
Dautriche Isabelle
Futrell Richard
Gibson Edward
Levy Roger
Mahowald Kyle
Piandadosi Steven T.
Publication venue: 'Elsevier BV'
Publication date: 18/04/2019
Field of study

We review recent research on the burgeoning topic of how language structure is shaped by principles of efficiency for communication and learning.Work in this area has infused long-standing ideas in linguistics and psychology with new precision and methodological rigor by bringing together information theory, newly available datasets, controlled experimentation, and computational modeling.We review a number of studies that focus on phenomena ranging from the lexicon through syntactic processes, and which deploy formal tools from information theory and probability theory to understand how and why language works the way that it does.These studies show how a pervasive pressure for efficient usage guides the form of natural language and suggest a rich future for language research in connecting linguistics to cognitive psychology and mathematical theories of communication

Words cluster phonetically beyond phonotactic regularities

Author: Albright
Altmann
Altvater-Mackensen
Anne Christophe
Arbesman
Arnold
Aylett
Aylett
Baayen
Bailey
Barabási
Bell
Bergen
Bloomfield
Borovsky
Cauvet
Chomsky
Coady
Cole
Dautriche
Dautriche
de Saussure
Dell
Dell
Edward Gibson
Ernestus
Ferrer-i Cancho
Fitneva
Flemming
Flemming
Gafos
Gahl
Gahl
Gahl
Gahl
Gibson
Goldrick
Goldsmith
Hayes
Hinton
Hockett
Hockett
Hockett
Howes
Imai
Imai
Isabelle Dautriche
Jusczyk
Kelly
Kemps
Kirby
Kirby
Kirby
Kyle Mahowald
Lane
Levy
Liljencrants
Luce
Magnuson
Mandelbrot
Manning
Martin
Martinet
Miller
Miller
Monaghan
Monaghan
New
Newman
Ngon
Nielsen
Nygaard
Piantadosi
Pluymaekers
Raymond
Sadat
Sapir
Schwartz
Shannon
Simon
Smith
Stemberger
Steven T. Piantadosi
Storkel
Storkel
Storkel
Storkel
Storkel
Storkel
Strand
Swingley
Tamariz
Van Son
Viebahn
Vitevitch
Vitevitch
Vitevitch
Vitevitch
Vitevitch
Vitevitch
Wasserman
Watts
Wedel
Wedel
Younger
Publication venue: 'Elsevier BV'
Publication date: 01/01/2017
Field of study

Recent evidence suggests that cognitive pressures associated with language acquisition and use could affect the organization of the lexicon. On one hand, consistent with noisy channel models of language (e.g., Levy, 2008), the phonological distance between wordforms should be maximized to avoid perceptual confusability (a pressure for dispersion). On the other hand, a lexicon with high phonological regularity would be simpler to learn, remember and produce (e.g., Monaghan et al., 2011) (a pressure for clumpiness). Here we investigate wordform similarity in the lexicon, using measures of word distance (e.g., phonological neighborhood density) to ask whether there is evidence for dispersion or clumpiness of wordforms in the lexicon. We develop a novel method to compare lexicons to phonotactically-controlled baselines that provide a null hypothesis for how clumpy or sparse wordforms would be as the result of only phonotactics. Results for four languages, Dutch, English, German and French, show that the space of monomorphemic wordforms is clumpier than what would be expected by the best chance model according to a wide variety of measures: minimal pairs, average Levenshtein distance and several network properties. This suggests a fundamental drive for regularity in the lexicon that conflicts with the pressure for words to be as phonologically distinct as possible. Keywords: Linguistics; Lexical design; Communication; Phonotactic

DSpace@MIT

Portail HAL EHESS (École des hautes études en sciences sociales)

Word forms are structured for efficient use

Author: Atkinson
Baayen
Beddor
Bergem
Brown
Brysbaert
Chen
Coady
Dautriche
Dautriche
Edward Gibson
Fedzechkina
Ferreira
Ferrer-i-Cancho
Frank
Frauenfelder
Gahl
Hills
Hume
Isabelle Dautriche
Jaeger
Jaeger
Jusczyk
Kanwal
Kawasaki
Kyle Mahowald
Landauer
Levy
Lindblom
Lindblom
Lindblom
Luce
Magnuson
Mahowald
Manin
New
Ngon
Ohala
Pate
Piantadosi
Piantadosi
Piantadosi
Sadat
Shannon
Smith
Stemberger
Steven T. Piantadosi
Storkel
Storkel
Storkel
Storkel
Storkel
Swingley
Vitevitch
Vitevitch
Vitevitch
Vitevitch
Zipf
Zipf
Publication venue: 'Wiley'
Publication date: 01/08/2018
Field of study

Zipf famously stated that, if natural language lexicons are structured for efficient communication, the words that are used the most frequently should require the least effort. This observation explains the famous finding that the most frequent words in a language tend to be short. A related prediction is that, even within words of the same length, the most frequent word forms should be the ones that are easiest to produce and understand. Using orthographics as a proxy for phonetics, we test this hypothesis using corpora of 96 languages from Wikipedia. We find that, across a variety of languages and language families and controlling for length, the most frequent forms in a language tend to be more orthographically well‐formed and have more orthographic neighbors than less frequent forms. We interpret this result as evidence that lexicons are structured by language usage pressures to facilitate efficient communication. Keywords: Lexicon; Word frequency; Phonology; Communication; EfficiencyNational Science Foundation (Grant ES/N0174041/1

DSpace@MIT