Search CORE

8 research outputs found

Evidence of Human-Like Visual-Linguistic Integration in Multimodal Large Language Models During Predictive Language Processing

Author: DEstalenx Quitterie Lacome
Edwards Christopher
Kewenig Viktor
Rechardt Akilles
Skipper Jeremy I
Vigliocco Gabriella
Publication venue
Publication date: 04/09/2023
Field of study

The advanced language processing abilities of large language models (LLMs) have stimulated debate over their capacity to replicate human-like cognitive processes. One differentiating factor between language processing in LLMs and humans is that language input is often grounded in several perceptual modalities, whereas most LLMs process solely text-based information. Multimodal grounding allows humans to integrate - e.g. visual context with linguistic information and thereby place constraints on the space of upcoming words, reducing cognitive load and improving comprehension. Recent multimodal LLMs (mLLMs) combine a visual-linguistic embedding space with a transformer type attention mechanism for next-word prediction. Here we ask whether predictive language processing based on multimodal input in mLLMs aligns with humans. Two-hundred participants watched short audio-visual clips and estimated predictability of an upcoming verb or noun. The same clips were processed by the mLLM CLIP, with predictability scores based on comparing image and text feature vectors. Eye-tracking was used to estimate what visual features participants attended to, and CLIP's visual attention weights were recorded. We find that alignment of predictability scores was driven by multimodality of CLIP (no alignment for a unimodal state-of-the-art LLM) and by the attention mechanism (no alignment when attention weights were perturbated or when the same input was fed to a multimodal model without attention). We further find a significant spatial overlap between CLIP's visual attention weights and human eye-tracking data. Results suggest that comparable processes of integrating multimodal information, guided by attention to relevant visual features, supports predictive language processing in mLLMs and humans.Comment: 13 pages, 4 figures, submitted to journa

arXiv.org e-Print Archive

Language development beyond the here‐and‐now: Iconicity and displacement in child‐directed communication

Author: Brieke Ricarda
Donnellan Ed
Grzyb Beata
Gu Yan
Kewenig Viktor
Marshall Chloe
Motamedi Yasamin
Murgiano Margherita
Perniss Pamela
Vigliocco Gabriella
Wonnacott Elizabeth
Publication venue: Wiley
Publication date: 26/02/2024
Field of study

Most language use is displaced, referring to past, future, or hypothetical events, posing the challenge of how children learn what words refer to when the referent is not physically available. One possibility is that iconic cues that imagistically evoke properties of absent referents support learning when referents are displaced. In an audio‐visual corpus of caregiver–child dyads, English‐speaking caregivers interacted with their children (N = 71, 24–58 months) in contexts in which the objects talked about were either familiar or unfamiliar to the child, and either physically present or displaced. The analysis of the range of vocal, manual, and looking behaviors caregivers produced suggests that caregivers used iconic cues especially in displaced contexts and for unfamiliar objects, using other cues when objects were present

University of Essex Research Repository

Oxford University Research Archive

Intensification of Antiretroviral Therapy with a CCR5 Antagonist in Patients with Chronic HIV-1 Infection: Effect on T Cells Latently Infected

Objective: The primary objective was to assess the effect of MVC intensification on latently infected CD4+ T cells in chronically HIV-1-infected patients receiving antiretroviral therapy. Methods: We performed an open-label pilot phase II clinical trial involving chronically HIV-1-infected patients receiving stable antiretroviral therapy whose regimen was intensified with 48 weeks of maraviroc therapy. We analyzed the latent reservoir, the residual viremia and episomal 2LTR DNA to examine the relationship between these measures and the HIV-1 latent reservoir, immune activation, lymphocyte subsets (including effector and central memory T cells), and markers associated with bacterial translocation. Results: Overall a non significant reduction in the size of the latent reservoir was found (p = 0.068). A mean reduction of 1.82 IUPM was observed in 4 patients with detectable latent reservoir at baseline after 48 weeks of intensification. No effect on plasma residual viremia was observed. Unexpectedly, all the patients had detectable 2LTR DNA circles at week 24, while none of them showed those circles at the end of the study. No changes were detected in CD4+ or CD8+ counts, although a significant decrease was found in the proportion of HLA-DR+/CD38+ CD4+ and CD8+ T-cells. LPS and sCD14 levels increased. Conclusions: Intensification with MVC was associated with a trend to a decrease in the size of the latent HIV-1 reservoir in memory T cells. No impact on residual viremia was detected. Additional studies with larger samples are needed to confirm the results

Repositorio Institucional de la Universidad de Córdoba

Public Library of Science (PLOS)

Crossref

University of Birmingham Research Portal

Directory of Open Access Journals

PubMed Central

Commentary: Robots As Intentional Agents: Using Neuroscientific Methods to Make Robots Appear More Social

Author: Martin H. Fischer
Viktor Kewenig
Yuefang Zhou
Publication venue: 'Frontiers Media SA'
Publication date: 01/01/2018
Field of study

Directory of Open Access Journals

Frontiers - Publisher Connector

Recommended from our members

Do you hear how BIG it is? Iconic Prosody in Child Directed Language Supports Language Acquisition

Author: Brieke Ricarda
Gu Yan
Kewenig Viktor
Vigliocco Gabriella
Publication venue: eScholarship, University of California
Publication date: 01/01/2021
Field of study

Child directed language has been characterized by exaggerated prosody which can serve multiple functions including highlighting properties of meaning via iconicity. Iconic prosody may help language acquisition by bringing properties of displaced or unknown referents to the language learner’s “mind’s eye” or facilitating the acquisition of abstract features such as “direction”, or “speed”. We investigate iconic prosody in semi-naturalistic caregiver-child interactions. 50 caregivers were asked to talk to their child (2-4 years) about a set of toys either known or unknown to the child, and either present or absent from the interaction. In a first analysis, we included instances of iconic prosody as subjectively coded. In a second analysis, we looked at acoustic modulations for a set of seed words. In both analyses, we found that caregivers made use of iconic prosody more when talking about unknown or displaced objects, pointing to a neglected role for prosody in word learning

eScholarship - University of California

Recommended from our members

Do you hear how BIG it is? Iconic Prosody in Child Directed Language Supports Language Acquisition

Author: Brieke Ricarda
Gu Yan
Kewenig Viktor
Vigliocco Gabriella
Publication venue: eScholarship, University of California
Publication date: 01/01/2021
Field of study

eScholarship - University of California

Language development beyond the here-and-now: iconicity and displacement in child-directed communication

Author: Beata Grzyb
Chloë Marshall
Ed Donnellan
Elizabeth Wonnacott
Gabriella Vigliocco
Margherita Murgiano
Pamela Perniss
Ricarda Brieke
Viktor Kewenig
Yan Gu
Yasamin Motamedi
Publication venue: PsyArXiv
Publication date: 14/01/2024
Field of study

Most language use is displaced, referring to past, future or hypothetical events. Displacement poses an important challenge for language learning. How can children learn what words refer to when the referent is not physically available? We suggest that caregivers provide children with iconic vocal and gestural cues that imagistically evoke properties of absent referents to support displaced learning. We collected an audio-visual corpus of English-speaking caregiver-child interactions (N = 71, 24-58 months, 37 female) and annotated the range of vocal and manual behaviours caregivers produced. We found that caregivers used iconic cues especially in displaced contexts, using other cues when objects were present. Thus, we map caregivers’ non-linguistic behaviours, showing that they provide iconic cues to support displaced language learning and processing

PsyArxiv

Multimodal cues in child-directed communication

Author: Beata Grzyb
Chloë Marshall
Ed Donnellan
Elizabeth Wonnacott
Gabriella Vigliocco
Margherita Murgiano
Pamela Perniss
Ricarda Brieke
Viktor Kewenig
Yan Gu
Yasamin Motamedi
Publication venue: 'Center for Open Science'
Publication date: 31/07/2023
Field of study

Project looking at multimodal cues (onomatopoeia, gesture, points, object manipulation, eye gaze) in caregivers' communication to their English-speaking toddlers

OSF Preprints