Search CORE

86 research outputs found

Memory, Space, and Planning: Multiscale Predictive Representations

Author: Momennejad Ida
Publication venue
Publication date: 19/02/2024
Field of study

Memory is inherently entangled with prediction and planning. Flexible behavior in biological and artificial agents depends on the interplay of learning from the past and predicting the future in ever-changing environments. This chapter reviews computational, behavioral, and neural evidence suggesting these processes rely on learning the relational structure of experiences, known as cognitive maps, and draws two key takeaways. First, that these memory structures are organized as multiscale, compact predictive representations in hippocampal and prefrontal cortex, or PFC, hierarchies. Second, we argue that such predictive memory structures are crucial to the complementary functions of the hippocampus and PFC, both for enabling a recall of detailed and coherent past episodes as well as generalizing experiences at varying scales for efficient prediction and planning. These insights advance our understanding of memory and planning mechanisms in the brain and hold significant implications for advancing artificial intelligence systems.Comment: To be published as a chapter in an edited volume by Oxford University Press (Editors: Sara Aronowitz and Lynn Nadel

arXiv.org e-Print Archive

In-Context Learning in Large Language Models: A Neuroscience-inspired Analysis of Representations

Author: Betthauser Leo
Hasanbeig Hosein
Millière Raphaël
Momennejad Ida
Saran Akanksha
Yousefi Safoora
Publication venue
Publication date: 18/10/2023
Field of study

Large language models (LLMs) exhibit remarkable performance improvement through in-context learning (ICL) by leveraging task-specific examples in the input. However, the mechanisms behind this improvement remain elusive. In this work, we investigate embeddings and attention representations in Llama-2 70B and Vicuna 13B. Specifically, we study how embeddings and attention change after in-context-learning, and how these changes mediate improvement in behavior. We employ neuroscience-inspired techniques, such as representational similarity analysis (RSA), and propose novel methods for parameterized probing and attention ratio analysis (ARA, measuring the ratio of attention to relevant vs. irrelevant information). We designed three tasks with a priori relationships among their conditions: reading comprehension, linear regression, and adversarial prompt injection. We formed hypotheses about expected similarities in task representations to investigate latent changes in embeddings and attention. Our analyses revealed a meaningful correlation between changes in both embeddings and attention representations with improvements in behavioral performance after ICL. This empirical framework empowers a nuanced understanding of how latent representations affect LLM behavior with and without ICL, offering valuable tools and insights for future research and practical applications.Comment: Added overview figures 1-3 in this versio

arXiv.org e-Print Archive

A Prefrontal Cortex-inspired Architecture for Planning in Large Language Models

Author: Krabach Brian
Momennejad Ida
Mondal Shanka Subhra
Wang Chi
Webb Taylor
Publication venue
Publication date: 05/03/2024
Field of study

Large language models (LLMs) demonstrate impressive performance on a wide variety of tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed planning. To address this, we take inspiration from the human brain, in which planning is accomplished via the recurrent interaction of specialized modules in the prefrontal cortex (PFC). These modules perform functions such as conflict monitoring, state prediction, state evaluation, task decomposition, and task coordination. We find that LLMs are sometimes capable of carrying out these functions in isolation, but struggle to autonomously coordinate them in the service of a goal. Therefore, we propose a black box architecture with multiple LLM-based (GPT-4) modules. The architecture improves planning through the interaction of specialized PFC-inspired modules that break down a larger problem into multiple brief automated calls to the LLM. We evaluate the combined architecture on three challenging planning tasks -- graph traversal, Tower of Hanoi, and logistics -- finding that it yields significant improvements over standard LLM methods (e.g., zero-shot prompting, in-context learning, and chain-of-thought). These results demonstrate the benefit of utilizing knowledge from cognitive neuroscience to improve planning in LLMs

arXiv.org e-Print Archive

Recommended from our members

Predictive representations can link model-based reinforcement learning to model-free mechanisms

Author: Botvinick Matthew M.
Daw Nathaniel D.
Gershman Samuel J.
Momennejad Ida
Russek Evan M.
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/09/2017
Field of study

Humans and animals are capable of evaluating actions by considering their long-run future rewards through a process described using model-based reinforcement learning (RL) algorithms. The mechanisms by which neural circuits perform the computations prescribed by model-based RL remain largely unknown; however, multiple lines of evidence suggest that neural circuits supporting model-based behavior are structurally homologous to and overlapping with those thought to carry out model-free temporal difference (TD) learning. Here, we lay out a family of approaches by which model-based computation may be built upon a core of TD learning. The foundation of this framework is the successor representation, a predictive state representation that, when combined with TD learning of value predictions, can produce a subset of the behaviors associated with model-based learning, while requiring less decision-time computation than dynamic programming. Using simulations, we delineate the precise behavioral capabilities enabled by evaluating actions using this approach, and compare them to those demonstrated by biological organisms. We then introduce two new algorithms that build upon the successor representation while progressively mitigating its limitations. Because this framework can account for the full range of observed putatively model-based behaviors while still utilizing a core TD framework, we suggest that it represents a neurally plausible family of mechanisms for model-based evaluation

Princeton University Open Access Repository

Crossref

Harvard University - DASH

Directory of Open Access Journals

UCL Discovery

Searchlight-based multi-voxel pattern analysis of fMRI by cross-validated MANOVA

Author: Abrams
Blankertz
Boser
Carlson
Carsten Allefeld
Chang
Cichy
Cichy
Cohen
Cohen
Cortes
Cox
Edelman
Fang
Fonov
Friston
Friston
Friston
Glaser
Good
Goutte
Haken
Hastie
Haxby
Haxby
Haynes
Haynes
Haynes
John-Dylan Haynes
Kahnt
Kamitani
Kiebel
Kriegeskorte
Kruggel
Kullback
Lehmann
Mahalanobis
McIntosh
McKeown
Mitchell
Momennejad
Nichols
Norman
Pereira
Schäfer
Stelzer
Timm
Tong
Worsley
Wyman
Publication venue: 'Elsevier BV'
Publication date: 01/12/2013
Field of study

Multi-voxel pattern analysis (MVPA) is a fruitful and increasingly popular complement to traditional univariate methods of analyzing neuroimaging data. We propose to replace the standard ‘decoding’ approach to searchlight-based MVPA, measuring the performance of a classifier by its accuracy, with a method based on the multivariate form of the general linear model. Following the well-established methodology of multivariate analysis of variance (MANOVA), we define a measure that directly characterizes the structure of multi-voxel data, the pattern distinctness D. Our measure is related to standard multivariate statistics, but we apply cross-validation to obtain an unbiased estimate of its population value, independent of the amount of data or its partitioning into ‘training’ and ‘test’ sets. The estimate can therefore serve not only as a test statistic, but also as an interpretable measure of multivariate effect size. The pattern distinctness generalizes the Mahalanobis distance to an arbitrary number of classes, but also the case where there are no classes of trials because the design is described by parametric regressors. It is defined for arbitrary estimable contrasts, including main effects (pattern differences) and interactions (pattern changes). In this way, our approach makes the full analytical power of complex factorial designs known from univariate fMRI analyses available to MVPA studies. Moreover, we show how the results of a factorial analysis can be used to obtain a measure of pattern stability, the equivalent of ‘cross-decoding’

arXiv.org e-Print Archive

City Research Online

Crossref

Prioritized memory access explains planning and hippocampal replay.

Author: A Johnson
A Johnson
AC Singer
AK Lee
AnbspC Singer
AS Gupta
AW Moore
BB Doll
BB Doll
BE Pfeiffer
BF Sadacca
CM Gillan
CS Lansink
CT Wu
D Dupret
D Shohamy
DJ Foster
F Cushman
GE Wimmer
HF Ólafsdóttir
I. Momennejad
J O’Neill
J Peng
JC Jackson
JL McClelland
John R. Anderson
K Diba
Kimberly L Stachenfeld
Marcelo G. Mattar
Mehdi Keramati
MF Carr
MJ Sharpe
MM Botvinick
MP Karlsson
Nathaniel D. Daw
ND Daw
ND Daw
ND Daw
P Dayan
QJ Huys
RE Ambrose
RS Sutton
RS Sutton
S Cheng
Samuel J. Gershman
SN Gomperts
SP Jadhav
TJ Davidson
W Schultz
Publication venue: Nat Neurosci
Publication date: 01/11/2018
Field of study

To make decisions, animals must evaluate candidate choices by accessing memories of relevant experiences. Yet little is known about which experiences are considered or ignored during deliberation, which ultimately governs choice. We propose a normative theory predicting which memories should be accessed at each moment to optimize future decisions. Using nonlocal 'replay' of spatial locations in hippocampus as a window into memory access, we simulate a spatial navigation task in which an agent accesses memories of locations sequentially, ordered by utility: how much extra reward would be earned due to better choices. This prioritization balances two desiderata: the need to evaluate imminent choices versus the gain from propagating newly encountered information to preceding locations. Our theory offers a simple explanation for numerous findings about place cells; unifies seemingly disparate proposed functions of replay including planning, learning, and consolidation; and posits a mechanism whose dysfunction may underlie pathologies like rumination and craving

Crossref

Apollo (Cambridge)

Author response image 2.

Author: Alvernhe
Alvernhe
Aprahamian
Baram
Barron
Barron
Benzi
Berry
Boorman
Bornstein
Bush
Chadwick
Cleeremans
Collin
Constantinescu
Dayan
Derdikman
Dordek
Eichenbaum
Estrada
Estrada
Ezzyat
Glover
Gläscher
Grill-Spector
Hafting
Heckers
Horner
Howard
Hutton
Komorowski
Kumaran
Kumaran
McKenzie
McNaughton
Mittelstaedt
Momennejad
Morgan
O'Keefe
O’Keefe
Preston
Reber
Reber
Rossion
Rubin
Russek
Schapiro
Schapiro
Schendan
Schlichting
Schuck
Seger
Spiers
Stachenfeld
Stachenfeld
Stemmler
Strange
Tavares
Tolman
Weiskopf
Wimmer
Publication venue: 'eLife Sciences Publications, Ltd'
Publication date: 01/01/2017
Field of study

The hippocampal–entorhinal system encodes a map of space that guides spatial navigation. Goal-directed behaviour outside of spatial navigation similarly requires a representation of abstract forms of relational knowledge. This information relies on the same neural system, but it is not known whether the organisational principles governing continuous maps may extend to the implicit encoding of discrete, non-spatial graphs. Here, we show that the human hippocampal–entorhinal system can represent relationships between objects using a metric that depends on associative strength. We reconstruct a map-like knowledge structure directly from a hippocampal–entorhinal functional magnetic resonance imaging adaptation signal in a situation where relationships are non-spatial rather than spatial, discrete rather than continuous, and unavailable to conscious awareness. Notably, the measure that best predicted a behavioural signature of implicit knowledge and blood oxygen level-dependent adaptation was a weighted sum of future states, akin to the successor representation that has been proposed to account for place and grid-cell firing patterns

Crossref

UCL Discovery

Oxford University Research Archive

Electronic Archiving System

MPG.PuRe

Recommended from our members

Priority-Adjusted Replay for Successor Representations

Author: Barnett Samuel
Momennejad Ida
Publication venue: eScholarship, University of California
Publication date: 01/01/2022
Field of study

Intelligent agents are capable of transfer and generalization. This flexibility in adapting to new tasks and environments often relies on representation learning and replay. Among these algorithms, successor representation learning and memory replay offer biologically plausible solutions. However, replay prioritization algorithms remain largely limited to value prediction errors. Here we propose PARSR, Priority-Adjusted Replay for Successor Representations, to address this caveat. Decoupling learning of the environment dynamics and rewards, PARSR can use prediction errors from either representation learning or values to prioritize memory replay. We compare PARSR to SR-Dyna, Dyna-Q, and a number of state of the art algorithms using replay and successor representations in cognitive neuroscience. We find that PARSR is able to reproduce human behavior in a number of revaluation tasks while also representing a performance improvement over SR-Dyna, its closest counterpart

eScholarship - University of California