117 research outputs found
Boosting Theory-of-Mind Performance in Large Language Models via Prompting
Large language models (LLMs) excel in many tasks in 2023, but they still face
challenges in complex reasoning. Theory-of-mind (ToM) tasks, which require
understanding agents' beliefs, goals, and mental states, are essential for
common-sense reasoning involving humans, making it crucial to enhance LLM
performance in this area. This study measures the ToM performance of GPT-4 and
three GPT-3.5 variants (Davinci-2, Davinci-3, GPT-3.5-Turbo), and investigates
the effectiveness of in-context learning in improving their ToM comprehension.
We evaluated prompts featuring two-shot chain of thought reasoning and
step-by-step thinking instructions. We found that LLMs trained with
Reinforcement Learning from Human Feedback (RLHF) (all models excluding
Davinci-2) improved their ToM accuracy via in-context learning. GPT-4 performed
best in zero-shot settings, reaching nearly 80% ToM accuracy, but still fell
short of the 87% human accuracy on the test set. However, when supplied with
prompts for in-context learning, all RLHF-trained LLMs exceeded 80% ToM
accuracy, with GPT-4 reaching 100%. These results demonstrate that appropriate
prompting enhances LLM ToM reasoning, and they underscore the context-dependent
nature of LLM cognitive capacities.Comment: 27 pages, 4 main figures, 2 supplementary figure
Learning Representations from Temporally Smooth Data
Events in the real world are correlated across nearby points in time, and we
must learn from this temporally smooth data. However, when neural networks are
trained to categorize or reconstruct single items, the common practice is to
randomize the order of training items. What are the effects of temporally
smooth training data on the efficiency of learning? We first tested the effects
of smoothness in training data on incremental learning in feedforward nets and
found that smoother data slowed learning. Moreover, sampling so as to minimize
temporal smoothness produced more efficient learning than sampling randomly. If
smoothness generally impairs incremental learning, then how can networks be
modified to benefit from smoothness in the training data? We hypothesized that
two simple brain-inspired mechanisms, leaky memory in activation units and
memory-gating, could enable networks to rapidly extract useful representations
from smooth data. Across all levels of data smoothness, these brain-inspired
architectures achieved more efficient category learning than feedforward
networks. This advantage persisted, even when leaky memory networks with gating
were trained on smooth data and tested on randomly-ordered data. Finally, we
investigated how these brain-inspired mechanisms altered the internal
representations learned by the networks. We found that networks with
multi-scale leaky memory and memory-gating could learn internal representations
that un-mixed data sources which vary on fast and slow timescales across
training samples. Altogether, we identified simple mechanisms enabling neural
networks to learn more quickly from temporally smooth data, and to generate
internal representations that separate timescales in the training signal
Mapping the Structural Core of Human Cerebral Cortex
Structurally segregated and functionally specialized regions of the human cerebral cortex are interconnected by a dense network of cortico-cortical axonal pathways. By using diffusion spectrum imaging, we noninvasively mapped these pathways within and across cortical hemispheres in individual human participants. An analysis of the resulting large-scale structural brain networks reveals a structural core within posterior medial and parietal cerebral cortex, as well as several distinct temporal and frontal modules. Brain regions within the structural core share high degree, strength, and betweenness centrality, and they constitute connector hubs that link all major structural modules. The structural core contains brain regions that form the posterior components of the human default network. Looking both within and outside of core regions, we observed a substantial correspondence between structural connectivity and resting-state functional connectivity measured in the same participants. The spatial and topological centrality of the core within cortex suggests an important role in functional integration
Corticocortical evoked potentials reveal projectors and integrators in human brain networks.
The cerebral cortex is composed of subregions whose
functional specialization is largely determined by their
incoming and outgoing connections with each other. In the
present study, we asked which cortical regions can exert the
greatest influence over other regions and the cortical
network as a whole. Previous research on this question has
relied on coarse anatomy (mapping large fiber pathways) or
functional connectivity (mapping inter-regional statistical
dependencies in ongoing activity). Here we combined direct
electrical stimulation with recordings from the cortical
surface to provide a novel insight into directed, inter-
regional influence within the cerebral cortex of awake
humans. These networks of directed interaction were
reproducible across strength thresholds and across subjects.
Directed network properties included (1) a decrease in the
reciprocity of connections with distance; (2) major projector
nodes (sources of influence) were found in peri-Rolandic
cortex and posterior, basal and polar regions of the temporal
lobe; and (3) major receiver nodes (receivers of influence)
were found in anterolateral frontal, superior parietal, and
superior temporal regions. Connectivity maps derived from
electrical stimulation and from resting electrocorticography
(ECoG) correlations showed similar spatial distributions for
the same source node. However, higher-level network topology
analysis revealed differences between electrical stimulation
and ECoG that were partially related to the reciprocity of
connections. Together, these findings inform our
understanding of large-scale corticocortical influence as
well as the interpretation of functional connectivity
networks
The āNarrativesā fMRI dataset for evaluating models of naturalistic language comprehension
The āNarrativesā collection aggregates a variety of functional MRI datasets collected while human subjects listened to naturalistic spoken stories. The current release includes 345 subjects, 891 functional scans, and 27 diverse stories of varying duration totaling ~4.6 hours of unique stimuli (~43,000 words). This data collection is well-suited for naturalistic neuroimaging analysis, and is intended to serve as a benchmark for models of language and narrative comprehension. We provide standardized MRI data accompanied by rich metadata, preprocessed versions of the data ready for immediate use, and the spoken story stimuli with time-stamped phoneme- and word-level transcripts. All code and data are publicly available with full provenance in keeping with current best practices in transparent and reproducible neuroimaging
Dynamic Modulation of Local Population Activity by Rhythm Phase in Human Occipital Cortex During a Visual Search Task
Brain rhythms are more than just passive phenomena in visual cortex. For the first time, we show that the physiology underlying brain rhythms actively suppresses and releases cortical areas on a second-to-second basis during visual processing. Furthermore, their influence is specific at the scale of individual gyri. We quantified the interaction between broadband spectral change and brain rhythms on a second-to-second basis in electrocorticographic (ECoG) measurement of brain surface potentials in five human subjects during a visual search task. Comparison of visual search epochs with a blank screen baseline revealed changes in the raw potential, the amplitude of rhythmic activity, and in the decoupled broadband spectral amplitude. We present new methods to characterize the intensity and preferred phase of coupling between broadband power and band-limited rhythms, and to estimate the magnitude of rhythm-to-broadband modulation on a trial-by-trial basis. These tools revealed numerous coupling motifs between the phase of low-frequency (Ī“, Īø, Ī±, Ī², and Ī³ band) rhythms and the amplitude of broadband spectral change. In the Īø and Ī² ranges, the coupling of phase to broadband change is dynamic during visual processing, decreasing in some occipital areas and increasing in others, in a gyrally specific pattern. Finally, we demonstrate that the rhythms interact with one another across frequency ranges, and across cortical sites
Identification and Classification of Hubs in Brain Networks
Brain regions in the mammalian cerebral cortex are linked by a complex network of fiber bundles. These inter-regional networks have previously been analyzed in terms of their node degree, structural motif, path length and clustering coefficient distributions. In this paper we focus on the identification and classification of hub regions, which are thought to play pivotal roles in the coordination of information flow. We identify hubs and characterize their network contributions by examining motif fingerprints and centrality indices for all regions within the cerebral cortices of both the cat and the macaque. Motif fingerprints capture the statistics of local connection patterns, while measures of centrality identify regions that lie on many of the shortest paths between parts of the network. Within both cat and macaque networks, we find that a combination of degree, motif participation, betweenness centrality and closeness centrality allows for reliable identification of hub regions, many of which have previously been functionally classified as polysensory or multimodal. We then classify hubs as either provincial (intra-cluster) hubs or connector (inter-cluster) hubs, and proceed to show that lesioning hubs of each type from the network produces opposite effects on the small-world index. Our study presents an approach to the identification and classification of putative hub regions in brain networks on the basis of multiple network attributes and charts potential links between the structural embedding of such regions and their functional roles
- ā¦