no code implementations • 9 Feb 2024 • Wilka Carvalho, Momchil S. Tomov, William de Cothi, Caswell Barry, Samuel J. Gershman
Adaptive behavior often requires predicting future events.
1 code implementation • 13 Dec 2023 • Qihong Lu, Tan T. Nguyen, Qiong Zhang, Uri Hasson, Thomas L. Griffiths, Jeffrey M. Zacks, Samuel J. Gershman, Kenneth A. Norman
Through learning, it naturally stores structure that is shared across tasks in the network weights.
no code implementations • 5 Dec 2023 • Marcel Binz, Stephan Alaniz, Adina Roskies, Balazs Aczel, Carl T. Bergstrom, Colin Allen, Daniel Schad, Dirk Wulff, Jevin D. West, Qiong Zhang, Richard M. Shiffrin, Samuel J. Gershman, Ven Popov, Emily M. Bender, Marco Marelli, Matthew M. Botvinick, Zeynep Akata, Eric Schulz
For this opinion piece, we have invited four diverse groups of scientists to reflect on this query, sharing their perspectives and engaging in debate.
no code implementations • 9 Oct 2023 • Tanishq Kumar, Blake Bordelon, Samuel J. Gershman, Cengiz Pehlevan
We identify sufficient statistics for the test loss of such a network. Tracking these statistics over training reveals that grokking arises in this setting when the network first attempts to fit a kernel-regression solution with its initial features, followed by late-time feature learning in which a generalizing solution is identified after the training loss is already low.
no code implementations • NeurIPS 2023 • Changmin Yu, Neil Burgess, Maneesh Sahani, Samuel J. Gershman
Here we focus on exploration with intrinsic rewards, where the agent transiently augments the external rewards with self-generated intrinsic rewards.
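The augmentation described above can be sketched with a simple count-based novelty bonus. This is a generic illustration of intrinsic rewards, not the specific bonus studied in the paper; the function name `make_intrinsic_reward` and the inverse-square-root schedule are assumptions for the sketch.

```python
from collections import defaultdict

def make_intrinsic_reward(beta=0.1):
    """Count-based novelty bonus: the intrinsic reward for a state decays
    with the square root of its visit count, so the bonus is transient."""
    counts = defaultdict(int)

    def augmented_reward(state, external_reward):
        counts[state] += 1
        intrinsic = beta / counts[state] ** 0.5
        return external_reward + intrinsic

    return augmented_reward

r = make_intrinsic_reward(beta=0.1)
# First visit to a state earns the full bonus; repeat visits earn less.
first = r("s0", 0.0)   # 0.0 + 0.1 / sqrt(1) = 0.1
second = r("s0", 0.0)  # 0.0 + 0.1 / sqrt(2)
```

As visits accumulate, the intrinsic term vanishes and the agent's effective reward reverts to the external one, which is the sense in which the augmentation is transient.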
no code implementations • 11 Sep 2022 • Samuel J. Gershman
As an alternative, it has been proposed that molecules within the cell body are the storage sites of memory, and that memories are formed through biochemical operations on these molecules.
no code implementations • 27 Aug 2021 • William H. Alexander, Samuel J. Gershman
The Reward Prediction Error hypothesis proposes that phasic activity in the midbrain dopaminergic system reflects the prediction errors required for reinforcement learning.
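In standard temporal-difference learning, the reward prediction error takes the form δ = r + γV(s′) − V(s). A minimal TD(0) sketch of this quantity (generic textbook form, not the paper's specific model) looks like:

```python
def td_update(V, s, r, s_next, alpha=0.1, gamma=0.9):
    """One temporal-difference update. `delta` is the reward prediction
    error that the hypothesis links to phasic dopamine activity."""
    delta = r + gamma * V.get(s_next, 0.0) - V.get(s, 0.0)
    V[s] = V.get(s, 0.0) + alpha * delta
    return delta

V = {}
# Unexpected reward: delta = 1.0 + 0.9 * 0 - 0 = 1.0, and V["cue"] moves to 0.1.
delta = td_update(V, "cue", r=1.0, s_next="end")
```

Once the reward becomes fully predicted (V("cue") ≈ r + γV("end")), δ goes to zero, matching the classic finding that dopamine responses to expected rewards diminish with learning.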
no code implementations • 27 Jul 2021 • Pedro A. Tsividis, Joao Loula, Jake Burga, Nathan Foss, Andres Campero, Thomas Pouncy, Samuel J. Gershman, Joshua B. Tenenbaum
Here we propose a new approach to this challenge based on a particularly strong form of model-based RL which we call Theory-Based Reinforcement Learning, because it uses human-like intuitive theories -- rich, abstract, causal models of physical objects, intentional agents, and their interactions -- to explore and model an environment, and plan effectively to achieve task goals.
no code implementations • ICLR 2022 • Tuan Anh Le, Katherine M. Collins, Luke Hewitt, Kevin Ellis, N. Siddharth, Samuel J. Gershman, Joshua B. Tenenbaum
We build on a recent approach, Memoised Wake-Sleep (MWS), which alleviates part of the problem by memoising discrete variables, and extend it to handle continuous variables in a principled and effective way by learning a separate recognition model used for importance-sampling-based approximate inference and marginalization.
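The importance-sampling marginalization mentioned above has a standard form: log p(x) is estimated as the log of an average of weights p(x, z)/q(z) over samples z ~ q. A minimal generic sketch (not the MWS recognition model itself; all function names here are assumptions):

```python
import math
import random

def log_marginal_is(log_joint, sample_q, log_q, num_particles=1000, seed=0):
    """Importance-sampling estimate of log p(x) = log E_{z~q}[p(x,z)/q(z)],
    computed with log-sum-exp for numerical stability."""
    rng = random.Random(seed)
    log_weights = []
    for _ in range(num_particles):
        z = sample_q(rng)
        log_weights.append(log_joint(z) - log_q(z))
    m = max(log_weights)
    return m + math.log(sum(math.exp(lw - m) for lw in log_weights)) \
             - math.log(num_particles)

# Sanity check: if p(x,z)/q(z) is the constant 0.5, the estimate is log 0.5.
est = log_marginal_is(
    log_joint=lambda z: math.log(0.5),  # hypothetical joint density over z in [0, 1]
    sample_q=lambda rng: rng.random(),  # proposal q = Uniform(0, 1), density 1
    log_q=lambda z: 0.0,
)
```

A learned recognition model plays the role of `sample_q`/`log_q` here; the closer it is to the true posterior, the lower the variance of the estimator.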
no code implementations • Findings (ACL) 2021 • Ruocheng Wang, Jiayuan Mao, Samuel J. Gershman, Jiajun Wu
These object-centric concepts derived from language facilitate the learning of object-centric representations.
1 code implementation • 12 Sep 2019 • Ishita Dasgupta, Demi Guo, Samuel J. Gershman, Noah D. Goodman
Analyzing performance on these diagnostic tests indicates a lack of systematicity in the representations and decision rules, and reveals a set of heuristic strategies.
no code implementations • 23 Jan 2019 • Samuel J. Gershman
The free energy principle has been proposed as a unifying account of brain function.
no code implementations • NeurIPS 2018 • Isaac Lage, Andrew Slavin Ross, Been Kim, Samuel J. Gershman, Finale Doshi-Velez
We often desire our models to be interpretable as well as accurate.
no code implementations • 18 Feb 2018 • Zoran Tiganj, Samuel J. Gershman, Per B. Sederberg, Marc W. Howard
Widely used reinforcement learning algorithms discretize continuous time and estimate either transition functions from one step to the next (model-based algorithms) or a scalar value of exponentially discounted future reward using the Bellman equation (model-free algorithms).
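For the model-based case described above, the Bellman equation on a discretized MDP can be solved by value iteration. A minimal sketch (generic value iteration; the tabular `P`/`R` encoding is an assumption for illustration):

```python
def value_iteration(P, R, gamma=0.95, tol=1e-8):
    """Solve the Bellman equation V(s) = max_a [R(s,a) + gamma * E[V(s')]]
    on a discretized MDP. P[s][a] is a list of (prob, next_state) pairs;
    R[s][a] is the immediate reward."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            v_new = max(
                R[s][a] + gamma * sum(p * V[s2] for p, s2 in P[s][a])
                for a in P[s]
            )
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        if delta < tol:
            return V

# One absorbing state paying reward 1 per step: V = 1 / (1 - gamma) = 20.
P = {"A": {"stay": [(1.0, "A")]}}
R = {"A": {"stay": 1.0}}
V = value_iteration(P, R, gamma=0.95)
```

The exponential discounting enters through the single factor `gamma` per discrete step, which is exactly the time-discretization assumption the paper examines.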
1 code implementation • 12 Feb 2018 • Ishita Dasgupta, Demi Guo, Andreas Stuhlmüller, Samuel J. Gershman, Noah D. Goodman
Further, we find that augmenting training with our dataset improves test performance on our dataset without loss of performance on the original training dataset.
no code implementations • NeurIPS 2016 • Eric Schulz, Josh Tenenbaum, David K. Duvenaud, Maarten Speekenbrink, Samuel J. Gershman
How do people learn about complex functional structure?
1 code implementation • 8 Jun 2016 • Tejas D. Kulkarni, Ardavan Saeedi, Simanta Gautam, Samuel J. Gershman
The successor map represents the expected future state occupancy from any given state, and the reward predictor maps states to scalar rewards.
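The factorization described above yields values as V(s) = Σ_{s′} M(s, s′) · w(s′), where M is the successor map and w the reward predictor. A minimal sketch with hypothetical tabular inputs:

```python
def sr_value(M, w, s):
    """Value under the successor representation: V(s) = sum_{s'} M[s][s'] * w[s'],
    where M[s][s'] is the expected discounted future occupancy of s' starting
    from s, and w maps states to immediate rewards."""
    return sum(occ * w.get(s2, 0.0) for s2, occ in M[s].items())

# Toy example: from s0 the agent occupies s0 now (1.0) and expects
# discounted occupancy 0.9 of s1; only s1 carries reward.
M = {"s0": {"s0": 1.0, "s1": 0.9}}
w = {"s1": 1.0}
v = sr_value(M, w, "s0")  # 1.0 * 0.0 + 0.9 * 1.0 = 0.9
```

Because reward enters only through the dot product, changing `w` revalues all states instantly without relearning `M`, which is the flexibility the successor representation is prized for.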
no code implementations • 1 Apr 2016 • Brenden M. Lake, Tomer D. Ullman, Joshua B. Tenenbaum, Samuel J. Gershman
Recent progress in artificial intelligence (AI) has renewed interest in building systems that learn and think like people.
no code implementations • NeurIPS 2014 • Kimberly L. Stachenfeld, Matthew Botvinick, Samuel J. Gershman
Furthermore, we demonstrate that this representation of space can support efficient reinforcement learning.
Tasks: Hierarchical Reinforcement Learning, Reinforcement Learning +1