ICLR 2024PastSafety & alignment

ICLR 2024 Workshop on Representational Alignment

ICLR 2024 Workshop Re-Align

Official website ↗OpenReview venue ↗See all ICLR workshops →✎ Edit this entry

Submission deadline: Feb 9, 2024, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (56)

Fetched from OpenReview (v2) on 2026-06-10.

A case for sparse positive alignment of neural systems
Jacob S. Prince, Colin Conwell, George A. Alvarez, Talia Konkle · PDF
An Analysis of Human Alignment of Latent Diffusion Models
Lorenz Linhardt, Marco Morik, Sidney Bender, Naima Elosegui Borras · PDF
Beyond Sight: Probing Alignment Between Image Models and Blind V1
Galen Pogoncheff, Jacob Granley, Alfonso Rodil, Leili Soo, Lily Marie Turkstra, Lucas Gil Nadolskis, Arantxa Alfaro Saez, Cristina Soto Sanchez, Eduardo Fernandez Jover, Michael Beyeler · PDF
Biased Causal Strength Judgments in Humans and Large Language Models
Anita Keshmirian, Moritz Willig, Babak Hemmatian, Ulrike Hahn, Kristian Kersting, Tobias Gerstenberg · PDF
Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex
Jea Kwon, Kyungwoo Song, C. Justin Lee · PDF
Can Foundation Models Smell Like Humans?
Farzaneh Taleb, Miguel Vasco, Nona Rajabi, Mårten Björkman, Danica Kragic · PDF
Can Generative Multimodal Models Count to Ten?
Sunayana Rane, Alexander Ku, Jason Michael Baldridge, Ian Tenney, Thomas L. Griffiths, Been Kim · PDF
Categories vs Semantic Features: What shape the similarities people discern in photographs of objects?
Siddharth Suresh, Wei-Chun Huang, Kushin Mukherjee, Timothy T. Rogers · PDF
Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction
Sreejan Kumar, Raja Marjieh, Byron Zhang, Declan Iain Campbell, Michael Y. Hu, Umang Bhatt, Brenden Lake, Thomas L. Griffiths · PDF
Comparing supervised learning dynamics: Deep neural networks match human data efficiency but show a generalisation lag
Lukas S. Huber, Fred W. Mast, Felix A. Wichmann · PDF
Context-Sensitive Semantic Reasoning in Large Language Models
Tyler Giallanza, Declan Iain Campbell · PDF
Correcting Biased Centered Kernel Alignment Measures in Biological and Artificial Neural Networks
Alex Graeme Murphy, Joel Zylberberg, Alona Fyshe · PDF
Differentiable Optimization of Similarity Scores Between Models and Brains
Nathan Cloos, Markus Siegel, Scott L. Brincat, Earl K. Miller, Christopher J Cueva · PDF
Disentangling Recurrent Neural Dynamics with Stochastic Representational Geometry
David Lipshutz, Amin Nejatbakhsh, Alex H Williams · PDF
Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration
Ziqi Wen, Tianqin Li, Zhi Jing, Tai Sing Lee · PDF
Enriching ConvNets with pre-cortical processing enhances alignment with human brain responses
Niklas Müller, H.Steven Scholte, Iris Groen · PDF
Explaining Human Comparisons using Alignment-Importance Heatmaps
Nhut Truong, Dario Pesenti, Uri Hasson · PDF
Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis
Stefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky, Guy Wolf · PDF
How aligned are different alignment metrics?
Jannis Ahlert, Thomas Klein, Felix A. Wichmann, Robert Geirhos · PDF
Human and Deep Neural Network Alignment in Navigational Affordance Perception
Clemens Georg Bartnik, Iris Groen · PDF
Human-like Geometric Abstraction in Large Pre-Trained Neural Networks
Declan Iain Campbell, Sreejan Kumar, Tyler Giallanza, Jonathan D. Cohen, Thomas L. Griffiths · PDF
Humans diverge from language models when predicting spoken language
Thomas L. Botch, Emily S Finn · PDF
Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling
Wanqian Bao, Uri Hasson · PDF
Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
Nishad Singhi, Jae Myung Kim, Karsten Roth, Zeynep Akata · PDF
Inferring DNN-Brain Alignment using Representational Similarity Analyses can be Problematic
Marin Dujmovic, Jeffrey Bowers, Federico Adolfi, Gaurav Malhotra · PDF
Inter-animal transforms as a guide to model-brain comparison
Imran Thobani, Javier Sagastuy-Brena, Aran Nayebi, Rosa Cao, Daniel LK Yamins · PDF
Is my "red" your "red"?: Unsupervised alignment of qualia structures via optimal transport
Genji Kawakita, Ariel Mikhael Zeleznikow-Johnston, Ken Takeda, Naotsugu Tsuchiya, Masafumi Oizumi · PDF
Koopman Operator Based Dynamical Similarity Analysis for Data-driven Quantification of Distance between Dynamics
Shunsuke Kamiya, Jun Kitazono, Masafumi Oizumi · PDF
Learning and Aligning Structured Random Feature Networks
Vivian White, Muawiz Sajjad Chaudhary, Guy Wolf, Guillaume Lajoie, Kameron Decker Harris · PDF
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning
Carlos A. Velazquez-Vargas, Isaac Ray Christian, Jordan Taylor, Sreejan Kumar · PDF
Less is More: Discovering Concise Network Explanations
Neehar Kondapaneni, Markus Marks, Oisin Mac Aodha, Pietro Perona · PDF
Lessons learned in the study of representational alignment in physical reasoning
Felix Jedidja Binder, Rahul Mysore Venkatesh, Daniel LK Yamins, Judith E Fan · PDF
Measuring Human-CLIP Alignment at Different Abstraction Levels
Pablo Hernández-Cámara, Jorge Vila-Tomás, Jesus Malo, Valero Laparra · PDF
Measuring Mechanistic Interpretability at Scale Without Humans
Roland S. Zimmermann, David A. Klindt, Wieland Brendel · PDF
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
Paul Steven Scotti, Mihir Tripathy, Cesar Torrico, Reese Kneeland, Tong Chen, Ashutosh Narang, Charan Santhirasegaran, Jonathan Xu, Thomas Naselaris, Kenneth A. Norman, Tanishq Mathew Abraham · PDF
Modality-Agnostic fMRI Decoding of Vision and Language
Mitja Nikolaus, Milad Mozafari, Nicholas Asher, Leila Reddy, Rufin VanRullen · PDF
On convex decision regions in deep network representations
Lenka Tětková, Thea Brüsch, Teresa Scheidt, Fabian Mager, Rasmus Aagaard, Jonathan Foldager, Tommy Sonne Alstrøm, Lars Kai Hansen · PDF
On the universality of neural encodings in CNNs
Florentin Guth, Brice Ménard · PDF
ReAlnet: Achieving More Human Brain-Like Vision via Human Neural Representational Alignment
Zitong Lu, Yile Wang, Julie Golomb · PDF
Removing High Frequency Information Improves DNN Behavioral Alignment
Max Wolff, Evgenia Rusak, Wieland Brendel · PDF
Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the Brain
Gustaw Opielka, Jessica Loke, H.Steven Scholte · PDF
Self-supervised learning facilitates neural representation structures that can be unsupervisedly aligned to human behaviors
Soh Takahashi, Masaru Sasaki, Ken Takeda, Masafumi Oizumi · PDF
Simplicity in Complexity
Surabhi S Nath, Kevin Shen, Aenne Annelie Brielmann, Peter Dayan · PDF
Symbolic Variables in Distributed Networks that Count
Satchel Grant, Zhengxuan Wu, James Lloyd McClelland, Noah Goodman · PDF
TEMPERATURE-SCALING SURPRISAL ESTIMATES IMPROVE FIT TO HUMAN READING TIMES – BUT DOES IT DO SO FOR THE “RIGHT REASONS”?
Tong Liu, Iza Škrjanec, Vera Demberg · PDF
Texture bias in primate ventral visual cortex
Akshay Vivek Jagadeesh, Margaret Livingstone · PDF
The benefits of Incorporating Shape Priors in Contrastive Learning
Junru Zhao, Tianqin Li, Tai Sing Lee · PDF
The Curious Case of Representational Alignment: Unravelling Visio-Linguistic Tasks in Emergent Communication
Tom Kouwenhoven, Max Peeperkorn, Bram Van Dijk, Stephan Raaijmakers, Tessa Verhoef · PDF
The impact of task structure, representational geometry, and learning mechanism on compositional generalization
Samuel Lippl, Kim Stachenfeld · PDF
The role of shared labels and experiences in representational alignment
Kushin Mukherjee, Siddharth Suresh, Xizheng Yu, Gary Lupyan · PDF
Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting
Taha Osama A Binhuraib, Greta Tuckute, Nicholas Blauch · PDF
Towards neural foundation models for vision: Aligning EEG, MEG and fMRI representations to perform decoding, encoding and modality conversion
Matteo Ferrante, Tommaso Boccato, Nicola Toschi · PDF
Unsupervised alignment reveals structural commonalities and differences in neural representations of natural scenes across individuals and brain areas
Ken Takeda, Kota Abe, Jun Kitazono, Masafumi Oizumi · PDF
Unveiling the Dynamics of Transfer Learning Representations
Thomas Goerttler, Klaus Obermayer · PDF
What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes
Victor Lecomte, Kushal Thaman, Rylan Schaeffer, Naomi Bashkansky, Trevor Chow, Sanmi Koyejo · PDF
Wild Comparisons: A Study of how Representation Similarity Changes when Input Data is Drawn from a Shifted Distribution
Davis Brown, Madelyn Ruth Shapiro, Alyson Bittner, Jackson Warley, Henry Kvinge · PDF

Accepted papers (56)

☆A case for sparse positive alignment of neural systems

☆An Analysis of Human Alignment of Latent Diffusion Models

☆Beyond Sight: Probing Alignment Between Image Models and Blind V1

☆Biased Causal Strength Judgments in Humans and Large Language Models

☆Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex

☆Can Foundation Models Smell Like Humans?

☆Can Generative Multimodal Models Count to Ten?

☆Categories vs Semantic Features: What shape the similarities people discern in photographs of objects?

☆Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction

☆Comparing supervised learning dynamics: Deep neural networks match human data efficiency but show a generalisation lag

☆Context-Sensitive Semantic Reasoning in Large Language Models

☆Correcting Biased Centered Kernel Alignment Measures in Biological and Artificial Neural Networks

☆Differentiable Optimization of Similarity Scores Between Models and Brains

☆Disentangling Recurrent Neural Dynamics with Stochastic Representational Geometry

☆Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration

☆Enriching ConvNets with pre-cortical processing enhances alignment with human brain responses

☆Explaining Human Comparisons using Alignment-Importance Heatmaps

☆Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

☆How aligned are different alignment metrics?

☆Human and Deep Neural Network Alignment in Navigational Affordance Perception

☆Human-like Geometric Abstraction in Large Pre-Trained Neural Networks

☆Humans diverge from language models when predicting spoken language

☆Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling

☆Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

☆Inferring DNN-Brain Alignment using Representational Similarity Analyses can be Problematic

☆Inter-animal transforms as a guide to model-brain comparison

☆Is my "red" your "red"?: Unsupervised alignment of qualia structures via optimal transport

☆Koopman Operator Based Dynamical Similarity Analysis for Data-driven Quantification of Distance between Dynamics

☆Learning and Aligning Structured Random Feature Networks

☆Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning

☆Less is More: Discovering Concise Network Explanations

☆Lessons learned in the study of representational alignment in physical reasoning

☆Measuring Human-CLIP Alignment at Different Abstraction Levels

☆Measuring Mechanistic Interpretability at Scale Without Humans

☆MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

☆Modality-Agnostic fMRI Decoding of Vision and Language

☆On convex decision regions in deep network representations

☆On the universality of neural encodings in CNNs

☆ReAlnet: Achieving More Human Brain-Like Vision via Human Neural Representational Alignment

☆Removing High Frequency Information Improves DNN Behavioral Alignment

☆Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the Brain

☆Self-supervised learning facilitates neural representation structures that can be unsupervisedly aligned to human behaviors

☆Simplicity in Complexity

☆Symbolic Variables in Distributed Networks that Count

☆TEMPERATURE-SCALING SURPRISAL ESTIMATES IMPROVE FIT TO HUMAN READING TIMES – BUT DOES IT DO SO FOR THE “RIGHT REASONS”?

☆Texture bias in primate ventral visual cortex

☆The benefits of Incorporating Shape Priors in Contrastive Learning

☆The Curious Case of Representational Alignment: Unravelling Visio-Linguistic Tasks in Emergent Communication

☆The impact of task structure, representational geometry, and learning mechanism on compositional generalization

☆The role of shared labels and experiences in representational alignment

☆Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting

☆Towards neural foundation models for vision: Aligning EEG, MEG and fMRI representations to perform decoding, encoding and modality conversion

☆Unsupervised alignment reveals structural commonalities and differences in neural representations of natural scenes across individuals and brain areas

☆Unveiling the Dynamics of Transfer Learning Representations

☆What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes

☆Wild Comparisons: A Study of how Representation Similarity Changes when Input Data is Drawn from a Shifted Distribution

A case for sparse positive alignment of neural systems

An Analysis of Human Alignment of Latent Diffusion Models

Beyond Sight: Probing Alignment Between Image Models and Blind V1

Biased Causal Strength Judgments in Humans and Large Language Models

Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex

Can Foundation Models Smell Like Humans?

Can Generative Multimodal Models Count to Ten?

Categories vs Semantic Features: What shape the similarities people discern in photographs of objects?

Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction

Comparing supervised learning dynamics: Deep neural networks match human data efficiency but show a generalisation lag

Context-Sensitive Semantic Reasoning in Large Language Models

Correcting Biased Centered Kernel Alignment Measures in Biological and Artificial Neural Networks

Differentiable Optimization of Similarity Scores Between Models and Brains

Disentangling Recurrent Neural Dynamics with Stochastic Representational Geometry

Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration

Enriching ConvNets with pre-cortical processing enhances alignment with human brain responses

Explaining Human Comparisons using Alignment-Importance Heatmaps

Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

How aligned are different alignment metrics?

Human and Deep Neural Network Alignment in Navigational Affordance Perception

Human-like Geometric Abstraction in Large Pre-Trained Neural Networks

Humans diverge from language models when predicting spoken language

Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling

Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

Inferring DNN-Brain Alignment using Representational Similarity Analyses can be Problematic

Inter-animal transforms as a guide to model-brain comparison

Is my "red" your "red"?: Unsupervised alignment of qualia structures via optimal transport

Koopman Operator Based Dynamical Similarity Analysis for Data-driven Quantification of Distance between Dynamics

Learning and Aligning Structured Random Feature Networks

Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning

Less is More: Discovering Concise Network Explanations

Lessons learned in the study of representational alignment in physical reasoning

Measuring Human-CLIP Alignment at Different Abstraction Levels

Measuring Mechanistic Interpretability at Scale Without Humans

MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

Modality-Agnostic fMRI Decoding of Vision and Language

On convex decision regions in deep network representations

On the universality of neural encodings in CNNs

ReAlnet: Achieving More Human Brain-Like Vision via Human Neural Representational Alignment

Removing High Frequency Information Improves DNN Behavioral Alignment

Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the Brain

Self-supervised learning facilitates neural representation structures that can be unsupervisedly aligned to human behaviors

Simplicity in Complexity

Symbolic Variables in Distributed Networks that Count

TEMPERATURE-SCALING SURPRISAL ESTIMATES IMPROVE FIT TO HUMAN READING TIMES – BUT DOES IT DO SO FOR THE “RIGHT REASONS”?

Texture bias in primate ventral visual cortex

The benefits of Incorporating Shape Priors in Contrastive Learning

The Curious Case of Representational Alignment: Unravelling Visio-Linguistic Tasks in Emergent Communication

The impact of task structure, representational geometry, and learning mechanism on compositional generalization

The role of shared labels and experiences in representational alignment

Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting

Towards neural foundation models for vision: Aligning EEG, MEG and fMRI representations to perform decoding, encoding and modality conversion

Unsupervised alignment reveals structural commonalities and differences in neural representations of natural scenes across individuals and brain areas

Unveiling the Dynamics of Transfer Learning Representations

What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes

Wild Comparisons: A Study of how Representation Similarity Changes when Input Data is Drawn from a Shifted Distribution