ICLR 2024 Past Safety & alignment
ICLR 2024 Workshop on Representational Alignment
ICLR 2024 Workshop Re-Align
- Submission deadline
- Feb 9, 2024, 11:59 UTC imported from OpenReview — check the website for extensions
- Submission portal
- OpenReview
- Notes
- Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).
Accepted papers (56)
Fetched from OpenReview (v2) on 2026-06-10.
-
A case for sparse positive alignment of neural systems
-
An Analysis of Human Alignment of Latent Diffusion Models
-
Beyond Sight: Probing Alignment Between Image Models and Blind V1
-
Biased Causal Strength Judgments in Humans and Large Language Models
-
Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex
-
Can Foundation Models Smell Like Humans?
-
Can Generative Multimodal Models Count to Ten?
-
Categories vs Semantic Features: What shape the similarities people discern in photographs of objects?
-
Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction
-
Comparing supervised learning dynamics: Deep neural networks match human data efficiency but show a generalisation lag
-
Context-Sensitive Semantic Reasoning in Large Language Models
-
Correcting Biased Centered Kernel Alignment Measures in Biological and Artificial Neural Networks
-
Differentiable Optimization of Similarity Scores Between Models and Brains
-
Disentangling Recurrent Neural Dynamics with Stochastic Representational Geometry
-
Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration
-
Enriching ConvNets with pre-cortical processing enhances alignment with human brain responses
-
Explaining Human Comparisons using Alignment-Importance Heatmaps
-
Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis
-
How aligned are different alignment metrics?
-
Human and Deep Neural Network Alignment in Navigational Affordance Perception
-
Human-like Geometric Abstraction in Large Pre-Trained Neural Networks
-
Humans diverge from language models when predicting spoken language
-
Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling
-
Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models
-
Inferring DNN-Brain Alignment using Representational Similarity Analyses can be Problematic
-
Inter-animal transforms as a guide to model-brain comparison
-
Is my "red" your "red"?: Unsupervised alignment of qualia structures via optimal transport
-
Koopman Operator Based Dynamical Similarity Analysis for Data-driven Quantification of Distance between Dynamics
-
Learning and Aligning Structured Random Feature Networks
-
Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning
-
Less is More: Discovering Concise Network Explanations
-
Lessons learned in the study of representational alignment in physical reasoning
-
Measuring Human-CLIP Alignment at Different Abstraction Levels
-
Measuring Mechanistic Interpretability at Scale Without Humans
-
MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data
-
Modality-Agnostic fMRI Decoding of Vision and Language
-
On convex decision regions in deep network representations
-
On the universality of neural encodings in CNNs
-
ReAlnet: Achieving More Human Brain-Like Vision via Human Neural Representational Alignment
-
Removing High Frequency Information Improves DNN Behavioral Alignment
-
Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the Brain
-
Self-supervised learning facilitates neural representation structures that can be unsupervisedly aligned to human behaviors
-
Simplicity in Complexity
-
Symbolic Variables in Distributed Networks that Count
-
TEMPERATURE-SCALING SURPRISAL ESTIMATES IMPROVE FIT TO HUMAN READING TIMES – BUT DOES IT DO SO FOR THE “RIGHT REASONS”?
-
Texture bias in primate ventral visual cortex
-
The benefits of Incorporating Shape Priors in Contrastive Learning
-
The Curious Case of Representational Alignment: Unravelling Visio-Linguistic Tasks in Emergent Communication
-
The impact of task structure, representational geometry, and learning mechanism on compositional generalization
-
The role of shared labels and experiences in representational alignment
-
Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting
-
Towards neural foundation models for vision: Aligning EEG, MEG and fMRI representations to perform decoding, encoding and modality conversion
-
Unsupervised alignment reveals structural commonalities and differences in neural representations of natural scenes across individuals and brain areas
-
Unveiling the Dynamics of Transfer Learning Representations
-
What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes
-
Wild Comparisons: A Study of how Representation Similarity Changes when Input Data is Drawn from a Shifted Distribution