ICLR 2024 Past Safety & alignment

ICLR 2024 Workshop on Representational Alignment

ICLR 2024 Workshop Re-Align

Submission deadline
Feb 9, 2024, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (56)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A case for sparse positive alignment of neural systems

    Jacob S. Prince, Colin Conwell, George A. Alvarez, Talia Konkle · PDF
  2. An Analysis of Human Alignment of Latent Diffusion Models

    Lorenz Linhardt, Marco Morik, Sidney Bender, Naima Elosegui Borras · PDF
  3. Beyond Sight: Probing Alignment Between Image Models and Blind V1

    Galen Pogoncheff, Jacob Granley, Alfonso Rodil, Leili Soo, Lily Marie Turkstra, Lucas Gil Nadolskis, Arantxa Alfaro Saez, Cristina Soto Sanchez, Eduardo Fernandez Jover, Michael Beyeler · PDF
  4. Biased Causal Strength Judgments in Humans and Large Language Models

    Anita Keshmirian, Moritz Willig, Babak Hemmatian, Ulrike Hahn, Kristian Kersting, Tobias Gerstenberg · PDF
  5. Brain-inspired $L_p$-Convolution benefits large kernels and aligns better with visual cortex

    Jea Kwon, Kyungwoo Song, C. Justin Lee · PDF
  6. Can Foundation Models Smell Like Humans?

    Farzaneh Taleb, Miguel Vasco, Nona Rajabi, Mårten Björkman, Danica Kragic · PDF
  7. Can Generative Multimodal Models Count to Ten?

    Sunayana Rane, Alexander Ku, Jason Michael Baldridge, Ian Tenney, Thomas L. Griffiths, Been Kim · PDF
  8. Categories vs Semantic Features: What shape the similarities people discern in photographs of objects?

    Siddharth Suresh, Wei-Chun Huang, Kushin Mukherjee, Timothy T. Rogers · PDF
  9. Comparing Abstraction in Humans and Large Language Models Using Multimodal Serial Reproduction

    Sreejan Kumar, Raja Marjieh, Byron Zhang, Declan Iain Campbell, Michael Y. Hu, Umang Bhatt, Brenden Lake, Thomas L. Griffiths · PDF
  10. Comparing supervised learning dynamics: Deep neural networks match human data efficiency but show a generalisation lag

    Lukas S. Huber, Fred W. Mast, Felix A. Wichmann · PDF
  11. Context-Sensitive Semantic Reasoning in Large Language Models

    Tyler Giallanza, Declan Iain Campbell · PDF
  12. Correcting Biased Centered Kernel Alignment Measures in Biological and Artificial Neural Networks

    Alex Graeme Murphy, Joel Zylberberg, Alona Fyshe · PDF
  13. Differentiable Optimization of Similarity Scores Between Models and Brains

    Nathan Cloos, Markus Siegel, Scott L. Brincat, Earl K. Miller, Christopher J Cueva · PDF
  14. Disentangling Recurrent Neural Dynamics with Stochastic Representational Geometry

    David Lipshutz, Amin Nejatbakhsh, Alex H Williams · PDF
  15. Does resistance to style-transfer equal Global Shape Bias? Measuring network sensitivity to global shape configuration

    Ziqi Wen, Tianqin Li, Zhi Jing, Tai Sing Lee · PDF
  16. Enriching ConvNets with pre-cortical processing enhances alignment with human brain responses

    Niklas Müller, H.Steven Scholte, Iris Groen · PDF
  17. Explaining Human Comparisons using Alignment-Importance Heatmaps

    Nhut Truong, Dario Pesenti, Uri Hasson · PDF
  18. Harmony in Diversity: Merging Neural Networks with Canonical Correlation Analysis

    Stefan Horoi, Albert Manuel Orozco Camacho, Eugene Belilovsky, Guy Wolf · PDF
  19. How aligned are different alignment metrics?

    Jannis Ahlert, Thomas Klein, Felix A. Wichmann, Robert Geirhos · PDF
  20. Human and Deep Neural Network Alignment in Navigational Affordance Perception

    Clemens Georg Bartnik, Iris Groen · PDF
  21. Human-like Geometric Abstraction in Large Pre-Trained Neural Networks

    Declan Iain Campbell, Sreejan Kumar, Tyler Giallanza, Jonathan D. Cohen, Thomas L. Griffiths · PDF
  22. Humans diverge from language models when predicting spoken language

    Thomas L. Botch, Emily S Finn · PDF
  23. Identifying and Interpreting Non-Aligned Human Conceptual Representations using Language Modeling

    Wanqian Bao, Uri Hasson · PDF
  24. Improving Intervention Efficacy via Concept Realignment in Concept Bottleneck Models

    Nishad Singhi, Jae Myung Kim, Karsten Roth, Zeynep Akata · PDF
  25. Inferring DNN-Brain Alignment using Representational Similarity Analyses can be Problematic

    Marin Dujmovic, Jeffrey Bowers, Federico Adolfi, Gaurav Malhotra · PDF
  26. Inter-animal transforms as a guide to model-brain comparison

    Imran Thobani, Javier Sagastuy-Brena, Aran Nayebi, Rosa Cao, Daniel LK Yamins · PDF
  27. Is my "red" your "red"?: Unsupervised alignment of qualia structures via optimal transport

    Genji Kawakita, Ariel Mikhael Zeleznikow-Johnston, Ken Takeda, Naotsugu Tsuchiya, Masafumi Oizumi · PDF
  28. Koopman Operator Based Dynamical Similarity Analysis for Data-driven Quantification of Distance between Dynamics

    Shunsuke Kamiya, Jun Kitazono, Masafumi Oizumi · PDF
  29. Learning and Aligning Structured Random Feature Networks

    Vivian White, Muawiz Sajjad Chaudhary, Guy Wolf, Guillaume Lajoie, Kameron Decker Harris · PDF
  30. Learning to Abstract Visuomotor Mappings using Meta-Reinforcement Learning

    Carlos A. Velazquez-Vargas, Isaac Ray Christian, Jordan Taylor, Sreejan Kumar · PDF
  31. Less is More: Discovering Concise Network Explanations

    Neehar Kondapaneni, Markus Marks, Oisin Mac Aodha, Pietro Perona · PDF
  32. Lessons learned in the study of representational alignment in physical reasoning

    Felix Jedidja Binder, Rahul Mysore Venkatesh, Daniel LK Yamins, Judith E Fan · PDF
  33. Measuring Human-CLIP Alignment at Different Abstraction Levels

    Pablo Hernández-Cámara, Jorge Vila-Tomás, Jesus Malo, Valero Laparra · PDF
  34. Measuring Mechanistic Interpretability at Scale Without Humans

    Roland S. Zimmermann, David A. Klindt, Wieland Brendel · PDF
  35. MindEye2: Shared-Subject Models Enable fMRI-To-Image With 1 Hour of Data

    Paul Steven Scotti, Mihir Tripathy, Cesar Torrico, Reese Kneeland, Tong Chen, Ashutosh Narang, Charan Santhirasegaran, Jonathan Xu, Thomas Naselaris, Kenneth A. Norman, Tanishq Mathew Abraham · PDF
  36. Modality-Agnostic fMRI Decoding of Vision and Language

    Mitja Nikolaus, Milad Mozafari, Nicholas Asher, Leila Reddy, Rufin VanRullen · PDF
  37. On convex decision regions in deep network representations

    Lenka Tětková, Thea Brüsch, Teresa Scheidt, Fabian Mager, Rasmus Aagaard, Jonathan Foldager, Tommy Sonne Alstrøm, Lars Kai Hansen · PDF
  38. On the universality of neural encodings in CNNs

    Florentin Guth, Brice Ménard · PDF
  39. ReAlnet: Achieving More Human Brain-Like Vision via Human Neural Representational Alignment

    Zitong Lu, Yile Wang, Julie Golomb · PDF
  40. Removing High Frequency Information Improves DNN Behavioral Alignment

    Max Wolff, Evgenia Rusak, Wieland Brendel · PDF
  41. Saliency Suppressed, Semantics Surfaced: Visual Transformations in Neural Networks and the Brain

    Gustaw Opielka, Jessica Loke, H.Steven Scholte · PDF
  42. Self-supervised learning facilitates neural representation structures that can be unsupervisedly aligned to human behaviors

    Soh Takahashi, Masaru Sasaki, Ken Takeda, Masafumi Oizumi · PDF
  43. Simplicity in Complexity

    Surabhi S Nath, Kevin Shen, Aenne Annelie Brielmann, Peter Dayan · PDF
  44. Symbolic Variables in Distributed Networks that Count

    Satchel Grant, Zhengxuan Wu, James Lloyd McClelland, Noah Goodman · PDF
  45. TEMPERATURE-SCALING SURPRISAL ESTIMATES IMPROVE FIT TO HUMAN READING TIMES – BUT DOES IT DO SO FOR THE “RIGHT REASONS”?

    Tong Liu, Iza Škrjanec, Vera Demberg · PDF
  46. Texture bias in primate ventral visual cortex

    Akshay Vivek Jagadeesh, Margaret Livingstone · PDF
  47. The benefits of Incorporating Shape Priors in Contrastive Learning

    Junru Zhao, Tianqin Li, Tai Sing Lee · PDF
  48. The Curious Case of Representational Alignment: Unravelling Visio-Linguistic Tasks in Emergent Communication

    Tom Kouwenhoven, Max Peeperkorn, Bram Van Dijk, Stephan Raaijmakers, Tessa Verhoef · PDF
  49. The impact of task structure, representational geometry, and learning mechanism on compositional generalization

    Samuel Lippl, Kim Stachenfeld · PDF
  50. The role of shared labels and experiences in representational alignment

    Kushin Mukherjee, Siddharth Suresh, Xizheng Yu, Gary Lupyan · PDF
  51. Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting

    Taha Osama A Binhuraib, Greta Tuckute, Nicholas Blauch · PDF
  52. Towards neural foundation models for vision: Aligning EEG, MEG and fMRI representations to perform decoding, encoding and modality conversion

    Matteo Ferrante, Tommaso Boccato, Nicola Toschi · PDF
  53. Unsupervised alignment reveals structural commonalities and differences in neural representations of natural scenes across individuals and brain areas

    Ken Takeda, Kota Abe, Jun Kitazono, Masafumi Oizumi · PDF
  54. Unveiling the Dynamics of Transfer Learning Representations

    Thomas Goerttler, Klaus Obermayer · PDF
  55. What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes

    Victor Lecomte, Kushal Thaman, Rylan Schaeffer, Naomi Bashkansky, Trevor Chow, Sanmi Koyejo · PDF
  56. Wild Comparisons: A Study of how Representation Similarity Changes when Input Data is Drawn from a Shifted Distribution

    Davis Brown, Madelyn Ruth Shapiro, Alyson Bittner, Jackson Warley, Henry Kvinge · PDF