Second Workshop on Representational Alignment at ICLR 2025

ICLR 2025 Re-Align Workshop

Submission deadline: Feb 6, 2025, 12:30 UTC (imported from OpenReview; check the website for extensions)
Submission portal: OpenReview
Notes: Auto-imported from the OpenReview venue record on 2026-06-10; please verify and enrich (topics are keyword-guessed).

Accepted papers (38)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Aligning LLMs with Domain Invariant Reward Models

    David Wu, Sanjiban Choudhury · PDF
  2. AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

    Ahmed Masry, Juan A. Rodriguez, Tianyu Zhang, Suyuchen Wang, Chao Wang, Aarash Feizi, Akshay Kalkunte Suresh, Abhay Puri, Xiangru Jian, Pierre-Andre Noel, Sathwik Tejaswi Madhusudhan, Marco Pedersoli, Bang Liu, Nicolas Chapados, Yoshua Bengio, Enamul Hoque, Christopher Pal, Issam H. Laradji, David Vazquez, Perouz Taslakian, Spandana Gella, Sai Rajeswar · PDF
  3. Augmenting X-ray Astronomical Representations with Scientific Knowledge through Contrastive Learning

    Juan Rafael Martínez-Galarza, Nicolò Oreste Pinciroli Vago, Shivam Raval, Carolina Cuesta-Lazaro, Melanie Weber, David Alvarez-Melis, Alberto Accomazzi, Cecilia Garraffo, Joshua Knutson, Ryan Thill, Christopher B. Green, Imantha Ahangama · PDF
  4. Beyond Adversarial Robustness: Breaking the Robustness-Alignment Trade-off in Object Recognition

    Pinyuan Feng, Drew Linsley, Thibaut Boissin, Alekh Karkada Ashok, Thomas Fel, Stephanie Olaiya, Thomas Serre · PDF
  5. Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers

    Johanna Vielhaben, Dilyara Bareeva, Jim Berend, Wojciech Samek, Nils Strodthoff · PDF
  6. Brain-like slot representation for sequence working memory in recurrent neural networks

    Mingye Wang, Stefano Fusi, Kim Stachenfeld · PDF
  7. Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments

    Lorenz Linhardt, Tom Neuhäuser, Lenka Tětková, Oliver Eberle · PDF
  8. Closing The Modality Gap Enables Novel Multimodal Learning Applications

    Eleonora Grassucci, Giordano Cicchetti, Danilo Comminiello · PDF
  9. Cognitive Neural Architecture Search Reveals Hierarchical Entailment

    Lukas Kuhn, Sari Sadiya, Gemma Roig · PDF
  10. Complexity in Complexity: Understanding Visual Complexity Through Structure, Color, and Surprise

    Karahan Sarıtaş, Peter Dayan, Kevin Shen, Surabhi S Nath · PDF
  11. Computer Graphics from a Neuroscientist's perspective

    Shreya Kapoor, Bernhard Egger · PDF
  12. Conjuring Semantic Similarity

    Tian Yu Liu, Stefano Soatto · PDF
  13. Contrastive Representations for Combinatorial Reasoning

    Alicja Ziarko, Michał Bortkiewicz, Michał Zawalski, Benjamin Eysenbach, Piotr Miłoś · PDF
  14. Cross-Modal Alignment Regularization: Enhancing Language Models with Vision Model Representations

    Yulu Gan, Kaiya Ivy Zhao, Phillip Isola · PDF
  15. Do Large Language Models Perceive Orderly Number Concepts as Humans?

    Xuanjie Liu, Cong Zeng, Shengkun Tang, Ziyu Wang, Zhiqiang Xu, Gus Xia · PDF
  16. Dual-Pathway Neural Networks: Harnessing Scene and Object Pathways for Enhanced Visual Understanding

    Fahad Sarfraz, Bahram Zonooz, Elahe Arani · PDF
  17. Exploring Geometric Representational Alignment through Ollivier Ricci curvature and Ricci Flow

    Nahid Torbati, Michael Gaebler, Simon M. Hofmann, Nico Scherf · PDF
  18. Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

    Simon Park, Abhishek Panigrahi, Yun Cheng, Dingli Yu, Anirudh Goyal, Sanjeev Arora · PDF
  19. Investigating the Role of Representation Switching Costs in Goal Persistence Bias

    Gaia Molinaro, Aly Lidayan, Anne Collins · PDF
  20. Kernel Alignment using Manifold Approximation

    Mohammad Tariqul Islam, Du Liu, Deblina Sarkar · PDF
  21. Linking Neural Representations To Adaptive Behavior With Cognitive Modeling

    Christina Maher, Salman Qasim, Lizbeth Nunez Martinez, Angela Radulescu, Ignacio Saez · PDF
  22. Model Alignment Search

    Satchel Grant · PDF
  23. Model alignment using inter-modal bridges

    Ali Gholamzadeh, Noor Sajid · PDF
  24. Model Connectomes: A Generational Approach to Data-Efficient Language Models

    Klemen Kotar, Greta Tuckute · PDF
  25. Modularity is the Bedrock of Natural and Artificial Intelligence

    Alessandro Salatiello · PDF
  26. Partial Alignment of Representations via Interventional Consistency

    Felix Leeb, Satoshi Hayakawa, Yuhta Takida, Yuki Mitsufuji · PDF
  27. Place Field Representation Learning During Policy Learning

    M Ganesh Kumar, Blake Bordelon, Jacob A Zavatone-Veth, Cengiz Pehlevan · PDF
  28. Representation-alignment in Theory-of-Mind tasks across Language Models and Agents

    Rohini Elora Das, Krissh Bhargava, Krishna Shinde, Rajarshi Das · PDF
  29. Representational Alignment of Glomeruli Activation in Murine Olfactory Bulb

    Vivek Kumar Agarwal, Julia Manasson, Mario H. Garrido-Czacki, Ilia Sucholutsky · PDF
  30. Revisiting the Relation Between Robustness and Universality

    Max Klabunde, Laura Caspari, Florian Lemmerich · PDF
  31. Shared Global and Local Geometry of Language Model Embeddings

    Andrew Lee, Fernanda Viégas, Martin Wattenberg · PDF
  32. The Effect of Representational Compression on Flexibility Across Learning in Humans and Artificial Neural Networks

    Mia Whitefield, Christopher Summerfield · PDF
  33. The in-context inductive biases of vision-language models differ across modalities

    Kelsey R Allen, Ishita Dasgupta, Eliza Kosoy, Andrew Kyle Lampinen · PDF
  34. The Spotlight Resonance Method: Resolving The Alignment of Embedded Activations

    George Bird · PDF
  35. Traveling Waves Integrate Spatial Information Into Spectral Representations

    Mozes Jacobs, Roberto C. Budzinski, Lyle Muller, Demba E. Ba, T. Anderson Keller · PDF
  36. Understanding task representations in neural networks via Bayesian ablation

    Andrew Joohun Nam, Declan Iain Campbell, Thomas L. Griffiths, Jonathan D. Cohen, Sarah-Jane Leslie · PDF
  37. Understanding the Emergence of Multimodal Representation Alignment

    Megan Tjandrasuwita, Chanakya Ekbote, Liu Ziyin, Paul Pu Liang · PDF
  38. Unsupervised Neuronal Matching with Spontaneous Neuronal Activity

    Shunsuke Kamiya, Taiga Mitamura, Muneki Ikeda, Masafumi Oizumi · PDF