ICLR 2025PastSafety & alignment

Second Workshop on Representational Alignment at ICLR 2025

ICLR 2025 Re-Align Workshop

Official website ↗OpenReview venue ↗See all ICLR workshops →✎ Edit this entry

Submission deadline: Feb 6, 2025, 12:30 UTC
imported from OpenReview — check the website for extensions
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (38)

Fetched from OpenReview (v2) on 2026-06-10.

Aligning LLMs with Domain Invariant Reward Models
David Wu, Sanjiban Choudhury · PDF
AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding
Ahmed Masry, Juan A. Rodriguez, Tianyu Zhang, Suyuchen Wang, Chao Wang, Aarash Feizi, Akshay Kalkunte Suresh, Abhay Puri, Xiangru Jian, Pierre-Andre Noel, Sathwik Tejaswi Madhusudhan, Marco Pedersoli, Bang Liu, Nicolas Chapados, Yoshua Bengio, Enamul Hoque, Christopher Pal, Issam H. Laradji, David Vazquez, Perouz Taslakian, Spandana Gella, Sai Rajeswar · PDF
Augmenting X-ray Astronomical Representations with Scientific Knowledge through Contrastive Learning
Juan Rafael Martínez-Galarza, Nicolò Oreste Pinciroli Vago, Shivam Raval, Carolina Cuesta-Lazaro, Melanie Weber, David Alvarez-Melis, Alberto Accomazzi, Cecilia Garraffo, Joshua Knutson, Ryan Thill, Christopher B. Green, Imantha Ahangama · PDF
Beyond Adversarial Robustness: Breaking the Robustness-Alignment Trade-off in Object Recognition
Pinyuan Feng, Drew Linsley, Thibaut Boissin, Alekh Karkada Ashok, Thomas Fel, Stephanie Olaiya, Thomas Serre · PDF
Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers
Johanna Vielhaben, Dilyara Bareeva, Jim Berend, Wojciech Samek, Nils Strodthoff · PDF
Brain-like slot representation for sequence working memory in recurrent neural networks
Mingye Wang, Stefano Fusi, Kim Stachenfeld · PDF
Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments
Lorenz Linhardt, Tom Neuhäuser, Lenka Tětková, Oliver Eberle · PDF
Closing The Modality Gap Enables Novel Multimodal Learning Applications
Eleonora Grassucci, Giordano Cicchetti, Danilo Comminiello · PDF
Cognitive Neural Architecture Search Reveals Hierarchical Entailment
Lukas Kuhn, sari sadiya, Gemma Roig · PDF
Complexity in Complexity: Understanding Visual Complexity Through Structure, Color, and Surprise
Karahan Sarıtaş, Peter Dayan, Kevin Shen, Surabhi S Nath · PDF
Computer Graphics from a Neuroscientist's perspective
Shreya Kapoor, Bernhard Egger · PDF
Conjuring Semantic Similarity
Tian Yu Liu, Stefano Soatto · PDF
Contrastive Representations for Combinatorial Reasoning
Alicja Ziarko, Michał Bortkiewicz, Michał Zawalski, Benjamin Eysenbach, Piotr Miłoś · PDF
Cross-Modal Alignment Regularization: Enhancing Language Models with Vision Model Representations
Yulu Gan, Kaiya Ivy Zhao, Phillip Isola · PDF
Do Large Language Models Perceive Orderly Number Concepts as Humans?
Xuanjie Liu, Cong Zeng, Shengkun Tang, Ziyu Wang, zhiqiang xu, Gus Xia · PDF
Dual-Pathway Neural Networks: Harnessing Scene and Object Pathways for Enhanced Visual Understanding
Fahad Sarfraz, Bahram Zonooz, Elahe Arani · PDF
Exploring Geometric Representational Alignment through Ollivier Ricci curvature and Ricci Flow
Nahid Torbati, Michael Gaebler, Simon M. Hofmann, Nico Scherf · PDF
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
Simon Park, Abhishek Panigrahi, Yun Cheng, Dingli Yu, Anirudh Goyal, Sanjeev Arora · PDF
Investigating the Role of Representation Switching Costs in Goal Persistence Bias
Gaia Molinaro, Aly Lidayan, Anne Collins · PDF
Kernel Alignment using Manifold Approximation
Mohammad Tariqul Islam, Du Liu, Deblina Sarkar · PDF
Linking Neural Representations To Adaptive Behavior With Cognitive Modeling
Christina Maher, Salman Qasim, Lizbeth Nunez Martinez, Angela Radulescu, Ignacio Saez · PDF
Model Alignment Search
Satchel Grant · PDF
Model alignment using inter-modal bridges
Ali Gholamzadeh, Noor Sajid · PDF
Model Connectomes: A Generational Approach to Data-Efficient Language Models
Klemen Kotar, Greta Tuckute · PDF
Modularity is the Bedrock of Natural and Artificial Intelligence
Alessandro Salatiello · PDF
Partial Alignment of Representations via Interventional Consistency
Felix Leeb, Satoshi Hayakawa, Yuhta Takida, Yuki Mitsufuji · PDF
Place Field Representation Learning During Policy Learning
M Ganesh Kumar, Blake Bordelon, Jacob A Zavatone-Veth, Cengiz Pehlevan · PDF
Representation-alignment in Theory-of-Mind tasks across Language Models and Agents
Rohini Elora Das, Krissh Bhargava, Krishna Shinde, Rajarshi Das · PDF
REPRESENTATIONAL ALIGNMENT OF GLOMERULI ACTIVATION IN MURINE OLFACTORY BULB
Vivek kumar Agarwal, Julia Manasson, Mario H. Garrido-Czacki, Ilia Sucholutsky · PDF
Revisiting the Relation Between Robustness and Universality
Max Klabunde, Laura Caspari, Florian Lemmerich · PDF
Shared Global and Local Geometry of Language Model Embeddings
Andrew Lee, Fernanda Viégas, Martin Wattenberg · PDF
The Effect of Representational Compression on Flexibility Across Learning in Humans and Artificial Neural Networks
Mia Whitefield, Christopher Summerfield · PDF
The in-context inductive biases of vision-language models differ across modalities
Kelsey R Allen, Ishita Dasgupta, Eliza Kosoy, Andrew Kyle Lampinen · PDF
The Spotlight Resonance Method: Resolving The Alignment of Embedded Activations
George Bird · PDF
Traveling Waves Integrate Spatial Information Into Spectral Representations
Mozes Jacobs, Roberto C. Budzinski, Lyle Muller, Demba E. Ba, T. Anderson Keller · PDF
Understanding task representations in neural networks via Bayesian ablation
Andrew Joohun Nam, Declan Iain Campbell, Thomas L. Griffiths, Jonathan D. Cohen, Sarah-Jane Leslie · PDF
Understanding the Emergence of Multimodal Representation Alignment
Megan Tjandrasuwita, Chanakya Ekbote, Liu Ziyin, Paul Pu Liang · PDF
Unsupervised Neuronal Matching with Spontaneous Neuronal Activity
Shunsuke Kamiya, Taiga Mitamura, Muneki Ikeda, Masafumi Oizumi · PDF

Accepted papers (38)

☆Aligning LLMs with Domain Invariant Reward Models

☆AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

☆Augmenting X-ray Astronomical Representations with Scientific Knowledge through Contrastive Learning

☆Beyond Adversarial Robustness: Breaking the Robustness-Alignment Trade-off in Object Recognition

☆Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers

☆Brain-like slot representation for sequence working memory in recurrent neural networks

☆Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments

☆Closing The Modality Gap Enables Novel Multimodal Learning Applications

☆Cognitive Neural Architecture Search Reveals Hierarchical Entailment

☆Complexity in Complexity: Understanding Visual Complexity Through Structure, Color, and Surprise

☆Computer Graphics from a Neuroscientist's perspective

☆Conjuring Semantic Similarity

☆Contrastive Representations for Combinatorial Reasoning

☆Cross-Modal Alignment Regularization: Enhancing Language Models with Vision Model Representations

☆Do Large Language Models Perceive Orderly Number Concepts as Humans?

☆Dual-Pathway Neural Networks: Harnessing Scene and Object Pathways for Enhanced Visual Understanding

☆Exploring Geometric Representational Alignment through Ollivier Ricci curvature and Ricci Flow

☆Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

☆Investigating the Role of Representation Switching Costs in Goal Persistence Bias

☆Kernel Alignment using Manifold Approximation

☆Linking Neural Representations To Adaptive Behavior With Cognitive Modeling

☆Model Alignment Search

☆Model alignment using inter-modal bridges

☆Model Connectomes: A Generational Approach to Data-Efficient Language Models

☆Modularity is the Bedrock of Natural and Artificial Intelligence

☆Partial Alignment of Representations via Interventional Consistency

☆Place Field Representation Learning During Policy Learning

☆Representation-alignment in Theory-of-Mind tasks across Language Models and Agents

☆REPRESENTATIONAL ALIGNMENT OF GLOMERULI ACTIVATION IN MURINE OLFACTORY BULB

☆Revisiting the Relation Between Robustness and Universality

☆Shared Global and Local Geometry of Language Model Embeddings

☆The Effect of Representational Compression on Flexibility Across Learning in Humans and Artificial Neural Networks

☆The in-context inductive biases of vision-language models differ across modalities

☆The Spotlight Resonance Method: Resolving The Alignment of Embedded Activations

☆Traveling Waves Integrate Spatial Information Into Spectral Representations

☆Understanding task representations in neural networks via Bayesian ablation

☆Understanding the Emergence of Multimodal Representation Alignment

☆Unsupervised Neuronal Matching with Spontaneous Neuronal Activity

Aligning LLMs with Domain Invariant Reward Models

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Augmenting X-ray Astronomical Representations with Scientific Knowledge through Contrastive Learning

Beyond Adversarial Robustness: Breaking the Robustness-Alignment Trade-off in Object Recognition

Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers

Brain-like slot representation for sequence working memory in recurrent neural networks

Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments

Closing The Modality Gap Enables Novel Multimodal Learning Applications

Cognitive Neural Architecture Search Reveals Hierarchical Entailment

Complexity in Complexity: Understanding Visual Complexity Through Structure, Color, and Surprise

Computer Graphics from a Neuroscientist's perspective

Conjuring Semantic Similarity

Contrastive Representations for Combinatorial Reasoning

Cross-Modal Alignment Regularization: Enhancing Language Models with Vision Model Representations

Do Large Language Models Perceive Orderly Number Concepts as Humans?

Dual-Pathway Neural Networks: Harnessing Scene and Object Pathways for Enhanced Visual Understanding

Exploring Geometric Representational Alignment through Ollivier Ricci curvature and Ricci Flow

Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?

Investigating the Role of Representation Switching Costs in Goal Persistence Bias

Kernel Alignment using Manifold Approximation

Linking Neural Representations To Adaptive Behavior With Cognitive Modeling

Model Alignment Search

Model alignment using inter-modal bridges

Model Connectomes: A Generational Approach to Data-Efficient Language Models

Modularity is the Bedrock of Natural and Artificial Intelligence

Partial Alignment of Representations via Interventional Consistency

Place Field Representation Learning During Policy Learning

Representation-alignment in Theory-of-Mind tasks across Language Models and Agents

REPRESENTATIONAL ALIGNMENT OF GLOMERULI ACTIVATION IN MURINE OLFACTORY BULB

Revisiting the Relation Between Robustness and Universality

Shared Global and Local Geometry of Language Model Embeddings

The Effect of Representational Compression on Flexibility Across Learning in Humans and Artificial Neural Networks

The in-context inductive biases of vision-language models differ across modalities

The Spotlight Resonance Method: Resolving The Alignment of Embedded Activations

Traveling Waves Integrate Spatial Information Into Spectral Representations

Understanding task representations in neural networks via Bayesian ablation

Understanding the Emergence of Multimodal Representation Alignment

Unsupervised Neuronal Matching with Spontaneous Neuronal Activity