ICML 2026 · Past · Agents · Safety & alignment · Interpretability

2nd Workshop on Compositional Learning: Safety, Interpretability, and Agents

CompLearn 2026

Submission deadline: May 7, 2026, 23:59 AoE (UTC−12), from the workshop website
Notification: May 22, 2026
Submission portal: OpenReview
Notes: Deadline added from the workshop website. Auto-imported from the OpenReview venue record on 2026-06-10; please verify and enrich (topics are keyword-guessed).

Accepted papers (135)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Compositional Calculus for Semantic Synergy in Language Model Embeddings

    Abel Jansma · PDF
  2. A mathematical theory of balancing relational generalization and memorization

    Luke Cheng, Samuel Lippl
  3. A Theory of Atomic Features and Four Testable Predictions

    Kenny Peng, Jon Kleinberg, Nikhil Garg
  4. Actionable Interpretability Must Be Defined in Terms of Symmetries: A Compositional Probabilistic Approach

    Pietro Barbiero, Mateo Espinosa Zarlenga, Francesco Giannini, Alberto Termine, Filippo Bonchi, Mateja Jamnik, Giuseppe Marra
  5. Adaptive Minds: Empowering Agents with LoRA-as-Tools

    Pavan C Shekar, Aswanth Krishnan · PDF
  6. Adaptive Recurrence as Algorithmic Time for Length Generalization in Addition

    Imran Ibrahimli, Stefan Wermter, Jae Hee Lee · PDF
  7. Additive Relational Bindings in Transformers: What Sparse Autoencoders Miss

    Sebastian Hönig, Kushal Jain, Su Ji Park, Bart Bussmann, Patrick Leask
  8. Ask, Don’t Judge: Binary Questions for Interpretable LLM Evaluation and Self-Improvement

    Sangwoo Cho, Kushal Chawla, Pengshan Cai, Zefang Liu, Chenyang Zhu, Shi-Xiong Zhang, Sambit Sahu
  9. Atomic Chess Reveals Compositional Reasoning Failures in LLMs

    Ryan Co, Karthik Reddy Konuganti
  10. Attractor Inversion: A Geometric Account of Adversarial Manipulation in Human Decision-Making

    Leo Lorence George, Anushri Iyer, Abhishek Bakshi, Pavan Kulkarni
  11. Beyond Safe Data: Pretraining-Stage Alignment with Regular Safety Reflection

    Jinhan Li, Kexian Tang, Yihan Xu, Zhuorui Ye, Kaifeng Lyu
  12. Biregular Sparse Initialization Shifts the Rate and Shape of Compositional Escape in Sequential Arithmetic Curricula

    Clément Castellon, Arindam Biswas
  13. CaMeLs Can Use Computers Too: System-level Security for Computer Use Agents

    Hanna Foerster, Tom Blanchard, Kristina Nikolić, Ilia Shumailov, Cheng Zhang, Robert D. Mullins, Nicolas Papernot, Florian Tramèr, Yiren Zhao
  14. Causal Cartographer: From Mapping to Reasoning Over Counterfactual Worlds

    Gael Gendron, Joze M. Rozanec, Michael J. Witbrock, Gillian Dobbie
  15. Causal-JEPA: Learning World Models through Object-Level Latent Masking

    Heejeong Nam, Quentin Le Lidec, Lucas Maes, Yann LeCun, Randall Balestriero
  16. CB-Orchestrator: Adaptive Workflow Optimization for LLM Agents via Contextual Bandits

    Jiahang Sun, Zhiwei Shang, Zhipiao Liu, Hongwei Yang, XIE GUOQING, Shuang Qiu, Zhongxiang Dai
  17. Chain-of-Thought Gradient Descent

    Hong-Yu Chen, Venkat Sripad Ganti, Jerry Yao-Chieh Hu, Hude Liu, Han Liu
  18. Circuit Modularity Predicts Compositional Generalization: Theory and Evidence from Transformers

    Kaustubh S. Bukkapatnam, Siddharth Karuturi
  19. Circuit Oracle: Automating Attribution Graph Analysis via Natural-Language Queries

    Hong Kiat Tan, Shariar Kabir, Swastik Agrawal, Sai V R Chereddy, Sriram Balasubramanian
  20. ClinSeekAgent: Automating Multi-modal Evidence Seeking for Agentic Clinical Reasoning

    Juncheng Wu, Letian Zhang, Yuhan Wang, Haoqin Tu, Hardy Chen, Zijun Wang, Cihang Xie, Yuyin Zhou · PDF
  21. CLIP Models Generalize Less Than Compositional Benchmarks Suggest

    Shuman Peng, Arnas Uselis, Darina Koishigarina, Martin Ester, Seong Joon Oh
  22. CMAG: Concept-Scaffolded Retrieval for Marketplace Avatar Generation

    Rajeev Goel, Jason Ding, Phani Harish Wajjala, Pavan K. Turaga, Tejaswi Gowda, Krishna C. Garikipati
  23. Code-enabled language models can outperform reasoning models on diverse tasks

    Cedegao E. Zhang, Cédric Colas, Gabriel Poesia, Joshua B. Tenenbaum, Jacob Andreas
  24. COGITAO: A Procedural and Object-Centric Framework to Evaluate Compositional and Systematic Generalization

    Yassine Taoudi-Benchekroun, Klim Troyan, Pascal Josef Sager, Stefan Gerber, Lukas Tuggener, Thilo Stadelmann, Benjamin F Grewe · PDF
  25. CompFlow: Composing Velocity Fields for Multi-Condition Generation

    Luca Miglior, Vincenzo Gervasi, Davide Bacciu · PDF
  26. Compositional Adversarial Training for Robust Visual Watermarking

    Anirudh Satheesh, Michael-Andrei Panaitescu-Liess, Andrew Ye Xu, Georgios Milis, Heng Huang, Zikui Cai, Furong Huang
  27. Compositional Agentic Formulation Search for Open-Vocabulary Audio-Visual Event Localization

    Beomgwon Jo, Sunchan Park, Kyeongbo Kong
  28. Compositional by Design: Background-Invariant Representations via Linear Additivity in VLMs

    Youssef Zaazou, Mark Thomas
  29. Compositional Consistency-Guided Decoding for Three-Way Logical Question Answering

    Tianyi Huang, Ming Ren Hou, Jiaheng Su, Yutong Zhang, Ziling Zhang
  30. Compositional Evolutionary Probing of LLM Safety Alignment

    Ashish Baghel
  31. Compositional Failure in Audio-Visual LLMs: Late-Layer Prior Dominance Under Cross-modal Conflict

    Adarsh Sudheer, David Li, Omar El-Banna, Ishaan Kodarapu, Arjun Bahuguna, Vasu Sharma · PDF
  32. Compositional Investigation: Why Reasoning Enables Tool-Using Agents to Fix What They Diagnose

    Dhatri C, Tadisetty Sai Yashwanth
  33. Compositional Neuro-Symbolic Reasoning

    Anugyan Das, Omkar Ghugarkar, Vishvesh G Bhat, Asad Aali
  34. Compositional Self-Improvement

    Changho Shin, Daiwei Chen, John Cooper, Brenden Lake, Frederic Sala, Ramya Korlakai Vinayak · PDF
  35. Compositional Skill Acquisition in Agentic Pipelines via Reinforcement Learning and Knowledge Distillation

    Akshaykumar, Tadisetty Sai Yashwanth
  36. Compositional Skill Chaining and Policy Blending for Hard Exploration in the BRIO Labyrinth Game

    Young-Min Kim, Bo-Yeong Kang
  37. Compositional Skill Execution in LLM Multi-Agent Systems: A Comparative Study of Collaboration Architectures for Long-Horizon Tasks

    Mihyang Kim · PDF
  38. Compositional Underdetermination in AI Agents: When Behavioral Success Is Not Compositional Evidence

    Aviral Srivastava, Sourav Panda
  39. Concepts in Motion: Temporal Concept Bottleneck Model for Interpretable Video Classification

    Patrick Knab, Sascha Marton, Philipp Johannes Schubert, Drago Andres Guggiana Nilo, Christian Bartelt · PDF
  40. Count Me If You Can: Geometric Failure Modes in Language Model Counting

    Nicholas Bai, Ayushi Mehrotra
  41. CUA-Skill: Developing Computer Using Agents with a Skill Framework

    Tianyi Chen, Yinheng Li, Michael Solodko, Sen Wang, Nan Jiang, Junheng Hao, Tingyuan Cui, Jongwoo Ko, Sara Abdali, Suzhen Zheng, Pashmina Cameron, Justin Wagle, Kazuhito Koishida · PDF
  42. Dimensionality Controls When Modularity Helps in Continual Learning

    Kathrin Korte, Christian Medeiros Adriano, Joachim Winther Pedersen, Eleni Nisioti, Sebastian Risi
  43. Direction-Conditioned Policies via Compositional Subgoal Scoring for Online Goal-Conditioned Reinforcement Learning

    Swaminathan S K, Damiya Gondha, Theyanesh Eswaramoorthy Rajahkrishnan, Aritra Hazra
  44. Dissociating Decodability and Causal Use in Bracket-Sequence Transformers

    Aryan Sharma, Cutter Dawes, Shivam Raval
  45. Do Thinking Tokens Help with Safety?

    Narutatsu Ri, Abhishek Panigrahi, Sanjeev Arora
  46. Don't Trust Stubborn Neighbors: A Security Framework for Agentic Networks

    Samira Abedini, Sina Mavali, Lea Schönherr, Martin Pawelczyk, Rebekka Burkholz
  47. DPMI: A Principled Index for Neural Polysemanticity via Dirichlet Process Mixture Modeling

    Manan Gupta, Dhruv Kumar
  48. Dual-Resolution Recursive Energy: Certified Contract–Expand Inference for Sequential Decision Making

    Haozhou Gao, ZENG JIARUI, Wendi Ren, Yanwen Liu, Shuang Li
  49. Emergent Compositional Skills in Mixture-of-Experts VLAs

    Shlok Shah, Rhiaan Jhaveri, Tharun Kumar Tiruppali Kalidoss, Chirayu Nimonkar, Ishaan Javali, Dhruv Shah
  50. Emergent Social Intelligence Risks in Generative Multi-Agent Systems

    Yue Huang, Yu Jiang, Wenjie Wang, Haomin Zhuang, Xiaonan Luo, Yuchen Ma, Zhangchen Xu, Zichen Chen, Nuno Moniz, Zinan Lin, Pin-Yu Chen, Nitesh V. Chawla, Nouha Dziri, Huan Sun, Xiangliang Zhang · PDF
  51. Entropy-Aware GUI Grounding: From Failure Analysis to Improved Localization

    Chengxin Liu, Moon Ye-Bin, Tae-Hyun Oh
  52. Evolution of Cooperation in LLM Societies: A Multi-Lingual Examination

    Kriti Mahajan
  53. Evolutionary System Prompt Learning for Reinforcement Learning in LLMs

    Lunjun Zhang, Ryan Chen, Bradly C. Stadie
  54. Explaining is Harder Than Predicting Alone: Evaluating Concept-based Explanations of MLLMs as ICL Visual Classifiers

    Carmen Quiles Ramírez, Leticia Lorena Rodriguez, Nicolas Martorell, Natalia Díaz-Rodríguez
  55. Fixed-Point Reasoning: Stable and Adaptive Deep Looped Models

    Sajad Movahedi, Shlomo Libo Feigin, Vera Milovanović, Alexander Theus, Thomas Hofmann, Valentina Boeva, T. Konstantin Rusch, Antonio Orvieto
  56. FormalImG: Evaluating Structural Compositional Generalization for T2I Models

    Hong-Jie You, Jie-Jing Shao, Xiao-Wen Yang, Zhi-Fan Wu, Lin-Han Jia, Lan-Zhe Guo, Yu-Feng Li · PDF
  57. From Composition to Compositionality: Discovering Reusable Structure in Polyphonic Music Embeddings

    Zhijin Guo, Richard Freedman, Martha Lewis
  58. From Mechanistic to Compositional Interpretability

    Ward Gauderis, Thomas Dooms, Steven T. Homer, Kola Ayonrinde, Geraint A. Wiggins
  59. From Numbers to Narratives: Goal-Oriented Summarization of Machine Learning Model Differences

    Nam Hyeon-Woo, Tae-Hyun Oh, Zeynep Akata, Stephan Alaniz
  60. From Self-Preservation to Peer-Preservation: A Staged Framing of Preservation-Oriented Misalignment in Frontier Models

    Rundong Yang · PDF
  61. Fusion is the New Mutation: Bandit-Guided Evolution on Workflow Graphs

    Zhiwei Shang, Jiahang Sun, Mingrong Gong, Mingze Kong, Zikun Qu, Pingchen Lu, Junhao Dong, Zhipiao Liu, Hongwei Yang, XIE GUOQING, Yao Shu, Zhongxiang Dai
  62. Gating Enables Curvature: A Geometric Expressivity Gap in Attention

    Satwik Bathula, Anand Joshi
  63. Grad Detect: Gradient-Based Hallucination Detection in LLMs

    Anand Kamat, Daniel Blake, Brent M. Werness · PDF
  64. Hidden in Plain Sight: Benchmarking Agent Safety Against Decomposition Attacks with DeCompBench

    Vikhyath Kothamasu, Virginia Smith, Chhavi Yadav
  65. HINT: Task Demonstrations for Hierarchical Inference in Abstract Reasoning

    Nirlipta Pande, Georg Niess, Julian Gutheil, Roman Kern, Robert Legenstein, Robert Peharz
  66. How does RL Post-training Induce Skill Composition? A Case Study on Countdown

    Simon Park, Simran Kaur, Sanjeev Arora
  67. How Many Features Can a Language Model Store Under the Linear Representation Hypothesis?

    Nikhil Garg, Jon Kleinberg, Kenny Peng
  68. IGG: A Benchmark for Interactive GUI Grounding under Visibility Constraints

    Kyeong Seon Kim, Jiyeon Son, Tae-Hyun Oh
  69. Improving the Compositionality of Triplet-Based Neural Algorithmic Reasoners

    Stjepan Požgaj, Dobrik Georgiev Georgiev, Marin Šilić, Goran Delac, Klemo Vladimir
  70. In-Context Learning Amplifies a Latent Compositional Circuit

    Melissa Wessel
  71. Installing and Obstructing Heuristics: Learning Dynamics in Nim

    Leo Villani, Sultan Daniels, Ijin Yu, Anant Sahai
  72. Introspective Coupling: LMs Explain Themselves Better Than Training Targets

    Zifan Carl Guo, Laura Ruis, Jacob Andreas, Belinda Z. Li
  73. Irreducible Supervision Enables Compositional Generalization in Post-Training

    Ellen Ma, Nikhil Anand
  74. Language Elicits Emergent Symbol Processing in Vision Foundation Models

    Jung-Chun Liu, Naihao Deng, Joyce Chai
  75. Large Language Models Can Follow Instructions, But Not Many at Once: Phase Transitions in Compositional Constraint Satisfaction

    Mariya I. Vasileva
  76. Learning Compositional Tasks via Trigger Compositions: Using Scratchpads as Pre-Answer Workspaces

    Heejin Choi
  77. Learning to Theorize the World from Observation

    Doojin Baek, Gyubin Lee, Junyeob Baek, Hosung Lee, Sungjin Ahn
  78. Learning What’s Missing: Failure-Driven Skill Discovery via Predicate Bridges

    Yanwen Liu, Wendi Ren, Haozhou Gao, Shuang Li
  79. LGPro: Language-Guided Prototype Discovery for Compositional Zero-Shot Learning

    Anna-Alina Bondarets, Taras Rumezhak, Volodymyr Karpiv · PDF
  80. Logit Grafting: The Post-Training Delta is Sparse, Portable, and Powerful

    Apurv Verma, Binh-Nguyen Nguyen, Hai Phan, Lingxiao Wang
  81. MAVEN: Improving Generalization in Agentic Tool Calling

    Omkar Ghugarkar, Vishvesh G Bhat, Muhammad Ahmed Mohsin, Asad Aali
  82. Meaning Representations as Variational Quantum Circuits

    Tilen Gaetano Limbäck-Stokin, Tanishka Birdavade, Kin Ian Lo, Mehrnoosh Sadrzadeh
  83. Measuring the Limits of Continual Learning for LLMs

    Nimit Kalra, Narutatsu Ri, Zerzar Bukhari, Ang Li, Sanae Lotfi, Liam H Fowl, Micah Goldblum
  84. Mitigating Over-Personalization in Language Models via Structured Memory

    Hakeem Hannoon, Andrew Zhao, Mihir Narayan, Sharvin Goyal, Ivaxi Sheth
  85. MKEvolve: A Modular Multi-Agent Framework for Kernel Code Generation

    Jason Yoo, Rajarshi Saha, Tao Yu, Shaowei Zhu, Wei Tang, Youngsuk Park
  86. MoTVLA: A Vision-Language-Action Model with Unified Fast-Slow Reasoning

    Wenhui Huang, Changhe Chen, Han Qi, Chen Lv, Yilun Du, Heng Yang
  87. Multi-Agent Systems are Mixtures of Experts: Who Becomes an Influencer?

    Franka Bause, Jonas Niederle, Martin Pawelczyk, Rebekka Burkholz
  88. MultiVulnBench: A Large-Scale Benchmark for Count Bias in LLM-Based Multi-Vulnerability Detection

    Manan Gupta, Chinmay Pushkar, Sanchit Kabra, Dhruv Kumar, Jagat Sesh Challa
  89. Noise-Tolerant Verification of Compositional Boolean Recovery

    Pranay Jha
  90. Not Just RLHF: Why Alignment Alone Won't Fix Multi-Agent Sycophancy

    Adarsh Kumarappan, Ananya Mujoo
  91. Nouns, Not Modifiers: OpenVLA Parses Objects but Fails at Spatial Composition

    Jin Yoo
  92. On the Role of Learned Alignment Matrices in LatentMAS

    Spursh Deshpande, Wenhao Lu
  93. Operads for compositional reasoning in LLMs

    Nathaniel Bottman, Kyle Richardson
  94. Playing Devil’s Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy

    Ishaan Kelkar, Nebras Alam, Vikram Kakaria, Madhur Panwar, Vasu Sharma, Maheep Chaudhary
  95. Policy Transfer for Hierarchical Goal-Conditioned Reinforcement Learning

    Usman Islam, Zhixun Chen, Stefanos Leonardos, Matteo Leonetti, Yali Du
  96. Preference Instability in Reward Models: Detection and Mitigation via Sparse Autoencoders

    Shunchang Liu, Xin Chen, Belen Martin Urcelay, Francesco Croce
  97. Reasoning as State Transition: A Representational Analysis of Reasoning Evolution in Large Language Models

    Siyuan Zhang, Jialian Li, Yichi Zhang, Xiao Yang, Yinpeng Dong, Hang Su
  98. Reasoning Phases Are Continuous, Not Discrete: Evidence from Switching Linear Dynamical Systems Applied to Chain-of-Thought Residual Streams

    Manan Gupta, Dhruv Kumar
  99. Reasoning with Neologisms: Can Soft Tokens Learn Composable Reasoning Skills Without Forgetting?

    Antonin Berthon, Mihaela van der Schaar
  100. Reflection Anchors for Interpretable Compositional Visual Reasoning in Multimodal Reinforcement Learning

    Xuan Gong, Hanbo Huang, Hao Zheng, Yiran Zhang, Wenbin Dai, Weishu Zhao, Shiyu Liang · PDF
  101. Retrieval is Enough: Training-Free Interpretability with a Tool-Using Agent

    Sriram Balasubramanian, Soheil Feizi
  102. RL Post-Training Builds Compositional Reasoning Strategies

    Azwar Abdulsalam, Nishil Patel, Andrew M Saxe
  103. Safety Cost of Steering Vectors Is Separable and Reducible

    Yuxiao Li, Gjergji Kasneci
  104. Sample Complexity of Scientific Discovery: PAC Learnability of Compositional Function Trees

    Şuayp Talha Kocabay, Talha Rüzgar Akkuş, Kerem Yalçın
  105. Separable Representations of Task Complexity and Deliberation in Reasoning Language Models

    Xuan-Quang Nguyen, Hieu M. Vu, Dung Viet Nguyen, Hai Tuan Luu, Linh Duy Tran, Tan Minh Nguyen
  106. Sparse Autoencoders Find Causal, Lineage-Specific Context Features in Chromatin Foundation Models

    Nicole Ching, Ayushi Mehrotra
  107. Sparse Memory Finetuning as a Low-Forgetting Alternative to LoRA and Full Finetuning

    Prakhar Gupta, Garv Shah, Satyam Goyal, Anirudh Kanchi
  108. Spatial Compositional Counterfactuals in Concept Bottleneck Models

    Ran Eisenberg, Ofir Lindenbaum · PDF
  109. Spatially Stable GUI Grounding via Zoom Consistency Loss

    Moon Ye-Bin, Jiyeon Son, Tae-Hyun Oh
  110. Stop Probing, Start Coding: Why Linear Probes and Sparse Autoencoders Fail at Compositional Generalisation

    Vitória Barin-Pacela, Shruti Joshi, Isabela Camacho, Simon Lacoste-Julien, David Klindt
  111. Struct-to-Reason: Enhancing Video Understanding of Vision-Language Models by Decoupling Perception and Reasoning via Structured Summary

    Hengyu Liu, Chenxin Li, Wenbo Hu, Zhiqin Yang, Yuxin Chen, Ying Shan, Brandon Y. Feng
  112. Structure over Pixels: Learning Variable-Length Visual Programs

    Piotr Wyrwinski, Kacper Dobek, Krzysztof Krawiec
  113. Successor Re-grounding Audits Compositional Rollout Mismatch in Neuro-Symbolic Search

    Miroslav Lžičař
  114. TAME the BALROG: Task-Adaptive Modular Evolution framework for Game Agents

    Ola Aleksandra Pasieka, Dominika Woszczyk, Antoine Cully, Borja G. León
  115. The Compositional Generalization Gap in Named Entity Recognition: Static Benchmarks Overestimate Transferable Performance

    Varun Kotte
  116. The Spurious Composition Problem: Conditional Independence as a Necessary and Sufficient Condition for Systematic Generalization

    Siddharth Karuturi, Kaustubh S. Bukkapatnam, Soham Batra, Laksh Patel, Tanush Ajay Shastry
  117. The Theory and Practice of MAP Inference over Non-Convex Constraints

    Leander Kurscheidt, Gabriele Masina, Roberto Sebastiani, Antonio Vergari
  118. THEIA: Learning Complete Kleene Three-Valued Logic in a Pure-Neural Modular Architecture

    Augustus Haoyang Li
  119. Toward Compositional Latent Action Interfaces for Generalizable Agents

    Heejeong Nam, Chandradithya S Jonnalagadda, Harshit Aggarwal, Eric Xu
  120. Tracking Training Phases in Compositional Learning with Task-Agnostic Measures

    Niclas Dern, Selma Mazioud, Jakob Heiss, Avrajit Ghosh, Curtis James McDonald, Gabriel Clara, Bin Yu
  121. Universality, Composition Generalization, and Algorithm Emulation All In-Context

    Jerry Yao-Chieh Hu, Hong-Yu Chen, Po-Chiao Lin, Maojiang Su, Han Liu
  122. Unsafe Only in Combination: Interaction-Barrier Shielding for Tool-Using LLM Agents

    Rishabh Bhattacharya
  123. Unsupervised Decomposition with Recombination-Consistent Diffusion Models

    Archer Wang, Emile Timothy Anand, Yilun Du, Marin Soljacic
  124. VASAE: Naming SAE Dictionary Directions with Vocabulary-Aligned Anchoring

    Kairui Zhang, Ziwen Yu, Zahraa S. Abdallah, Martha Lewis
  125. Visual Counterfactual Explanations with Compositional Generative Models

    Daniil Kirilenko, Dario Fenoglio, Martin Gjoreski, Marc Langheinrich
  126. What Do Latent Agents Actually Represent? Interpreting Hidden-State Communication in Multi-Agent Systems

    Wenhao Lu, Spursh Deshpande
  127. What makes the whole? Probing Attribute-Level Compositionality in LLM Judges

    Savita Bhat, Vasudeva Varma
  128. When Do Diffusion Models learn to Generate Multiple Objects?

    Yujin Jeong, Arnas Uselis, Iro Laina, Seong Joon Oh, Anna Rohrbach
  129. When Do Multi-Agent Systems Outperform? Analysing the Learning Efficiency of Agentic Systems

    Junwei Su, Chuan Wu
  130. When Does Composition Compose? A PAC-Theoretic Framework for Compositional Faithfulness, Safety Certificates, and Training Dynamics

    Siddharth Karuturi, Kaustubh S. Bukkapatnam, Tanush Ajay Shastry
  131. When Does Disentanglement Enable Compositional Generalization? A Transfer Bound and Its Empirical Validation

    Rishi Ashish Shah, Shivaay Dhondiyal, Sarthak Pandey
  132. When to Re-Plan: Subgoal Persistence in Hierarchical Latent Reasoning

    Ayushi Chadha
  133. Where’s the Plan? Locating Latent Planning in Language Models with Lightweight Mechanistic Interventions

    Nicole H. Ma, Nick Rui · PDF
  134. Which Way Did It Move? Diagnosing and Overcoming Directional Motion Blindness in Video LLMs

    Jongseo Lee, Hyuntak Lee, Sunghun Kim, Sooa Kim, Jihoon Chung, Jinwoo Choi
  135. Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw

    Zijun Wang, Haoqin Tu, Letian Zhang, Hardy Chen, Juncheng Wu, Xiangyan Liu, Zhenlong Yuan, Tianyu Pang, Michael Qizhe Shieh, Fengze Liu, Zeyu Zheng, Huaxiu Yao, Yuyin Zhou, Cihang Xie