ICLR 2026 · Past · Agents · Generative models

Workshop on Multi-Agent Learning and Its Opportunities in the Era of Generative AI

MALGAI

Submission deadline
Feb 11, 2026, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (57)

Fetched from OpenReview (v2) on 2026-06-10.

  1. AccelOpt: A Self-Improving LLM Agentic System for AI Accelerator Kernel Optimization

    Genghan Zhang, Shaowei Zhu, Anjiang Wei, Zhenyu Song, Allen Nie, Zhen Jia, Nandita Vijaykumar, Yida Wang, Kunle Olukotun · PDF
  2. AffectMind: Proactive Knowledge Grounding with Affective Multimodal Signals for Aligned Marketing Dialogue

    Xinyu Wang, Xiaomin Zhao, Yifei Kang, Zhihao Lin, Xiang Luo, Zhang Chengbiao, Jin Cheng, Yixin Wang, Yangyang Zhang, Ernie Tian, Zhiguo Tao, Xiaofei Han, Xiaotong Ding · PDF
  3. Agent-as-a-Coach: Towards Fully Agentic, Stateful, and Tool-Augmented Process Rewards

    Ed Li, Junyu Ren, Cat Yan, Kerem Goksel · PDF
  4. AI Organizations Are More Effective but Less Aligned than Individual Agents

    Judy Hanwen Shen, Daniel Zhu, Siddarth Srinivasan, Henry Sleight, Lawrence T. Wagner III, Morgan Jane Matthews, Jascha Sohl-Dickstein, Erik Jones · PDF
  5. AI-BAAM: AI-Driven Bank Statement Analytics as Alternative Data for Malaysian MSME Credit Scoring

    Chun Chet Ng, Zhen Hao Chu, Jia Yu Lim, Boon Yin Yin, Low Wei Zeng, Jin Khye Tan · PDF
  6. ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering

    Zhuowen Yuan, Tao Liu, Yang Yang, Yang Wang, Feng Qi, Kaushik Rangadurai, Bo Li, Shuang Yang · PDF
  7. Assessing Sovereignty in Multi-Agent Collaborations

    Eleonore Vissol-Gaudin, Janosch Haber, Andikan Otung · PDF
  8. Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling

    Yang Cai, Weiqiang Zheng · PDF
  9. Beyond Reasoning: RL-Policy Guided LLM Inference for Efficient Strategy in Liar’s Poker

    Richard Dewey, János Botyánszki, Ciamac C. Moallemi, Andrew Zheng · PDF
  10. Beyond Syntax: Action Semantics Learning for App Agents

    Bohan Tang, Dezhao Luo, Jianheng Liu, Jingxuan Chen, Shaogang Gong, Jianye Hao, Jun Wang, Kun Shao · PDF
  11. Beyond Text-Passing: Shared Cognitive Substrates for Multi-Agent LLM Coordination

    Ning Coeva · PDF
  12. Build, Judge, Optimize: A Blueprint for Continuous Improvement of Multi-Agent Consumer Assistants

    Steven Guanxing Xu, Alejandro Breen, Aayush Sheth, Sudeep Das, Zhucheng Zhan, Hongtai Wei, Charles Wright, Marcus Yearwood · PDF
  13. Can Small Agents Collaborate to Beat a Single Large Language Model?

    Agata Żywot, Xinyi Chen, Yifei Yuan, Anders Søgaard, Maarten de Rijke · PDF
  14. Cattle Trade: A Multi-Agent Benchmark for LLM Bluffing, Bidding, and Negotiation

    Robert Müller · PDF
  15. ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making

    Ziyang Guo, Yifan Wu, Jason Hartline, Ken Holstein, Jessica Hullman · PDF
  16. CooperBench: Benchmarking Cooperation in Coding Agents

    Arpandeep Khatua, Hao Zhu, Peter Tran, Arya Prabhudesai, Frederic Sadrieh, Johann Kaspar Lieberwirth, Xinkai Yu, Yicheng Fu, Michael J Ryan, Jiaxin Pei, Diyi Yang · PDF
  17. CORAL: Cooperative Multi-Agent Orchestration for LLM Adaptation Across Diverse Environments

    Nitin Vetcha · PDF
  18. Divide-and-Conquer CoT: RL for Reducing Latency via Parallel Reasoning

    Arvind V. Mahankali, Kaiyue Wen, Tengyu Ma · PDF
  19. Do Language Models Deceive? Strategic Behavior and Emergent Deception in Multi-Agent Auctions

    Aman Sharma · PDF
  20. Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems

    Lang Feng, Longtao Zheng, Shuo He, Fuxiang Zhang, Bo An · PDF
  21. EconAI: Dynamic Persona Evolution and Memory-Aware Agents in Evolving Economic Environments

    Yijin Chen, Ning Lyu, Shengning Lang, Hao Yan, Zhiguo Tao, Xiaotong Ding, Xiaotong Zhu · PDF
  22. Evaluating Cooperation in LLM Social Groups through Elected Leadership

    Ryan Faulkner, Anushka Deshpande, David Guzman Piedrahita, Joel Z Leibo, Zhijing Jin · PDF
  23. Evaluating LLM Agents as Human Simulators in Climate Social Dilemmas

    Kaiyuan Liu, Xiaoxuan Hou, Jiayi Yuan, Natasha Jaques · PDF
  24. EvoCF: Multi-Agent Collaboration via Agentic Memory-Driven Evolutionary Counterfactual Planning

    Haotian Chi, Zeyu Feng, Xingrui Yu, Linbo Luo, Yew-Soon Ong, Ivor Tsang, Hechang Chen, Yi Chang, Haiyan Yin · PDF
  25. Expanding the Capabilities of Reinforcement Learning via Text Feedback

    Yuda Song, Lili Chen, Fahim Tajwar, Rémi Munos, Deepak Pathak, Drew Bagnell, Aarti Singh, Andrea Zanette · PDF
  26. Explanations are a Means to an End: Decision Theoretic Explanation Evaluation

    Ziyang Guo, Berk Ustun, Jessica Hullman · PDF
  27. Federation over Text

    Dixi Yao, Tahseen Rabbani, Tian Li · PDF
  28. Group Distributionally Robust Optimization-Driven RL for LLM Reasoning

    Kishan Panaganti, Zhenwen Liang, Wenhao Yu, Haitao Mi, Dong Yu · PDF
  29. GT-HarmBench: Benchmarking AI Safety Risks Through the Lens of Game Theory

    X. Angelo Huang, Pepijn Cobben, Thao Amelia Pham, Terry Jingchen Zhang, Zhijing Jin · PDF
  30. Heterogeneous Low-Bandwidth Pre-Training of LLMs

    Yazan Obeidi, Amir Sarfi, Joel Lidin, Paul Janson, Eugene Belilovsky · PDF
  31. Hierarchical Generative Agents for Simulating Sequential Human Behavior

    Maria G. Mendoza, Lucas Waldburger, Jin Lee, S. Shankar Sastry · PDF
  32. How Communication Modalities Shape Topology in Generative Multi-Agent Systems

    Vinicius Covas · PDF
  33. Interpretable Multi-Agent Debate for Political Opinion Simulation

    Aali Azamat uulu, Justin Xue Taing, Alibek Dadajonov, Mayank Goel · PDF
  34. JaxAHT: A JAX-Based Library for Ad Hoc Teamwork

    Caroline Wang, Rolando Fernandez, Jiaxun Cui, Johnny Liu, Aditya Madhan, Zhihan Wang, Lingyun Xiao, Di Yang Shi, Arrasy Rahman, Peter Stone · PDF
  35. LaneRoPE: Positional Encoding for Collaborative Parallel Reasoning and Generation

    Gabriele Cesa, Thomas Hehn, Aleix Torres-Camps, Àlex Batlle Casellas, Jordi Ros-Giralt, Arash Behboodi, Tribhuvanesh Orekondy · PDF
  36. Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic

    Shuo Liu, Tianle Chen, Ryan Amiri, Christopher Amato · PDF
  37. Learning the Preferences of a Learning Agent

    Karim Abdel Sadek, Mark Bedaywi, Rhys Gould, Stuart Russell · PDF
  38. Let’s Talk, Not Type: An Oral-First Multi-Agent Architecture for Guarani

    Samantha Adorno, Akshata Kishore Moharir, Ratna Kandala · PDF
  39. MAGIC: Multi-Agent Generative Intention Coordination

    David Huk, Oliver Hamelijnck, Dimitris Demiris, Theodoros Damoulas · PDF
  40. MAPLE: Multi-Agent Prior Learning for Constructing Tree Ensembles

    Nguyen Viet Tuan Kiet, Nguyen Ba Thinh, Thanh Trung Huynh, Hieu Pham · PDF
  41. MetroRehearsal: Tool-Guided Multi-Agent Debate for Metro Emergency Planning

    Jinlin Li, Xiao Zhou, Yingying Zhang, Xian Wu · PDF
  42. Multi-Agent Consensus Matrix Modeling for Medical Decision-Making: A Role-Specialized LLM Framework for Oncology MDT Consultations

    Ziyi Ni, Yiming Yan, Xiaoyi Qu, Yanzhan Chen, Chuang Liu · PDF
  43. Not All Clients Are Equal: Collaborative Model Personalization on Heterogeneous Multi-Modal Clients

    Minhyuk Seo, Taeheon Kim, Hankook Lee, Jonghyun Choi, Tinne Tuytelaars · PDF
  44. Novelty-Gated Experience Sharing for Multi-Agent Reinforcement Learning

    Manish Sai Kota, Thomas Fan, Harshita Poojary, Nolawi Teklehaimanot, Aishwarya Balwani · PDF
  45. Reasonably reasoning agents avoid game-theoretic failures in zero-shot, provably

    Enoch H. Kang · PDF
  46. RPRA: Predicting an LLM-Judge for Efficient but Performant Inference

    Dylan R. Ashley, Gael Le Lan, Changsheng Zhao, Naina Dhingra, Zhipeng Cai, Ernie Chang, Mingchen Zhuge, Yangyang Shi, Vikas Chandra, Jürgen Schmidhuber · PDF
  47. Safe Test-Time Reinforcement Learning for Imperfect Information Games

    Ondrej Kubicek, Viliam Lisý, Tuomas Sandholm · PDF
  48. Scaling Inference-Time Computation via Opponent Simulation: Enabling Online Strategic Adaptation in Repeated Negotiation

    Xiangyu Liu, Di Wang, Zhe Feng, Aranyak Mehta · PDF
  49. Self-Improvement of Language Models by Post-Training on Multi-Agent Debate

    Ankur Samanta, Akshayaa Magesh, Runzhe Wu, Ayush Jain, Youliang Yu, Daniel Jiang, Boris Vidolov, Paul Sajda, Yonathan Efroni, Kaveh Hassani · PDF
  50. Self-Questioning Language Models

    Lili Chen, Mihir Prabhudesai, Katerina Fragkiadaki, Hao Liu, Deepak Pathak · PDF
  51. SkillTracer: Structural Failure Attribution and Refinement of Agentic Skills in Long-Horizon Web Tasks

    Yuyang Li, Yiran Dou, Jie-Jing Shao, Yueming Lyu, Ivor Tsang, Haiyan Yin · PDF
  52. Talk Before You Retrieve: Agent-Led Discussions for Better RAG in Medical QA

    Xuanzhao Dong, Wenhui Zhu, Hao Wang, Xiwen Chen, Peijie Qiu, Rui Yin, Yi Su, Yalin Wang · PDF
  53. Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling

    Jeffrey T. H. Wong, Zixi Zhang, Junyi Liu, Yiren Zhao · PDF
  54. The Decrypto Benchmark for Multi-Agent Reasoning and Theory of Mind

    Andrei Lupu, Timon Willi, Jakob Nicolaus Foerster · PDF
  55. UT-Evolve: An Evolutionary Agent for Unit Test Writing

    Arshika Lalan, Rajat Ghosh, Debojyoti Dutta · PDF
  56. Verified Multi-Agent Orchestration: A Plan-Execute-Verify-Replan Framework for Complex Query Resolution

    Xing Zhang, Yanwei Cui, Guanghui Wang, Qucy Wei Qiu, Ziyuan Li, Fangwei Han, Yajing Huang, Hengzhi Qiu, Bing Zhu, Peiyang He · PDF
  57. Zero-Shot Coordination among LLM Agents

    Adrian Hayler, Shashank Reddy Chirra, Andrei Lupu, Johannes Forkel, Bidipta Sarkar, Siheng Feng, Jakob Nicolaus Foerster · PDF