
Language Gamification - NeurIPS 2024 Workshop

LanGame

Submission deadline
Oct 6, 2024, 23:59 UTC
Imported from OpenReview; check the website for extensions.
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (39)

Fetched from OpenReview (v2) on 2026-06-10.

  1. AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions

    Aidan McLaughlin, Anuja Uppuluri, James Campbell · PDF
  2. Automated Design of Agentic Systems

    Shengran Hu, Cong Lu, Jeff Clune · PDF
  3. Beyond Benchmarking: Automated Capability Discovery via Model Self-Exploration

    Cong Lu, Shengran Hu, Jeff Clune · PDF
  4. Boundless Socratic Learning with Language Games

    Tom Schaul · PDF
  5. Communication via Shared Memory Improves Multi-agent Pathfinding

    Alsu Sagirova, Yuri Kuratov, Mikhail Burtsev · PDF
  6. CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing

    Chen Yang, Chenyang Zhao, Quanquan Gu, Dongruo Zhou · PDF
  7. Creativity Has Entered the Chat, With a Stranger: Novelty is a Nash Equilibrium

    Kotaro Sakamoto, Shiro Takagi, Shuhei Ogawa, Yutaka Matsuo · PDF
  8. Dynamic Planning with a LLM

    Gautier Dagan, Frank Keller, Alex Lascarides · PDF
  9. Economics Arena for Large Language Models

    Shangmin Guo, Haochuan Wang, Haoran Bu, Yi Ren, Dianbo Sui, Yu-Ming Shang, Siting Estee Lu · PDF
  10. Efficacy of Language Model Self-Play in Non-Zero-Sum Games

    Austen Liao, Nicholas Tomlin, Dan Klein · PDF
  11. Embodied LLM Agents Learn to Cooperate in Organized Teams

    Xudong Guo, Kaixuan Huang, Jiale Liu, Wenhui Fan, Natalia Vélez, Qingyun Wu, Huazheng Wang, Thomas L. Griffiths, Mengdi Wang · PDF
  12. Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation

    Quanting Xie, So Yeon Min, Tianyi Zhang, Kedi Xu, Aarav Bajaj, Russ Salakhutdinov, Matthew Johnson-Roberson, Yonatan Bisk · PDF
  13. Estimating Effects of Tokens in Preference Learning

    Hsiao-Ru Pan, Maximilian Mordig, Bernhard Schölkopf · PDF
  14. Evaluating the role of ‘Constitutions’ for learning from AI feedback

    Saskia Redgate, Andrew Michael Bean, Adam Mahdi · PDF
  15. Evolving Alignment via Asymmetric Self-Play

    Ziyu Ye, Rishabh Agarwal, Tianqi Liu, Rishabh Joshi, Sarmishta Velury, Quoc V Le, Qijun Tan, Yuan Liu · PDF
  16. GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents

    Anthony Costarelli, Mat Allen, Roman Hauksson, Grace Sodunke, Suhas Hariharan, Carlson Cheng, Wenjie Li, Joshua M Clymer, Arjun Yadav · PDF
  17. Games as Ontology Engines: AI and LLMs Invoke Spatiotemporal and Metaphysical Realities in Virtual Worlds

    Jasmine Roberts, Andrzej Banburski · PDF
  18. Improving Branching Language via Self-Reflection

    Kolby Nottingham, Ruo-Ping Dong, Ben Kasper, Wesley N. Kerr · PDF
  19. LlaMa meets Cheburashka: impact of cultural background for LLM quiz reasoning

    Mikhail Lifar, Bogdan Protsenko, Daniil Kupriianenko, Nazar Chubkov, Kulaev Kirill Dmitrievich, Alexander Guda, Irina Piontkovskaya · PDF
  20. Mimicking Human Emotions: Persona-Driven Behavior of LLMs in the ‘Buy and Sell’ Negotiation Game

    Mingyu Jeon, Jae Young Suh · PDF
  21. Multi-Step Preference Optimization via Two-Player Markov Games

    Yongtao Wu, Luca Viano, Yihang Chen, Zhenyu Zhu, Quanquan Gu, Volkan Cevher · PDF
  22. On Reward Functions For Self-Improving Chain-of-Thought Reasoning Without Supervised Datasets (Abridged Version)

    Thomas Foster, Eltayeb Ahmed, Jonathan Cook, Shalev Lifshitz, Tim Rocktäschel, Jakob Nicolaus Foerster · PDF
  23. On The Planning Abilities of OpenAI’s o1 Models: Feasibility, Optimality, and Generalizability

    Kevin Wang, Junbo Li, Neel P. Bhatt, Yihan Xi, Qiang Liu, Ufuk Topcu, Zhangyang Wang · PDF
  24. PACE: Procedural Abstractions for Communicating Efficiently

    Jonathan David Thomas, Andrea Silvi, Devdatt Dubhashi, Vikas Garg, Moa Johansson · PDF
  25. PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making

    Jonathan Light, Sixue Xing, Yuanzhe Liu, Weiqin Chen, Min Cai, Xiusi Chen, Guanzhi Wang, Wei Cheng, Yisong Yue, Ziniu Hu · PDF
  26. PokéChamp: an Expert-level Minimax Language Agent for Competitive Pokémon

    Seth Karten, Andy Luu Nguyen, Chi Jin · PDF
  27. Positive Experience Reflection for Agents in Interactive Text Environments

    Philip Lippmann, Matthijs T. J. Spaan, Jie Yang · PDF
  28. Reinterpreting Signaling and Referential Games as Generative Models

    Ryo Ueda · PDF
  29. Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?

    Wenzhe Li, Yong Lin, Mengzhou Xia, Chi Jin · PDF
  30. S2L-RM: Short-to-Long Reward Modeling

    Changyu Chen, Zichen Liu, Haonan Wang, Chao Du, Tianyu Pang, Qian Liu, Arunesh Sinha, Pradeep Varakantham, Min Lin · PDF
  31. Sample-Efficient Alignment for LLMs

    Zichen Liu, Changyu Chen, Chao Du, Wee Sun Lee, Min Lin · PDF
  32. SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search

    Hanwen Du, Bo Peng, Xia Ning · PDF
  33. Sharing Minds during MARL Training for Enhanced Cooperative LLM Agents

    Jiaxuan Gao, Yule Wen, Chao Yu, Yi Wu · PDF
  34. Situated Instruction Following Under Ambiguous Human Intent

    So Yeon Min, Xavier Puig, Devendra Singh Chaplot, Tsung-Yen Yang, Akshara Rai, Priyam Parashar, Russ Salakhutdinov, Yonatan Bisk, Roozbeh Mottaghi · PDF
  35. Strategic Collusion of LLM Agents: Market Division in Multi-Commodity Competitions

    Ryan Y. Lin, Siddhartha Ojha, Kevin Cai, Maxwell Chen · PDF
  36. Strategic Interactions between Large Language Models-based Agents in Beauty Contests

    Siting Estee Lu · PDF
  37. Stutter Makes Smarter: Learning Self-Improvement for Large Language Models

    Pei-Chen Ho, Meng-Hsi Chen, Alberto Bernacchia, Philipp Ennen, Yen-Chen Wu, Da-shan Shiu · PDF
  38. TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation

    Jonathan Cook, Tim Rocktäschel, Jakob Nicolaus Foerster, Dennis Aumiller, Alex Wang · PDF
  39. What Makes Your Model a Low-empathy or Warmth Person: Exploring the Origins of Personality in LLMs

    Shu Yang, Shenzhe Zhu, Liang Liu, Mengdi Li, Lijie Hu, Di Wang · PDF