NeurIPS 2024 Past Other
Language Gamification - NeurIPS 2024 Workshop
LanGame
- Submission deadline
- Oct 6, 2024, 23:59 UTC imported from OpenReview — check the website for extensions
- Submission portal
- OpenReview
- Notes
- Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).
Accepted papers (39)
Fetched from OpenReview (v2) on 2026-06-10.
-
AidanBench: Evaluating Novel Idea Generation on Open-Ended Questions
-
Automated Design of Agentic Systems
-
Beyond Benchmarking: Automated Capability Discovery via Model Self-Exploration
-
Boundless Socratic Learning with Language Games
-
Communication via Shared Memory Improves Multi-agent Pathfinding
-
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
-
Creativity Has Entered the Chat, With a Stranger: Novelty is a Nash Equilibrium
-
Dynamic Planning with a LLM
-
Economics Arena for Large Language Models
-
Efficacy of Language Model Self-Play in Non-Zero-Sum Games
-
Embodied LLM Agents Learn to Cooperate in Organized Teams
-
Embodied-RAG: General Non-parametric Embodied Memory for Retrieval and Generation
-
Estimating Effects of Tokens in Preference Learning
-
Evaluating the role of ‘Constitutions’ for learning from AI feedback
-
Evolving Alignment via Asymmetric Self-Play
-
GameBench: Evaluating Strategic Reasoning Abilities of LLM Agents
-
Games as Ontology Engines: AI and LLMs Invoke Spatiotemporal and Metaphysical Realities in Virtual Worlds
-
Improving Branching Language via Self-Reflection
-
LlaMa meets Cheburashka: impact of cultural background for LLM quiz reasoning
-
Mimicking Human Emotions: Persona-Driven Behavior of LLMs in the ‘Buy and Sell’ Negotiation Game
-
Multi-Step Preference Optimization via Two-Player Markov Games
-
On Reward Functions For Self-Improving Chain-of-Thought Reasoning Without Supervised Datasets (Abridged Version)
-
OnThePlanning Abilities of OpenAI’s o1 Models: Feasibility, Optimality, and Generalizability
-
PACE: Procedural Abstractions for Communicating Efficiently
-
PIANIST: Learning Partially Observable World Models with LLMs for Multi-Agent Decision Making
-
PokéChamp: an Expert-level Minimax Language Agent for Competitive Pokémon
-
Positive Experience Reflection for Agents in Interactive Text Environments
-
Reinterpreting Signaling and Referential Games as Generative Models
-
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial?
-
S2L-RM: Short-to-Long Reward Modeling
-
Sample-Efficient Alignment for LLMs
-
SAPIENT: Mastering Multi-turn Conversational Recommendation with Strategic Planning and Monte Carlo Tree Search
-
Sharing Minds during MARL Training for Enhanced Cooperative LLM Agents
-
Situated Instruction Following Under Ambiguous Human Intent
-
Strategic Collusion of LLM Agents: Market Division in Multi-Commodity Competitions
-
Strategic Interactions between Large Language Models-based Agents in Beauty Contests
-
Stutter Makes Smarter: Learning Self-Improvement for Large Language Models
-
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
-
What Makes Your Model a Low-empathy or Warmth Person: Exploring the Oringins of Personality in LLMs