ICLR 2024 Past Large language modelsAgents
ICLR 2024 Workshop on Large Language Model (LLM) Agents
LLMAgents @ ICLR 2024
- Submission deadline
- Feb 12, 2024, 23:59 UTC imported from OpenReview — check the website for extensions
- Submission portal
- OpenReview
- Notes
- Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).
Accepted papers (96)
Fetched from OpenReview (v2) on 2026-06-10.
-
A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
-
A-CONECT: Designing AI-based Conversational Chatbot for Early Dementia Intervention
-
Adapting Uni-Modal Language Models for Dense Multi-Modal Co-Reference Resolution using Parameter Augmentation
-
Agent Instructs Large Language Models to be General Zero-Shot Reasoners
-
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
-
Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
-
AgentBoard: An Analytical Evaluation Board of Multi-turn LLM Agents
-
Agents: An Open-source Framework for Autonomous Language Agents
-
An Embodied Generalist Agent in 3D World
-
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL
-
Are Machines Better at Slow Thinking? Unveiling Human-Machine Inference Gaps in Entailment Verification
-
AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
-
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
-
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping
-
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments
-
BOLAA: BENCHMARKING AND ORCHESTRATING LLM AUTONOMOUS AGENTS
-
Bring Your Own KG: Self-Supervised Program Synthesis for Zero-Shot KGQA
-
Can Large Language Models be Good Path Planners? A Benchmark and Investigation on Spatial-temporal Reasoning
-
Collaborative LLM-Agents for Editable Driving Scene Simulation
-
Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
-
Corex: Pushing the Boundaries of Complex Reasoning through Multi-Model Collaboration
-
Data-Copilot: Bridging Billions of Data and Humans with Autonomous Workflow
-
Decision-Oriented Dialogue for Human-AI Collaboration
-
Do LLM Agents Have Regret? A Case Study in Online Learning and Games
-
EASYTOOL: Enhancing LLM-based Agents with Concise Tool Instruction
-
EcoAssistant: Using LLM Assistants More Affordably and Accurately
-
Efficient Human-AI Coordination via Preparatory Language-based Convention
-
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records
-
Empowering Autonomous Driving with Large Language Models: A Safety Perspective
-
Executable Code Actions Elicit Better LLM Agents
-
Exploring Collaboration Mechanisms for LLM Agents: A Social Psychology View
-
Exposing Limitations of Language Model Agents in Sequential-Task Compositions on the Web
-
Expressing and Exploiting Parallelism in Language Model Decoding
-
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design
-
FL-TAC: Enhanced Fine-Tuning in Federated Learning via Low-Rank, Task-Specific Adapter Clustering
-
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
-
GPT-4V(ision) is a Generalist Web Agent, if Grounded
-
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models
-
Hierarchical Auto-Organizing System for Open-Ended Multi-Agent Navigation
-
If LLM Is the Wizard, Then Code Is the Wand: A Survey on How Code Empowers Large Language Models to Serve as Intelligent Agents
-
IntentGPT: Few-shot Intent Discovery with Large Language Models
-
Is it Possible to Edit Large Language Models Robustly?
-
L3GO: Language Agents with Chain-of-3D-Thoughts for Generating Unconventional Objects
-
LangProp: A code optimization framework using Large Language Models applied to driving
-
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
-
Language-guided Skill Learning with Temporal Variational Inference
-
Large Language Model Evaluation Via Multi AI Agents: Preliminary results
-
Large Language Models can Strategically Deceive their Users when Put Under Pressure
-
LEAGUE++: EMPOWERING CONTINUAL ROBOT LEARNING THROUGH GUIDED SKILL ACQUISITION WITH LARGE LANGUAGE MODELS
-
Limitations of Agents Simulated by Predictive Models
-
LLF-Bench: Benchmark for Interactive Learning from Language Feedback
-
LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models
-
LLM-Deliberation: Evaluating LLMs with Interactive Multi-Agent Negotiation Game
-
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
-
MAGIC: INVESTIGATION OF LARGE LANGUAGE MODEL POWERED MULTI-AGENT IN COGNITION, ADAPTABILITY, RATIONALITY AND COLLABORATION
-
Making Retrieval-Augmented Language Models Robust to Irrelevant Context
-
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents
-
MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning
-
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
-
On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent
-
Open-TI: Open Traffic Intelligence with Augmented Language Model
-
OpenAgents: An Open Platform for Language Agents in the Wild
-
OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models
-
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement
-
Plan-Seq-Learn: Language Model Guided RL for Solving Long Horizon Robotics Tasks
-
Preference-Conditioned Language-Guided Abstraction
-
Prioritizing Safeguarding Over Autonomy: Risks of LLM Agents for Science
-
ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning
-
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents
-
R2E: Turning any Github Repository into a Programming Agent Environment
-
Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement
-
ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent
-
REX: Rapid Exploration and eXploitation for AI agents
-
S-Agents: Self-organizing Agents in Open-ended Environments
-
SAGE: Bridging Semantic and Actionable Parts for Generalizable Manipulation of Articulated Objects
-
SceneCraft: An LLM Agent for Synthesizing 3D Scene as Blender Code
-
SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents
-
Self-Alignment of Large Language Models via Multi-Agent Social Simulation
-
SELF-IMAGINE: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination
-
Self-Training Language Models in Arithmetic Reasoning
-
Simulating Opinion Dynamics with Networks of LLM-based Agents
-
TaskBench: Benchmarking Large Language Models for Task Automation
-
The Agent Ohana: Designing Unified Data and Training Pipeline for Effective Agent Learning
-
The ART of LLM Refinement: Ask, Refine, Trust
-
The Wisdom of Partisan Crowds: Comparing Collective Intelligence in Humans and LLM-based Agents
-
Towards General Computer Control: A Multimodal Agent for Red Dead Redemption II as a Case Study
-
Towards Natural Language-Driven Industrial Assembly Using Foundation Models
-
Towards Self-Improving Language Models for Code Generation
-
Towards Unified Alignment Between Agents, Humans, and Environment
-
TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems
-
TravelPlanner: A Benchmark for Real-World Planning with Language Agents
-
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
-
VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks
-
WavCraft: Audio Editing and Generation with Large Language Models
-
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
-
WorkArena: How Capable are Web Agents at Solving Common Knowledge Work Tasks?