NeurIPS 2025 Workshop on Embodied World Models for Decision Making

NeurIPS 2025 Workshop EWM

Submission deadline: Sep 3, 2025, 23:59 UTC (imported from OpenReview; check the website for extensions)
Submission portal: OpenReview
Notes: Auto-imported from the OpenReview venue record on 2026-06-10; please verify and enrich (topics are keyword-guessed).

Accepted papers (51)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Smooth Sea Never Made a Skilled SAILOR: Robust Imitation via Learning to Search

    Arnav Kumar Jain, Vibhakar Mohta, Subin Kim, Atiksh Bhardwaj, Juntao Ren, Yunhai Feng, Sanjiban Choudhury, Gokul Swamy · PDF
  2. Abstract Sim2Real through Approximate Information States

    Yunfu Deng, Josiah P. Hanna · PDF
  3. Ada-Diffuser: Latent-Aware Adaptive Diffusion for Decision-Making

    Fan Feng, Selena Ge, Minghao Fu, Zijian Li, Yujia Zheng, Zeyu Tang, Yingyao Hu, Biwei Huang, Kun Zhang · PDF
  4. Adversarial Diffusion for Robust Reinforcement Learning

    Daniele Foffano, Alessio Russo, Alexandre Proutiere · PDF
  5. Avi: A 3D Vision-Language Action Model Architecture generating Action from Volumetric Inference

    Harris Song, Long Le · PDF
  6. Beyond Experience: Fictive Learning as an Inherent Advantage of World Models

    Jianning Chen, Masakazu Taira, Kenji Doya · PDF
  7. Bridging the Sim-to-Real Gap in Humanoid Dynamics via Learned Nonlinear Operators

    Jieming Cui, Zhenghao Qi, Yutang Lin, Yifei Zhao, Yuntian Hu, Lei Kuang, Shuang Qiu, Ruihua Zhang, Bin He, Yixin Zhu · PDF
  8. Communicating Plans, Not Percepts: Scalable Multi-Agent Coordination with Embodied World Models

    Brennen Hill, Mant Koh En Wei, Jishnuanandh Thangavel · PDF
  9. Coupled Distributional Random Expert Distillation for World Model Online Imitation Learning

    Shangzhe Li, Zhiao Huang, Hao Su · PDF
  10. CRISP: Contact-guided Real2Sim from Monocular Video with Planar Scene Primitives

    Zihan Wang, Jiashun Wang, Jeff Tan, Yiwen Zhao, Jessica K. Hodgins, Shubham Tulsiani, Deva Ramanan · PDF
  11. Decoupled Planning and Execution with LLM-Driven World Models for Efficient Reinforcement learning

    Guoqing Ma · PDF
  12. Divide and Merge: Motion and Semantic Learning in End-to-End Autonomous Driving

    Yinzhe Shen, Omer Sahin Tas, Kaiwen Wang, Royden Wagner, Christoph Stiller · PDF
  13. EnerVerse-AC: Envisioning Embodied Environments with Action Condition

    Yuxin Jiang, Shengcong Chen, Siyuan Huang, Liliang Chen, Pengfei Zhou, Yue Liao, Xindong He, Chiming Liu, Hongsheng Li, Maoqing Yao, Guanghui Ren · PDF
  14. Exploring exploration with foundation agents in interactive environments

    Daniel P. Sawyer, Nan Rosemary Ke, Hubert Soyer, Martin Engelcke, John Reid, David P Reichert, Drew A. Hudson, Alexander Lerchner, Danilo Jimenez Rezende, Timothy P Lillicrap, Michael Curtis Mozer, Jane X Wang · PDF
  15. FalconWing: An Ultra-Light Fixed-Wing Platform for Indoor Aerial Applications

    Yan Miao, Will Shen, Hang Cui, Sayan Mitra · PDF
  16. FLAM: Scaling Latent Action World Models with Factorization

    Zizhao Wang, Chang Shi, Jiaheng Hu, Roberto Martín-Martín, Peter Stone · PDF
  17. Foundation Models as World Models: A Foundational Study in Text-Based GridWorlds

    Remo Sasso, Michelangelo Conserva, Dominik Jeurissen, Paulo Rauber · PDF
  18. Generative World Models of Tasks: LLM-Driven Hierarchical Scaffolding for Embodied Agents

    Brennen Hill · PDF
  19. Geosteering Through the Lens of Decision Transformers: Toward Embodied Sequence Decision-Making

    Hibat Errahmen DJECTA · PDF
  20. HDFlow: Hierarchical Diffusion-Flow Planning for Long-horizon Robotic Assembly

    Gireesh Nandiraju, Yuanliang Ju, Chaoyi Xu, He Wang · PDF
  21. How Foundational Skills Influence VLM-based Embodied Agents: A Native Perspective

    Bo Peng, Pi Bu, Keyu Pan, Xinrun Xu, Miao Chen, Yang Du, Lin Li, Jun Song, Tong Xu, Bo Zheng · PDF
  22. Improvisational Reasoning with Vision-Language Models for Grounded Procedural Planning

    Md Masudur Rahman, Yupeng Zhuo, Juan Wachs · PDF
  23. In-Context Policy Iteration for Dynamic Manipulation

    Mark Van der Merwe, Devesh K. Jha · PDF
  24. Latent Weight Diffusion: Generating reactive policies instead of trajectories

    Shashank Hegde, Satyajeet Das, Gautam Salhotra, Gaurav S. Sukhatme · PDF
  25. Learning to Focus: Prioritizing Informative Histories with Structured Attention Mechanisms in Partially Observable Reinforcement Learning

    Daniel De Dios Allegue, Jinke He, Frans A Oliehoek · PDF
  26. LLM-Guided Probabilistic Program Induction for POMDP Model Estimation

    Aidan Curtis, Hao Tang, Thiago Veloso, Kevin Ellis, Joshua B. Tenenbaum, Tomás Lozano-Pérez, Leslie Pack Kaelbling · PDF
  27. Mobile Manipulation with Active Inference for Long-Horizon Rearrangement Tasks

    Corrado Pezzato, Ozan Catal, Toon Van de Maele, Riddhi J. Pitliya, Tim Verbelen · PDF
  28. NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows

    Denis Tarasov, Alexander Nikulin, Ilya Zisman, Albina Klepach, Lyubaykin Nikita, Andrei Polubarov, Alexander Derevyagin, Vladislav Kurenkov · PDF
  29. OpenGVL - Benchmarking Visual Temporal Progress for Data Curation

    Paweł Budzianowski, Emilia Wiśnios, Gracjan Góral, Igor Kulakov, Viktor Petrenko, Krzysztof Walas · PDF
  30. Opinion: A Unified World Model is the cornerstone for integrating perception, reasoning, and decision-making in embodied AI

    Yipeng Xu · PDF
  31. Opinion: How Can Causal AI Benefit World Models?

    Qiuling Pan, Hong Zhou, Zhouchen Lin · PDF
  32. Opinion: Learning Intuitive Physics May Require More Than Visual Data

    Ellen Su, Solim LeGris, Todd M. Gureckis, Mengye Ren · PDF
  33. Opinion: Small VLAs Self-Learn Consistency

    Francesco Capuano, Adil Zouitine, Michel Aractingi · PDF
  34. Opinion: Towards Unified Expressive Policy Optimization for Robust Robot Learning

    Haidong Huang, Haiyue Zhu, Jiayu Song, Xixin Zhao, Yaohua Zhou, Jiayi Zhang, Yuze Zhai, Xiaocong Li · PDF
  35. Plan Verification for LLM-Based Embodied Task Completion Agents

    Ananth Hariharan, Vardhan Dongre, Dilek Hakkani-Tür, Gokhan Tur · PDF
  36. PolicyGRID: Acting to Understand, Understanding to Act

    Taqiya Ehsan, Shuren Xia, Jorge Ortiz · PDF
  37. RDAR: Reward-Driven Agent Relevance Estimation for Autonomous Driving

    Carlo Bosio, Greg Woelki, Noureldin Hendy, Nicholas Roy, Byungsoo Kim · PDF
  38. Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics

    Chenhao Li, Andreas Krause, Marco Hutter · PDF
  39. ROPES: Robotic Pose Estimation via Score-based Causal Representation Learning

    Pranamya Prashant Kulkarni, Puranjay Datta, Emre Acartürk, Burak Varıcı, Karthikeyan Shanmugam, Ali Tajer · PDF
  40. ScenePhys — Controllable Physics Videos for World-Model Evaluation

    Arshia Hemmat, Emad Aghahosseini, Alireza Nasri, Mohammad Hossein Shaker Ardakani, Amirmasoud Rismanchian, Ali Mamanpoosh, Afsaneh Fatemi · PDF
  41. Sim-to-Real Contact-Rich Pivoting via Optimization-Guided RL with Vision and Touch

    Yuki Shirai, Kei Ota, Devesh K. Jha, Diego Romeres · PDF
  42. SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards

    Hunar Batra, Haoqin Tu, Hardy Chen, Yuanze Lin, Cihang Xie, Ronald Clark · PDF
  43. SPUR: Scaling Reward Learning from Human Demonstrations

    Anthony Liang, Yigit Korkmaz, Jiahui Zhang, Jesse Zhang, Abrar Anwar, Sidhant Kaushik, Yufei Wang, Yu Xiang, David Held, Dieter Fox, Abhishek Gupta, Stephen Tu, Erdem Biyik · PDF
  44. Stable Planning through Aligned Representations in Model-Based Reinforcement Learning

    Misagh Soltani, Forest Agostinelli · PDF
  45. Steering Diffusion Policies with Value-Guided Denoising

    Hanming Ye · PDF
  46. The Physical Basis of Prediction: World Model Formation in Neural Organoids via an LLM-Generated Curriculum

    Brennen Hill · PDF
  47. Towards Fine-tuning a Small Vision-Language Model for Aerial Navigation

    Hakob Tamazyan, Narek Nurijanyan, Boris Martirosyan, Hrant Khachatrian · PDF
  48. ViPRA: Video Prediction for Robot Actions

    Sandeep Routray, Hengkai Pan, Unnat Jain, Shikhar Bahl, Deepak Pathak · PDF
  49. Vision-Language Reasoning for Burn Depth Assessment with Structured Diagnostic Hypotheses

    Md Masudur Rahman, Mohamed El Masry, Kristo Nuutila, Gayle Gordillo, Juan Wachs · PDF
  50. VLA-OS: Structuring and Dissecting Planning Representations and Paradigms in Vision-Language-Action Models

    Chongkai Gao, Zixuan Liu, Zhenghao Chi, Junshan Huang, Xin Fei, Yiwen Hou, Yuxuan Zhang, Yudi Lin, Zhirui Fang, Zeyu Jiang, Lin Shao · PDF
  51. WHALE: Towards Generalizable and Scalable World Models for Embodied Decision-making

    Zhilong Zhang, Ruifeng Chen, Junyin Ye, Yihao Sun, Haoxiang Ren, Xinghao Du, Pengyuan Wang, Jing-Cheng Pang, Kaiyuan Li, Tian-Shuo Liu, Haoxin Lin, Yang Yu, Zhi-Hua Zhou · PDF