ICLR 2026PastOther

ICLR 2026 the 2nd Workshop on World Models: Understanding, Modelling and Scaling

ICLR 2026 Workshop World Models

Official website ↗OpenReview venue ↗See all ICLR workshops →✎ Edit this entry

Submission deadline: Feb 8, 2026, 11:59 UTC
OpenReview-synced 2026-02-08 11:59 UTC (as of 2026-06-23) — extensions on OpenReview are applied automatically; verify on the website.
Submission portal: OpenReview
Notes: Topics were auto-suggested and may be imprecise — edits welcome.

Accepted papers (94)

Fetched from OpenReview (v2) on 2026-06-10.

[Tiny Paper] Safe Streaming Flow Planning by Aligning Generation with Execution
Seunghwan Jang, Jeongyong Yang, Siddharth Ancha, SooJean Han · PDF
[Tiny Paper] GEST-Engine: Controllable Multi-Actor Video Synthesis with Perfect Spatiotemporal Annotations
Nicolae Cudlenco, Mihai Masala, Marius Leordeanu · PDF
[Tiny Paper] Integrating Simulation and Chain-of-thought Reasoning in Multimodal-Language Models For Physical Reasoning
YingQiao Wang, Eric Bigelow, Tomer Ullman, Yujin Tang, Sebastian Risi · PDF
[Tiny Paper] Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces
Anthony Kobanda, Waris Radji · PDF
[Tiny Paper] Modular Training-Free Construction of Executable 3D Worlds from Narrative Text
Sanchit Singh · PDF
[Tiny Paper] Probabilistic Dreaming for World Models
Gavin Y. Wong · PDF
[Tiny Paper] Shortcut World Models: Learning to Leap, Not Step
Pranav Lakshmanan, Paras Chopra · PDF
[TINY PAPER] Temporal Reversal Asymmetry: A Physics-Inspired Metric for Evaluating World Models
Kanpat Vesessook, Kevin Yang · PDF
[Tiny Paper] Toward Pixel-Grounded World Models for Powered Descent: A Rocket Landing Benchmark and Expert Baseline
Charles Duong, Aviral Vaidya, Aditya Iyer, Lucas Maes, Aidan LaBella, Randall Balestriero · PDF
A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents
Raghu Arghal, Phoebe Chen, Niall Dalton, Evgenii Kortukov, Calum McNamara, Angelos Nalmpantis, Moksh Nirvaan, Gabriele Sarti, Mario Giulianelli · PDF
A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures
Basile Terver, Randall Balestriero, Megi Dervishi, David Fan, Quentin Garrido, Tushar Nagarajan, Koustuv Sinha, Wancong Zhang, Michael Rabbat, Yann LeCun, Amir Bar · PDF
Action Shapley: A training data selection metric for Training World Models for Reinforcement Learning
Rajat Ghosh, Debojyoti Dutta · PDF
Active World-Model with 4D-informed Re- trieval for Exploration and Awareness
Elaheh Vaezpour, Amirhosein Javadi, Tara Javidi · PDF
Beyond Patient Invariance: Learning Cardiac Dynamics via Action-Conditioned JEPAs
Jose Geraldo Fernandes, Luiz Facury de Souza, Pedro Robles Dutenhefner, Wagner Meira Jr. · PDF
BlockMamba: Efficient Scalable Structured Sparsity for Mamba
Harshvardhan Mestha, Khaleelulla Khan Nazeer, David Kappel, Anand Subramoney · PDF
Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
Prakhar Gupta, Vaibhav Gupta · PDF
CausalPhysics: Unifying Semantic Reasoning, Physical Dynamics, and Counterfactual Simulation in World Models
Mysore supreeth, Manish Mehta · PDF
CausalSliders: Graph-Guided LoRA Interventions for Causally Consistent Image Editing
Aditi Tiwari, Akshit Bhalla, Darshan Ganesh Prasad, Heng Ji · PDF
CausalSpatial: A Benchmark for Object-Centric Causal Spatial Reasoning
Wenxin Ma, Chenlong Wang, Ruisheng Yuan, Hao Chen, Nanru Dai, Yijun Yang, Chengxin Qian, Zhao-Yang Wang, Alan Yuille, Jieneng Chen · PDF
Cognitive Digital Twin Framework: Modeling and Real-Time Decision Making
Yangyang Zhang, Mengtong Li, Xinyu Wang, Zhihao Lin, Xiang Luo, Ernie Tian, Ning Lyu, Zhiguo Tao, Xiaotong Ding, Aaron Wang · PDF
Coherence‑Validated Causal World Models for Multi‑Scale Alzheimer’s Disease Progression and Pharmacologic Reversal
David Scott Lewis, Enrique Zueco · PDF
Compositional Planning with Jumpy World Models
Jesse Farebrother, Matteo Pirotta, Andrea Tirinzoni, Marc G Bellemare, Alessandro Lazaric, Ahmed Touati · PDF
Computer-Using World Model
Yiming Guan, Rui Yu, John Zhang, Lu Wang, Chaoyun Zhang, Liqun Li, Bo Qiao, Si Qin, He Huang, Fangkai Yang, Pu Zhao, Lukas Wutschitz, Samuel Kessler, Huseyin A Inan, Robert Sim, Saravan Rajmohan, Qingwei Lin, Dongmei Zhang · PDF
Consistent Video World Model With Geometry-Aware Rotary Position Embedding
Chendong Xiang, Jiajun Liu, Jintao Zhang, Xiao Yang, Zhengwei Fang, Shizun Wang, Zijun Wang, Yingtian Zou, Hang Su, Jun Zhu · PDF
Cross-View World Models
Rishabh Sharma, Gijs Hogervorst, Wayne Mackey, David Heeger, Stefano Martiniani · PDF
Ctrl-World: A Controllable Generative World Model for Robot Manipulation
Yanjiang Guo, Lucy Xiaoyang Shi, Jianyu Chen, Chelsea Finn · PDF
DexSIM: Real-time Dexterous Simulation with Unified Causal Video Diffusion
Adam Lee · PDF
Do LLMs Build Spatial World Models? Evidence from Grid-World Maze Tasks
Weijiang Li, Yilin Zhu, Rajarshi Das, Parijat Dube · PDF
Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction [Tiny Paper]
Michael Hauri, Friedemann Zenke · PDF
Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents
Xiao Yu, Baolin Peng, Ruize Xu, Michel Galley, Hao Cheng, Suman Nath, Jianfeng Gao, Zhou Yu · PDF
EGO-FLIGHT: Egocentric Grounding of Order for Frame-Level Inference in General Human Timelines
Jiahang He, Anya Singh, Jai Relan, Varun Nair · PDF
ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction
Qineng Wang, Wenlong Huang, Yu Zhou, Hang Yin, Tianwei Bao, Jianwen Lyu, Weiyu Liu, Ruohan Zhang, Jiajun Wu, Li Fei-Fei, Manling Li · PDF
Environment Maps: Structured Environmental Representations for Long-Horizon Agents
Yenchia Feng, Chirag Sharma, Karime Maamari · PDF
Evidential Latent World Models for Safe Model-based Reinforcement Learning
Alisson Henrique Kolling, Junior Costa De Jesus, Victor Augusto Kich, Ricardo Bedin Grando, Matheus Gonçalves Mateus, Rodrigo da Silva Guerra, Paulo L. J. Drews-Jr · PDF
EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory
Jiahao Wang, Luoxin Ye, Taiming Lu, Junfei Xiao, Jiahan Zhang, Yuxiang Guo, Xijun Liu, Rama Chellappa, Cheng Peng, Alan Yuille, Jieneng Chen · PDF
FluIDWorld: Fluid-like Interactive Dynamics for 4D Worlds
Hyeongju Mun, In-Hwan Jin, Sohyeong Kim, Kyeongbo Kong · PDF
GridWM-Judge: Evaluating Vision-Language Model Judges in Grid Worlds via World Model Deficits
Qinan Zhang, Qihang Jin · PDF
Grounding Generated Videos in Feasible Plans via World Models
Christos Ziakas, Amir Bar, Alessandra Russo · PDF
H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model
Wenyuan Chen, Jinbang Huang, Oscar Pang, Zhiyuan Li, Xiao Hu, Lingfeng Zhang, Zhanguang Zhang, Mark Coates, Tongtong Cao, Xingyue Quan, Yingxue Zhang · PDF
Hierarchical Latent Action Model
Hanjung Kim, Lerrel Pinto, Seon Joo Kim · PDF
Hierarchical World Models for Strategic AI Agents: Bridging Simulation and Reality through Multi-Fidelity Learning
Mysore supreeth, Atik Faysal, Manish Mehta, Sunil Kothari · PDF
Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning
Xiangyu Meng, Zixian Zhang, Zhenghao Zhang, Junchao Liao, Long Qin, Weizhi Wang · PDF
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
Runze Zhao, Yue Yu, Ruhan Wang, Chunfeng Huang, Dongruo Zhou · PDF
LaMo: A Latent Motion World Model for Long-Horizon Prediction
Azwar Abdulsalam, Christopher Hoang, Mengye Ren · PDF
Latent Imagination Thinking: Beyond Recursive Models for Reasoning
Karim Farid, Jelena Bratulić, Sudhanshu Mittal, Cordelia Schmid, Thomas Brox · PDF
LatentGS: Probabilistic Densification for Efficient, Compact, and Faster 3D Gaussian Splatting
Shuja Khalid, Mohamed Ibrahim, Yang Liu · PDF
Learning Evolving Latent Strategies for Multi-Agent Language Systems without Model Fine-Tuning
Wenlong Tang · PDF
Learning Navigable World Models via Latent Energy Shaping
Luiz Facury de Souza, Jose Geraldo Fernandes, Pedro Robles Dutenhefner, Wagner Meira Jr. · PDF
Lifting Ego World Models for Planning and Control
Alex N Wang, Trevor Darrell, Pavel Izmailov, Yutong Bai, Amir Bar · PDF
Mnemo: Policy Learning Accelerated by Experience
Xingrui Gu, Chuyi Jiang · PDF
Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning
Rohan Deb, Stephen J. Wright, Arindam Banerjee · PDF
Model Space Reasoning as Search in Feedback Space for Planning Domain Generation
James Oswald, Daniel Obolensky, Volodymyr Varha, Vasilije Dragovic, Kavitha Srinivas, Harsha Kokel, Michael Katz, Shirin Sohrabi · PDF
Model-Based Meta-Learning for Algorithm Discovery
Theo Wolf, Alexander David Goldie, Jarek Luca Liesen, Uljad Berdica, Mattie Fellows, Jakob Nicolaus Foerster · PDF
Motion Attribution for Video Generation
Xindi Wu, Despoina Paschalidou, Jun Gao, Antonio Torralba, Laura Leal-Taixé, Olga Russakovsky, Sanja Fidler, Jonathan Lorraine · PDF
MULTI-COMPONENT OUTCOME PREDICTION FOR ENTERPRISE ROUTING VIA HIERARCHICAL CREDIT ASSIGNMENT
Mysore supreeth, Atik Faysal, Manish Mehta, Sunil Kothari, Tao Liu · PDF
Multi-scale Predictive Representations for Goal-conditioned Reinforcement Learning
Valliappan CA, David Meger, Sai Rajeswar, Pietro Mazzaglia · PDF
Neural Computers
Mingchen Zhuge, Changsheng Zhao, Haozhe Liu, Zijian Zhou, Shuming Liu, Wenyi Wang, Ernie Chang, Gael Le Lan, Junjie Fei, Wenxuan Zhang, Yasheng SUN, Yunyang Xiong, Zechun Liu, Zhipeng Cai, Yining Yang, Yuandong Tian, Yangyang Shi, Vikas Chandra, Jürgen Schmidhuber · PDF
Next Embedding Prediction Makes World Models Stronger
George Bredis, Nikita Balagansky, Daniil Gavrilov, Ruslan Rakhimov · PDF
Parallel Stochastic Gradient-Based Planning for World Models
Michael Psenka, Michael Rabbat, Aditi S. Krishnapriyan, Yann LeCun, Amir Bar · PDF
Physical Informed Driving World Models
Zhuoran Yang, Yanyong Zhang · PDF
PhysLang: a Small Diagnostic Framework for Language-Grounded World Modeling
Noor Mairukh Khan Arnob, Azmine Toushik Wasi · PDF
Planning with Unified Multimodal Models
Yihao Sun, Zhilong Zhang, Yang Yu, Pierre-Luc Bacon · PDF
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
Wenlong Huang, Yu-Wei Chao, Arsalan Mousavian, Ming-Yu Liu, Dieter Fox, Kaichun Mo, Li Fei-Fei · PDF
PREDICTING CAMERA POSE FROM PERSPECTIVE DESCRIPTIONS FOR SPATIAL REASONING
Xuejun Zhang, Aditi Tiwari, Zhenhailong Wang, Heng Ji · PDF
ProgressLM: Towards Progress Reasoning in Vision-Language Models
Jianshu Zhang, Chengxuan Qian, Haosen Sun, Haoran Lu, Dingcheng Wang, Letian Xue, Han Liu · PDF
Reinforcement Learning with World Models for Optimizing Alzheimer’s Disease Treatment Timing and Dosing
David Scott Lewis, Enrique Zueco · PDF
Rethinking Video Generation Model for the Embodied World
Yufan Deng, Zilin Pan, Hongyu Zhang, Xiaojie Li, Huruoqing, Yufei Ding, Yiming Zou, Yan Zeng, Daquan Zhou · PDF
Reward-Forcing: Autoregressive Video Generation with Reward Feedback
Jingran Zhang, Ning Li, Yuanhao Ban, Andrew Bai, Justin Cui · PDF
RigidBench: Evaluating Rigid-Body Physics in Video Generation Models
Swarnim Jain, Shangzhe Wu · PDF
Robustness in the Face of Partial Identifiability in Reward Learning Problems
Filippo Lazzati, Alberto Maria Metelli · PDF
Safe Context Switching for Agents in the Wild: Mitigating Subspace Interference via Orthogonal Adaptation
Akash Das, Ishan Roy · PDF
Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game
Michael Katz, Harsha Kokel, Sarath Sreedharan · PDF
Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation
Jacob Levy, Tyler Westenbroek, Kevin Huang, Fernando Palafox, Patrick Yin, Shayegan Omidshafiei, Dong-Ki Kim, Abhishek Gupta, David Fridovich-Keil · PDF
SpaRRTa: A Synthetic Benchmark for Evaluating Spatial Intelligence in Visual Foundation Models
Turhan Can KARGIN, Wojciech Jasiński, Adam Pardyl, Bartosz Michał Zieliński, Marcin Przewięźlikowski · PDF
Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation
Zhichao Wu, Junyin Ye, Zhilong Zhang, Yihao Sun, Haoxin Lin, Haoxiang Ren, Jiaheng Luo, Lei Yuan, Yang Yu · PDF
Spiking Neural Networks for Continuous Control: Neuromorphic Reinforcement Learning in Conventional Computing
Jessica Hunter, Md Maruf Hossain Shuvo, Krishna Roy · PDF
stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation [Tiny Paper]
Lucas Maes, Quentin Le Lidec, Dan Haramati, Nassim Massaudi, Damien Scieur, Yann LeCun, Randall Balestriero · PDF
Structure from Diffusion: Taming Video Diffusion Models for Camera Pose Estimation in Dynamic Videos
Sihan Liu, Zhuoyuan Wu, Heng Yu, Jun Gao, Jose M. Alvarez · PDF
the Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation
Junichiro Niimi · PDF
Toward World Models for Epidemiology
Zeeshan Memon, Yiqi Su, Christo Kurisummoottil Thomas, Walid Saad, Liang Zhao, Naren Ramakrishnan · PDF
Towards Practical World Model-based Reinforcement Learning for Vision-Language-Action Models
Zhilong Zhang, Haoxiang Ren, Yihao Sun, Yifei Sheng, Haonan Wang, Zhichao Wu, Haoxin Lin, Pierre-Luc Bacon, Yang Yu · PDF
Tree of Options: Temporally Extended World Modeling, Planning, and Execution with Large Language Models
Xiaoling Zeng, Dingyang Chen, Qi Zhang · PDF
Uncertainty-Aware Robotic World Model Makes Offline Model-Based Reinforcement Learning Work on Real Robots
Chenhao Li, Andreas Krause, Marco Hutter · PDF
Understanding Early Collapse in Predictive World-Model Pretraining
Sofiane ENNADIR, Levente Zólyomi, Oleg Smirnov · PDF
VFMF: Dense Forecasting by Generating Foundation Model Features
Gabrijel Boduljak, Yushi Lan, Christian Rupprecht, Andrea Vedaldi · PDF
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
Zirui Wang, Junyi Zhang, Jiaxin Ge, Long Lian, Letian Fu, Lisa Dunlap, Ken Goldberg, XuDong Wang, Ion Stoica, David M. Chan, Sewon Min, Joseph E. Gonzalez · PDF
Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models
Jialong Wu, Xiaoying Zhang, Hongyi Yuan, XiangCheng Zhang, Tianhao Huang, Changjing He, Chaoyi Deng, Renrui Zhang, Youbin Wu, Mingsheng Long · PDF
WestWorld: A Knowledge-Encoded Scalable Trajectory World Model for Diverse Robotics
Yuchen Wang, Jiangtao Kong, Sizhe Wei, Xiaochang Li, Haohong Lin, Hongjue Zhao, Tianyi Zhou, Lu Gan, Huajie Shao · PDF
What Do World Models Learn in RL? Probing Latent Representations in Learned Environment Simulators
Xinyu Zhang · PDF
What Drives Compositional Generalization? The Importance of Continuous Training Objectives in Visual Generative Models
Karim Farid, Rajat Sahay, Yumna Alnaggar, Simon Schrodi, Volker Fischer, Cordelia Schmid, Thomas Brox · PDF
World Action Models are Zero-shot Policies
Seonghyeon Ye, Yunhao Ge, Kaiyuan Zheng, Shenyuan Gao, Sihyun Yu, George Kurian, Suneel Indupuru, You Liang Tan, Chuning Zhu, Jiannan Xiang, Ayaan Naveed Malik, Kyungmin Lee, William Liang, Nadun Ranawaka Arachchige, Jiasheng Gu, Yinzhen Xu, Guanzhi Wang, Fengyuan Hu, Avnish Narayan, Johan Bjorck, Jing Wang, Gwanghyun Kim, Dantong Niu, Ruijie Zheng, Yuqi Xie, Jimmy Wu, Qi Wang, Danfei Xu, Yilun Du, Ryan Julian, Yevgen Chebotar, Scott Reed, Jan Kautz, Yuke Zhu, Linxi Fan, Joel Jang · PDF
World Action Verifier: Self-Improving World Models via Forward-Inverse Asymmetry
Yuejiang Liu, Fan Feng, Lingjing Kong, Weifeng Lu, Jinzhou Tang, XiangCheng Zhang, Kun Zhang, Kevin Murphy, Chelsea Finn, Yilun Du · PDF
World Models as Execution Simulators for Automated Program Repair
Mysore supreeth, Atik Faysal, Manish Mehta, Sunil Kothari · PDF
World-Gymnast: Training Robots with Reinforcement Learning in a World Model
Ansh Kumar Sharma, Yixiang Sun, Ninghao Lu, Yunzhe Zhang, Jiarao Liu, Sherry Yang · PDF

Accepted papers (94)

☆[Tiny Paper] Safe Streaming Flow Planning by Aligning Generation with Execution

☆[Tiny Paper] GEST-Engine: Controllable Multi-Actor Video Synthesis with Perfect Spatiotemporal Annotations

☆[Tiny Paper] Integrating Simulation and Chain-of-thought Reasoning in Multimodal-Language Models For Physical Reasoning

☆[Tiny Paper] Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces

☆[Tiny Paper] Modular Training-Free Construction of Executable 3D Worlds from Narrative Text

☆[Tiny Paper] Probabilistic Dreaming for World Models

☆[Tiny Paper] Shortcut World Models: Learning to Leap, Not Step

☆[TINY PAPER] Temporal Reversal Asymmetry: A Physics-Inspired Metric for Evaluating World Models

☆[Tiny Paper] Toward Pixel-Grounded World Models for Powered Descent: A Rocket Landing Benchmark and Expert Baseline

☆A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents

☆A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures

☆Action Shapley: A training data selection metric for Training World Models for Reinforcement Learning

☆Active World-Model with 4D-informed Re- trieval for Exploration and Awareness

☆Beyond Patient Invariance: Learning Cardiac Dynamics via Action-Conditioned JEPAs

☆BlockMamba: Efficient Scalable Structured Sparsity for Mamba

☆Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order

☆CausalPhysics: Unifying Semantic Reasoning, Physical Dynamics, and Counterfactual Simulation in World Models

☆CausalSliders: Graph-Guided LoRA Interventions for Causally Consistent Image Editing

☆CausalSpatial: A Benchmark for Object-Centric Causal Spatial Reasoning

☆Cognitive Digital Twin Framework: Modeling and Real-Time Decision Making

☆Coherence‑Validated Causal World Models for Multi‑Scale Alzheimer’s Disease Progression and Pharmacologic Reversal

☆Compositional Planning with Jumpy World Models

☆Computer-Using World Model

☆Consistent Video World Model With Geometry-Aware Rotary Position Embedding

☆Cross-View World Models

☆Ctrl-World: A Controllable Generative World Model for Robot Manipulation

☆DexSIM: Real-time Dexterous Simulation with Unified Causal Video Diffusion

☆Do LLMs Build Spatial World Models? Evidence from Grid-World Maze Tasks

☆Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction [Tiny Paper]

☆Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents

☆EGO-FLIGHT: Egocentric Grounding of Order for Frame-Level Inference in General Human Timelines

☆ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction

☆Environment Maps: Structured Environmental Representations for Long-Horizon Agents

☆Evidential Latent World Models for Safe Model-based Reinforcement Learning

☆EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory

☆FluIDWorld: Fluid-like Interactive Dynamics for 4D Worlds

☆GridWM-Judge: Evaluating Vision-Language Model Judges in Grid Worlds via World Model Deficits

☆Grounding Generated Videos in Feasible Plans via World Models

☆H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model

☆Hierarchical Latent Action Model

☆Hierarchical World Models for Strategic AI Agents: Bridging Simulation and Reality through Multi-Fidelity Learning

☆Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning

☆Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation

☆LaMo: A Latent Motion World Model for Long-Horizon Prediction

☆Latent Imagination Thinking: Beyond Recursive Models for Reasoning

☆LatentGS: Probabilistic Densification for Efficient, Compact, and Faster 3D Gaussian Splatting

☆Learning Evolving Latent Strategies for Multi-Agent Language Systems without Model Fine-Tuning

☆Learning Navigable World Models via Latent Energy Shaping

☆Lifting Ego World Models for Planning and Control

☆Mnemo: Policy Learning Accelerated by Experience

☆Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

☆Model Space Reasoning as Search in Feedback Space for Planning Domain Generation

☆Model-Based Meta-Learning for Algorithm Discovery

☆Motion Attribution for Video Generation

☆MULTI-COMPONENT OUTCOME PREDICTION FOR ENTERPRISE ROUTING VIA HIERARCHICAL CREDIT ASSIGNMENT

☆Multi-scale Predictive Representations for Goal-conditioned Reinforcement Learning

☆Neural Computers

☆Next Embedding Prediction Makes World Models Stronger

☆Parallel Stochastic Gradient-Based Planning for World Models

☆Physical Informed Driving World Models

☆PhysLang: a Small Diagnostic Framework for Language-Grounded World Modeling

☆Planning with Unified Multimodal Models

☆PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation

☆PREDICTING CAMERA POSE FROM PERSPECTIVE DESCRIPTIONS FOR SPATIAL REASONING

☆ProgressLM: Towards Progress Reasoning in Vision-Language Models

☆Reinforcement Learning with World Models for Optimizing Alzheimer’s Disease Treatment Timing and Dosing

☆Rethinking Video Generation Model for the Embodied World

☆Reward-Forcing: Autoregressive Video Generation with Reward Feedback

☆RigidBench: Evaluating Rigid-Body Physics in Video Generation Models

☆Robustness in the Face of Partial Identifiability in Reward Learning Problems

☆Safe Context Switching for Agents in the Wild: Mitigating Subspace Interference via Orthogonal Adaptation

☆Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game

☆Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation

☆SpaRRTa: A Synthetic Benchmark for Evaluating Spatial Intelligence in Visual Foundation Models

☆Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation

☆Spiking Neural Networks for Continuous Control: Neuromorphic Reinforcement Learning in Conventional Computing

☆stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation [Tiny Paper]

☆Structure from Diffusion: Taming Video Diffusion Models for Camera Pose Estimation in Dynamic Videos

☆the Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

[Tiny Paper] Safe Streaming Flow Planning by Aligning Generation with Execution

[Tiny Paper] GEST-Engine: Controllable Multi-Actor Video Synthesis with Perfect Spatiotemporal Annotations

[Tiny Paper] Integrating Simulation and Chain-of-thought Reasoning in Multimodal-Language Models For Physical Reasoning

[Tiny Paper] Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces

[Tiny Paper] Modular Training-Free Construction of Executable 3D Worlds from Narrative Text

[Tiny Paper] Probabilistic Dreaming for World Models

[Tiny Paper] Shortcut World Models: Learning to Leap, Not Step

[TINY PAPER] Temporal Reversal Asymmetry: A Physics-Inspired Metric for Evaluating World Models

[Tiny Paper] Toward Pixel-Grounded World Models for Powered Descent: A Rocket Landing Benchmark and Expert Baseline

A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents

A Lightweight Library for Energy-Based Joint-Embedding Predictive Architectures

Action Shapley: A training data selection metric for Training World Models for Reinforcement Learning

Active World-Model with 4D-informed Re- trieval for Exploration and Awareness

Beyond Patient Invariance: Learning Cardiac Dynamics via Action-Conditioned JEPAs

BlockMamba: Efficient Scalable Structured Sparsity for Mamba

Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order

CausalPhysics: Unifying Semantic Reasoning, Physical Dynamics, and Counterfactual Simulation in World Models

CausalSliders: Graph-Guided LoRA Interventions for Causally Consistent Image Editing

CausalSpatial: A Benchmark for Object-Centric Causal Spatial Reasoning

Cognitive Digital Twin Framework: Modeling and Real-Time Decision Making

Coherence‑Validated Causal World Models for Multi‑Scale Alzheimer’s Disease Progression and Pharmacologic Reversal

Compositional Planning with Jumpy World Models

Computer-Using World Model

Consistent Video World Model With Geometry-Aware Rotary Position Embedding

Cross-View World Models

Ctrl-World: A Controllable Generative World Model for Robot Manipulation

DexSIM: Real-time Dexterous Simulation with Unified Causal Video Diffusion

Do LLMs Build Spatial World Models? Evidence from Grid-World Maze Tasks

Dreamer-CDP: Improving Reconstruction-free World Models Via Continuous Deterministic Representation Prediction [Tiny Paper]

Dyna-Think: Synergizing Reasoning, Acting, and World Model Simulation in AI Agents

EGO-FLIGHT: Egocentric Grounding of Order for Frame-Level Inference in General Human Timelines

ENACT: Evaluating Embodied Cognition with World Modeling of Egocentric Interaction

Environment Maps: Structured Environmental Representations for Long-Horizon Agents

Evidential Latent World Models for Safe Model-based Reinforcement Learning

EvoWorld: Evolving Panoramic World Generation with Explicit 3D Memory

FluIDWorld: Fluid-like Interactive Dynamics for 4D Worlds

GridWM-Judge: Evaluating Vision-Language Model Judges in Grid Worlds via World Model Deficits

Grounding Generated Videos in Feasible Plans via World Models

H-WM: Robotic Task and Motion Planning Guided by Hierarchical World Model

Hierarchical Latent Action Model

Hierarchical World Models for Strategic AI Agents: Bridging Simulation and Reality through Multi-Fidelity Learning

Identity-GRPO: Optimizing Multi-Human Identity-preserving Video Generation via Reinforcement Learning

Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation

LaMo: A Latent Motion World Model for Long-Horizon Prediction

Latent Imagination Thinking: Beyond Recursive Models for Reasoning

LatentGS: Probabilistic Densification for Efficient, Compact, and Faster 3D Gaussian Splatting

Learning Evolving Latent Strategies for Multi-Agent Language Systems without Model Fine-Tuning

Learning Navigable World Models via Latent Energy Shaping

Lifting Ego World Models for Planning and Control

Mnemo: Policy Learning Accelerated by Experience

Model Predictive Control with Differentiable World Models for Offline Reinforcement Learning

Model Space Reasoning as Search in Feedback Space for Planning Domain Generation

Model-Based Meta-Learning for Algorithm Discovery

Motion Attribution for Video Generation

MULTI-COMPONENT OUTCOME PREDICTION FOR ENTERPRISE ROUTING VIA HIERARCHICAL CREDIT ASSIGNMENT

Multi-scale Predictive Representations for Goal-conditioned Reinforcement Learning

Neural Computers

Next Embedding Prediction Makes World Models Stronger

Parallel Stochastic Gradient-Based Planning for World Models

Physical Informed Driving World Models

PhysLang: a Small Diagnostic Framework for Language-Grounded World Modeling

Planning with Unified Multimodal Models

PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation

PREDICTING CAMERA POSE FROM PERSPECTIVE DESCRIPTIONS FOR SPATIAL REASONING

ProgressLM: Towards Progress Reasoning in Vision-Language Models

Reinforcement Learning with World Models for Optimizing Alzheimer’s Disease Treatment Timing and Dosing

Rethinking Video Generation Model for the Embodied World

Reward-Forcing: Autoregressive Video Generation with Reward Feedback

RigidBench: Evaluating Rigid-Body Physics in Video Generation Models

Robustness in the Face of Partial Identifiability in Reward Learning Problems

Safe Context Switching for Agents in the Wild: Mitigating Subspace Interference via Orthogonal Adaptation

Seemingly Simple Planning Problems are Computationally Challenging: The Countdown Game

Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation

SpaRRTa: A Synthetic Benchmark for Evaluating Spatial Intelligence in Visual Foundation Models

Speedup Patch: Learning a Plug-and-Play Policy to Accelerate Embodied Manipulation

Spiking Neural Networks for Continuous Control: Neuromorphic Reinforcement Learning in Conventional Computing

stable-worldmodel-v1: Reproducible World Modeling Research and Evaluation [Tiny Paper]

Structure from Diffusion: Taming Video Diffusion Models for Camera Pose Estimation in Dynamic Videos

the Mouth is Not the Brain: Bridging Energy-Based World Models and Language Generation

Toward World Models for Epidemiology