CVPR 2026 Past EfficiencyComputer vision

1st Workshop on Video World Models: Interaction, Memory, Efficiency (Non-Proceedings Track)

CVPR 2026 Workshop VideoWorldModel

Submission deadline
TBA — know the deadline? Add it in one line
The file opens with a ready-to-fill template — takes about a minute.
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (24)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Building a Precise Video Language with Human–AI Oversight

    Siyuan Cen, Hewei Wang, Chancharik Mitra, Isaac Li, Yuhan Huang, Yu Tong Tiffany Ling, Irene Pi, Shihang Zhu, Yili Han, Yilun Du, Deva Ramanan, Zhiqiu Lin · PDF
  2. Causal State Compression for Long-Horizon Video World Models: A Bounded-Drift Theory and Efficient Architecture

    Kaustubh S. Bukkapatnam, Siddharth Karuturi · PDF
  3. Causal State Entropy Bounds on Predictive Horizons in Video World Models

    Siddharth Karuturi, Kaustubh S. Bukkapatnam · PDF
  4. DECOMWM: Interpretable Reward Decomposition for World-Model-Based Trajectory Selection

    Yun Sang Nam, KimJinChan · PDF
  5. Dexterous World Models

    Byungjun Kim, Taeksoo Kim, Junyoung Lee, Hanbyul Joo · PDF
  6. EgoControl: Controllable Egocentric Video Generation via 3D Full-Body Poses

    Enrico Pallotta, Sina Mokhtarzadeh Azar, Lars Doorenbos, Serdar Ozsoy, Umar Iqbal, Juergen Gall · PDF
  7. Forecasting Motion in the Wild

    Neerja Thakkar, Shiry Ginosar, Jacob C Walker, Jitendra Malik, Joao Carreira, Carl Doersch · PDF
  8. Future Optical Flow Prediction Improves Robot Control & Video Generation

    Kanchana Ranasinghe, Honglu Zhou, Yu Fang, Luyu Yang, Le Xue, Ran Xu, Caiming Xiong, silvio savarese, Michael S Ryoo, Juan Carlos Niebles · PDF
  9. Inference-Time Planning with Action-Conditioned Video Models for Generalizable Robot Manipulation

    Zhiting Mei, Yanbo Xu, Tenny Yin, Ola Sho, Anirudha Majumdar · PDF
  10. Is Your Driving World Model an All-Around Player?

    Lingdong Kong, Alan Liang, Tianyi Yan, Hongsi Liu, Yu Yang, Ziqi Huang, Xian Sun, Wei Yin, Jialong Zuo, Yixuan Hu, Dekai Zhu, Dongyue Lu, Youquan Liu, Guangfeng Jiang, Linfeng Li, Xiangtai Li, Long Zhuo, Lai Xing Ng, Benoit R Cottereau, Changxin Gao, Liang Pan, Wei Tsang Ooi, Ziwei Liu · PDF
  11. NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos

    Yuxue Yang, Lue Fan, Ziqi Shi, Junran Peng, Feng Wang, Zhaoxiang Zhang · PDF
  12. Objects in Generated Videos Are Slower Than They Appear: Models Suffer Sub-Earth Gravity and Don't Know Galileo's Principle...for now

    Varun Varma Thozhiyoor, Shivam Tripathi, Venkatesh Babu Radhakrishnan, Anand Bhattad · PDF
  13. Olaf-World: Orienting Latent Actions for Video World Modeling

    Yuxin Jiang, Yuchao Gu, Ivor Tsang, Mike Zheng Shou · PDF
  14. OmniView: An All-Seeing Diffusion Model for 3D and 4D View Synthesis

    Xiang Fan, Sharath Girish, Vivek Ramanujan, Chaoyang Wang, Ashkan Mirzaei, Peter Sushko, Aliaksandr Siarohin, Sergey Tulyakov, Ranjay Krishna · PDF
  15. Rays as Pixels: Learning a Joint Distribution of Videos and Camera Trajectories

    Wonbong Jang, Shikun Liu, Soubhik Sanyal, Juan Camilo Perez, Kam Woh Ng, Juan-Manuel Perez-Rua, Yiannis Douratsos, Tao Xiang · PDF
  16. RoboWM-Bench: A Benchmark for Evaluating World Models in Robotic Manipulation

    Feng Jiang, Yang Chen, Jingkai Xu, Yuchen Liu, Haifeng Wang, Zhenhao Shen, Jasper Lu, Shu Chen, Shengze Huang, Yuanfei Wang, Ruihai Wu · PDF
  17. SEGAR: Selective Enhancement for Generative Augmented Reality

    Fanjun Bu, Chenyang Yuan, Hiroshi Yasuda · PDF
  18. Spectral World Models: Provably Consistent Long-Horizon Video Generation via Koopman Operator Decomposition

    Siddharth Karuturi, Kaustubh S. Bukkapatnam · PDF
  19. The Pulse of Motion: Measuring Physical Frame Rate from Visual Dynamics

    Xiangbo Gao, Mingyang Wu, Siyuan Yang, Jiongze Yu, Pardis Taghavi, Fangzhou Lin, Zhengzhong Tu · PDF
  20. VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

    Sixiao Zheng, Minghao Yin, Wenbo Hu, Xiaoyu Li, Ying Shan, Yanwei Fu · PDF
  21. Video Models Reason Early: Exploiting Plan Commitment for Maze Solving

    Kaleb Newman, Tyler Zhu, Olga Russakovsky · PDF
  22. WFM-Eval: Interpretable Error Diagnostics for Video World Models in Robotics

    Sahil Khose, Mengqi Zhang, Prithvijit Chattopadhyay, Judy Hoffman · PDF
  23. WorldCam: Interactive Autoregressive 3D Gaming Worlds with Camera Pose as a Unifying Geometric Representation

    Jisu Nam, Yicong Hong, Chun-Hao Paul Huang, Feng Liu, JoungBin Lee, Jiyoung Kim, Siyoon Jin, Yunsung Lee, Jaeyoon Jung, Suhwan Choi, Seungryong Kim, Yang Zhou · PDF
  24. WorldPack: Dynamic Frame Compression for Long-context Video World Modeling

    Yuta Oshima, Yusuke Iwasawa, Masahiro Suzuki, Yutaka Matsuo, Hiroki Furuta · PDF