CVPR 2025 Past Large language modelsRoboticsComputer vision

The first CVPR workshop on 3D Vision Language Models (VLMs) for Robotics Manipulation: Opportunities and Challenges

Robo-3Dvlm

Submission deadline
May 16, 2025, 23:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (6)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Agentic Language-Grounded Adaptive Robotic Assembly

    Nicholas Cote, Jaimyn Drake, Sachin Chitta · PDF
  2. Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models

    Chenrui Tie, Shengxiang Sun, Jinxuan Zhu, Yiwei Liu, Jingxiang Guo, Yue Hu, Haonan Chen, Junting Chen, Ruihai Wu, Lin Shao · PDF
  3. Mono3D-VLDL: Perception-Aware Vision-Language Dictionary Learning for Multimodal Fusion in Monocular 3D Grounding

    Tiantian Wang, Haixiang Hu, Haoxiang Liang, zhaoyang zhang, Tinglei Jia, Shuwen Huang, Yongfeng Bu, Xiaowei Qian, Rong Wang, Kaifei Li, Hanke Luo, Hua Cui · PDF
  4. Online Language Splatting

    Saimouli Katragadda, Cho-Ying Wu, Yuliang Guo, Xinyu Huang, Guoquan Huang, Liu Ren · PDF
  5. The One RING: a Robotic Indoor Navigation Generalist

    Ainaz Eftekhar, Luca Weihs, Rose Hendrix, Ege Caglar, Jordi Salvador, Alvaro Herrasti, Winson Han, Eli VanderBilt, Aniruddha Kembhavi, Ali Farhadi, Ranjay Krishna, Kiana Ehsani, Kuo-Hao Zeng · PDF
  6. ZeroMimic: Distilling Robotic Manipulation Skills from Web Videos

    Junyao Shi, Zhuolun Zhao, Tianyou Wang, Ian Pedroza, Amy Luo, Jie Wang, Yecheng Jason Ma, Dinesh Jayaraman · PDF