CVPR 2024 · Past · Robotics · Computer vision

First Vision and Language for Autonomous Driving and Robotics Workshop

VLADR 2024

Submission deadline: Apr 14, 2024, 23:59 UTC (imported from OpenReview; check the website for extensions)
Submission portal: OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (15)

Fetched from OpenReview (v2) on 2026-06-10.

  1. AIDE: An Automatic Data Engine for Object Detection in Autonomous Driving

    Mingfu Liang, Jong-Chyi Su, Samuel Schulter, Sparsh Garg, Shiyu Zhao, Ying Wu, Manmohan Chandraker · PDF
  2. Ambiguous Annotations: When is a Pedestrian not a Pedestrian?

    Luisa Schwirten, Jannes Scholz, Daniel Kondermann, Janis Keuper · PDF
  3. ATLAS: Adaptive Landmark Acquisition using LLM-Guided Navigation

    Utteja Kallakuri, Bharat Prakash, Arnab Neelim Mazumder, Hasib-Al Rashid, Nicholas R Waytowich, Tinoosh Mohsenin · PDF
  4. Collision Avoidance Metric for 3D Camera Evaluation

    Vage Taamazyan, Alberto Dall'Olio, Agastya Kalra · PDF
  5. DriveLM: Driving with Graph Visual Question Answering

    Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, Jens Beißwenger, Ping Luo, Andreas Geiger, Hongyang Li · PDF
  6. Driver Activity Classification Using Generalizable Representations from Vision-Language Models

    Ross Greer, Mathias Viborg Andersen, Andreas Møgelmose, Mohan Trivedi · PDF
  7. DriVLMe: Enhancing LLM-based Autonomous Driving Agents with Embodied and Social Experiences

    Yidong Huang, Jacob Sansom, Ziqiao Ma, Felix Gervits, Joyce Chai · PDF
  8. Evolutionary Reward Design and Optimization with Multimodal Large Language Models

    Ali Emre Narin · PDF
  9. Improving End-To-End Autonomous Driving with Synthetic Data from Latent Diffusion Models

    Harsh Goel, Sai Shankar Narasimhan · PDF
  10. Language-Driven Active Learning for Diverse Open-Set 3D Object Detection

    Ross Greer, Bjørk Antoniussen, Andreas Møgelmose, Mohan Trivedi · PDF
  11. Multi-Frame, Lightweight & Efficient Vision-Language Models for Question Answering in Autonomous Driving

    Akshay Gopalkrishnan, Ross Greer, Mohan Trivedi · PDF
  12. On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities

    Xiyang Wu, Ruiqi Xian, Tianrui Guan, Jing Liang, Souradip Chakraborty, Fuxiao Liu, Brian M. Sadler, Dinesh Manocha, Amrit Bedi · PDF
  13. Open6DOR: Benchmarking Open-instruction 6-DoF Object Rearrangement and A VLM-based Approach

    Yufei Ding, Haoran Geng, Chaoyi Xu, Xiaomeng Fang, Jiazhao Zhang, Songlin Wei, Qiyu Dai, Zhizheng Zhang, He Wang · PDF
  14. Optimizing Visual Question Answering Models for Driving: Bridging the Gap Between Human and Machine Attention Patterns

    Kaavya Rekanar, Martin Hayes, Ganesh Sistu, Ciaran Eising · PDF
  15. RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation

    Hanxiao Jiang, Binghao Huang, Ruihai Wu, Zhuoran Li, Shubham Garg, Hooshang Nayyeri, Shenlong Wang, Yunzhu Li · PDF