NeurIPS 2025 Past Other

1st Workshop on VLM4RWD @ NeurIPS 2025

VLM4RWD2025

Submission deadline
Nov 5, 2025, 11:59 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (17)

Fetched from OpenReview (v2) on 2026-06-10.

  1. A Comprehensive Survey of Multimodal LLMs for Scientific Discovery

    Liang Yan, Xu Jiang, Jian Ma, Yuhang Liu, Tian Bian, Qichao Wang, Abhishek Basu, Yu Rong, Tingyang Xu, Pengcheng Wu, Le Song, Imran Razzak, Junchi Yan, Zengfeng Huang, Yutong Xie · PDF
  2. Advancing Autonomous VLM Agents via Variational Subgoal-Conditioned Reinforcement Learning

    Qingyuan Wu, Jianheng Liu, Jianye HAO, Jun Wang, Kun Shao · PDF
  3. AMVICC: A Novel Benchmark for Cross-Modal Failure Mode Profiling for VLMs and IGMs

    Aahana Basappa, Pranay Goel, Anusri Karra, Anish Karra, Asa Gilmore, Kevin Zhu · PDF
  4. Closed-Task Validation: A More Robust and Efficient Proxy for Guiding VLM Training

    Enci Zhang, Z.Q. ZHANG, Jiahao Xie, Ruiqi Lu, Boyan Zhou, Cheng Yang · PDF
  5. Do Vision–Language Models Understand Visual Persuasiveness?

    Gyuwon Park · PDF
  6. Don’t Lag, RAG: Training-Free Adversarial Detection Using RAG

    Roie Kazoom, Raz Lapid, Moshe Sipper, Ofer Hadar · PDF
  7. Efficient Inference Scaling for Safety Assurance

    Ruizhong Qiu, Gaotang Li, Ting-Wei Li, Tianxin Wei, Jingrui He, Hanghang Tong · PDF
  8. Efficient Vision-Language Reasoning via Adaptive Token Pruning

    Xue li, Xiaonan Song · PDF
  9. Eureka: Intelligent Feature Engineering for Enterprise AI Cloud Resource Demand Prediction

    Hangxuan Li, Renjun Jia, Xuezhang Wu, zeqi zheng, Yunjie Qian, Xianling Zhang · PDF
  10. From Scenes to Semantics: PersianCLEVR for Bilingual 3D Visual Reasoning

    Kianoosh Vadaei, Melika Shirian, Arshia Hemmat, Mohammad Hassan Heydari, Ali Mamanpoosh, Afsaneh Fatemi · PDF
  11. From Vision to Action: Enabling Real-World Agentic VLMs

    Aravilli Atchuta Ram · PDF
  12. MedVCTP: Improving Accuracy and Explainability in Medical Visual Reasoning

    Aman Syed, Siwon Ryu, Nayan Saxena, Kevin Zhu · PDF
  13. MetaTPT: Meta Test-time Prompt Tuning for Vision-Language Models

    Yuqing Lei, Yingjun Du, Yawen Huang, Xiantong Zhen, Ling Shao · PDF
  14. Scene Understanding via Scene Representation Generation with Vision-Language Models

    Yuan Chen, Peng Shi · PDF
  15. Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning

    Zuyao You · PDF
  16. UpstreamQA: A Modular Framework for Explicit Reasoning on Video Question Answering Tasks

    Jason Nguyen, Ameet Rao, Alexander Chang, Ishaan Kumar, Erin Tan · PDF
  17. VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning

    Jingkun Ma, Runzhe Zhan, Yang Li, Di Sun, Hou Pong Chan, Lidia S. Chao, Derek F. Wong · PDF