CVPR 2026 Past MultimodalNeuroscience

The 1st CogVL: Cognitive Foundations for Multimodal Models Workshop at CVPR 2026

CVPR 2026 Workshop CogVL

Submission deadline
Mar 7, 2026, 12:00 UTC
imported from OpenReview — check the website for extensions
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (17)

Fetched from OpenReview (v2) on 2026-06-10.

  1. Action Without Interaction: Probing the Physical Foundations of Video LMMs via Contact-Release Detection

    Daniel Harari, Michael Sidorov, Chen Shterental, Liel David, Abrham Kahsay Gebreselasie, Muhammad Haris Khan · PDF
  2. Benchmarking Attribute Discrimination in Infant-Scale Vision-Language Models

    Patrick Batsell, Satoshi Tsutsui, Bihan Wen · PDF
  3. Can Vision-Language Models Count? A Synthetic Benchmark and Analysis of Attention-Based Interventions

    Saurav Sengupta, Nazanin Moradinasab, Jiebei Liu, Donald E. Brown · PDF
  4. CounterBench: A Controllable Counterfactual Testbed Reveals Systematic Reasoning Failures in Vision-Language Models

    Aayam Bansal, Ishaan Gangwani · PDF
  5. CP-VLM: Causal Prompting for Human Intention Inference with Vision–Language Models

    KAZUKI OSAMURA, Hidetsugu Uchida, Narishige Abe · PDF
  6. Do Vision-Language Models Revise Beliefs or Just Rationalize? Evidence Update Prompting for Non-Monotonic Visual Reasoning

    Aayam Bansal, Ishaan Gangwani · PDF
  7. Jailbreaking Vision-Language Models Through the Visual Modality

    Aharon Azulay, Jan Dubiński, Zhuoyun Li, Atharv Mittal, Yossi Gandelsman · PDF
  8. Knowing When You Don’t Know: Metacognitive Uncertainty Calibration in Vision--Language Models

    Mahule Roy, Subhas Roy · PDF
  9. Latent-Stability Gated SAM: Detecting Hallucinated Segmentations under Domain Shift

    Muhammad Imran, Yugyung Lee · PDF
  10. Let Androids Dream of Electric Sheep: A Human-Inspired Image Metaphor Understanding and Reasoning Framework

    Chenhao Zhang, Yazhe Niu · PDF
  11. MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models

    Anh Thai, Stefan Stojanov, Zixuan Huang, Bikram Boote, James Matthew Rehg · PDF
  12. Multimodal Graph-of-Thoughts: Hypothesis-Verification Graphs for Multimodal Reasoning in Vision-Language Models

    Irina Belyaeva · PDF
  13. Relational Visual Similarity

    Thao Nguyen, Sicheng Mo, Krishna Kumar Singh, Yilin Wang, Jing Shi, Nicholas Kolkin, Eli Shechtman, Yong Jae Lee, Yuheng Li · PDF
  14. The Perceptual Observatory Characterizing Robustness and Grounding in MLLMs

    Tejas Anvekar, Fenil Bardoliya, Pavan K. Turaga, Chitta Baral, Vivek Gupta · PDF
  15. Think Slow, See Better? Dual-Process Prompting for Vision-Language Model Calibration

    Aayam Bansal, Ishaan Gangwani · PDF
  16. VCRBench: Exploring Long-form Causal Reasoning Capabilities of Large Video Language Models

    Pritam Sarkar, Ali Etemad · PDF
  17. Vision–Language Pretraining with Structured Distractor Augmentation

    Prasanth · PDF