CVPR 2026 Past Computer visionMultimodalEducation

Computer Vision × Education: Building a Cross-Community Agenda for Multimodal Vision in Classrooms

CV4Edu

Submission deadline
TBA — know the deadline? Add it in one line
The file opens with a ready-to-fill template — takes about a minute.
Submission portal
OpenReview
Notes
Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).

Accepted papers (18)

Fetched from OpenReview (v2) on 2026-06-10.

  1. [UNI]101: An Educational Dataset for Introductory Computer Vision

    Ethan Seefried, Changsoo Jung, Videep Venkatesha, Trevor Chartier, Caleb Christian, Jack Fitzgerald, Mariah Bradford, Sifatul Anindho, Matthew Sturgeon, Nathaniel Blanchard · PDF
  2. AI-Assisted Competency Assessment from Egocentric Video in Simulation-Based Nursing Education

    Hanchen David Wang, Yilin Liu, Madison Mason, Surya Rayala, Gautam Biswas, Daniel Levin, Meiyi Ma · PDF
  3. ConfusionBench: An Expert-Validated Benchmark for Confusion Recognition and Localization in Educational Videos

    Lu Dong, Xiao Wang, Mark Frank, Srirangaraj Setlur, Venu Govindaraju, Ifeoma Nwogu · PDF
  4. Context Matters: Peer-Aware Student Behavioral Engagement Measurement via VLM Action Parsing and LLM Sequence Classification

    Ahmed Abdelkawy, Ahmed Elsayed, Asem Ali, Aly Farag, Thomas Tretter, Michael McIntyre · PDF
  5. Cross-modal Affinity-aligned Multimodal Learning Analytics for Predicting Student Collaboration Satisfaction in Game-Based Learning

    Wen-Hsin Tsai, Chia-Ming Lee, Yuk-Ying Tung · PDF
  6. Delta-Gated Incremental Multi-Forward-Pass Modeling for Robust Multimodal Classroom Video Understanding

    Chongyu He, Peter Youngs, Scott Acton · PDF
  7. Diagnosis of Human–Object Interaction Detectors for Real-World Educational Applications

    Divya Mereddy, Ashwin T S, Marcos Quinones Grueiro, Gautam Biswas · PDF
  8. Do Emotion Recognition Models Generalize to Classrooms? Robustness and Fairness Analysis

    Ashwin T S, Srigowri Mayasandra Prasanna, Joyce Horn Fonteles, Gautam Biswas · PDF
  9. Evaluating Web-trained Facial Expression Recognition in Naturalistic Collaborative Learning

    Sifatul Anindho, Videep Venkatesha, Nathaniel Blanchard · PDF
  10. From Emotion Recognition to Mind-Wandering Detection: A Comparative Analysis of Video-Based Emotion Foundation Models

    Ekta Sood, Sebastian Ricke, Trisha Mittal, Sidney K. DMello · PDF
  11. InterventionLens: A Multi-Agent Framework for Detecting ASD Intervention Strategies in Parent-Child Shared Reading

    Xiao Wang, Lu Dong, Ifeoma Nwogu, Srirangaraj Setlur, Venu Govindaraju · PDF
  12. MES-Bench: A Benchmark for Multimodal Elaborative Simplification and Comprehensibility Evaluation in Language Learning

    Martyna Gruszka, Risa Shinoda, Taiki Miyanishi, Takumi Hirose, Nakamasa Inoue · PDF
  13. Negative Evidence in the Classroom: Learning From What Vision Cannot Reliably See

    Mahule Roy, Subhas Roy · PDF
  14. ReSoFed: Reliability-Guided Model Souping for Robust Federated Learning in Heterogeneous Classroom Environments

    Muhammad Rafsan Kabir, Md Shopon, Marina Gavrilova · PDF
  15. Scaffolding Human Learning by Shaping Visual Environment

    Yuji Zhang, Duo Zhou, Bo Chen, Adi Chalasani, Noah Schroeder, H Chad Lane, ChengXiang Zhai · PDF
  16. Sequence-Based Identification of First-Person Camera Wearers in Third-Person Views

    Ziwei Zhao, Xizi Wang, Yuchen Wang, Feng Cheng, David J. Crandall · PDF
  17. Speech-Synchronized Whiteboard Generation via VLM-Driven Structured Drawing Representations

    Suraj Prasad, Pinak Mahapatra · PDF
  18. VLMath: A Multimodal Vision-Language System for Pedagogically Aligned Math Tutoring

    Mahsa Ardakani, Arshia Eslami, Ramtin Zand · PDF