CVPR 2025 Past Large language modelsComputer vision
CVPR 2025 Workshop Vision Language Models For All
VLMs4All 2025
- Submission deadline
- May 1, 2025, 12:00 UTC imported from OpenReview — check the website for extensions
- Submission portal
- OpenReview
- Notes
- Auto-imported from the OpenReview venue record on 2026-06-10 — please verify and enrich (topics are keyword-guessed).
Accepted papers (18)
Fetched from OpenReview (v2) on 2026-06-10.
-
Behind Maya: Building a Multilingual Vision Language Model
-
Beyond Words: Exploring Cultural Value Sensitivity in Multimodal Models
-
Challenging Multimodal LLMs with African Standardized Exams: A Document VQA Evaluation
-
Chitrakshara: A Large Multilingual Multimodal Dataset for Indian languages
-
CONCAP: Seeing Beyond English with Retrieval-Augmented Captioning
-
Cultural Awareness in Vision-Language Models: A Cross-Country Exploration
-
Culturally-Aware Financial Fraud Detection Using Vision-Language Models
-
CultureShift: Mapping Temporal Cultural Evolution in Vision-Language Models
-
CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries
-
Enhancing Cultural Awareness in Vision-Language Models: The Power of Multimodal Few-Shot Prompting
-
Enhancing Vision-Language Models for Global Cultural Understanding through Semantic Expansion and Diversity Reranking
-
GeoDiv: Measuring Concept Diversity of Images Across Geographical Regions
-
JEEM: Vision-Language Understanding in Four Arabic Dialects
-
Nayana: A Foundation for Document-Centric Vision-Language Models via Multi-Task, Multimodal, and Multilingual Data Synthesis
-
RoadSocial: A Diverse VideoQA Dataset and Benchmark for Road Event Understanding from Social Video Narratives
-
Synthetic Document Question Answering in Hungarian
-
The use of multi-modal models and machine learning tech-niques to improve the efficiency and accuracy of geospatial data analysis
-
Why do LLaVA Vision-Language Models Reply to Images in English?