Benchmark evaluations, applications, and challenges of large vision language models: A survey
Multimodal Vision Language Models (VLMs) have emerged as a transformative technology
at the intersection of computer vision and natural language processing, enabling machines …
All languages matter: Evaluating LMMs on culturally diverse 100 languages
Existing Large Multimodal Models (LMMs) generally focus on only a few regions and
languages. As LMMs continue to improve, it is increasingly important to ensure they …
Survey of cultural awareness in language models: Text and beyond
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …
From local concepts to universals: Evaluating the multicultural understanding of vision-language models
Despite recent advancements in vision-language models, their performance remains
suboptimal on images from non-western cultures due to underrepresentation in training …
WorldCuisines: A massive-scale benchmark for multilingual and multicultural visual question answering on global cuisines
Vision Language Models (VLMs) often struggle with culture-specific knowledge, particularly
in languages other than English and in underrepresented cultural contexts. To evaluate their …
GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks
While numerous recent benchmarks focus on evaluating generic Vision-Language Models
(VLMs), they fall short in addressing the unique demands of geospatial applications. Generic …
Cultural Adaptation of Menus: A Fine-Grained Approach
Machine Translation of Culture-Specific Items (CSIs) poses significant challenges. Recent
work on CSI translation has shown some success using Large Language Models (LLMs) to …
Multi3Hate: Multimodal, Multilingual, and Multicultural Hate Speech Detection with Vision-Language Models
Warning: this paper contains content that may be offensive or upsetting. Hate speech
moderation on global platforms poses unique challenges due to the multimodal and …
LLM-GLOBE: A Benchmark Evaluating the Cultural Values Embedded in LLM Output
Immense effort has been dedicated to minimizing the presence of harmful or biased
generative content and better aligning AI output to human intention; however, research …
CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries
Vision-language models (VLMs) have advanced human-AI interaction but struggle with
cultural understanding, often misinterpreting symbols, gestures, and artifacts due to biases …