Towards open-ended visual quality comparison
Comparative settings (eg. pairwise choice, listwise ranking) have been adopted by a wide
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …
A comprehensive study of multimodal large language models for image quality assessment
Abstract While Multimodal Large Language Models (MLLMs) have experienced significant
advancement in visual understanding and reasoning, their potential to serve as powerful …
advancement in visual understanding and reasoning, their potential to serve as powerful …
Quality assessment in the era of large models: A survey
Quality assessment, which evaluates the visual quality level of multimedia experiences, has
garnered significant attention from researchers and has evolved substantially through …
garnered significant attention from researchers and has evolved substantially through …
Descriptive image quality assessment in the wild
With the rapid advancement of Vision Language Models (VLMs), VLM-based Image Quality
Assessment (IQA) seeks to describe image quality linguistically to align with human …
Assessment (IQA) seeks to describe image quality linguistically to align with human …
Towards dimension-enriched underwater image quality assessment
The absorption and scattering of light in the water medium naturally impair the quality of
underwater images, leading to multiple degradation effects including color casts, reduced …
underwater images, leading to multiple degradation effects including color casts, reduced …
Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities
The advent of AI has influenced many aspects of human life, from self-driving cars and
intelligent chatbots to text-based image and video generation models capable of creating …
intelligent chatbots to text-based image and video generation models capable of creating …
T2i-scorer: Quantitative evaluation on text-to-image generation via fine-tuned large multi-modal models
Text-to-image (T2I) generation is a pivotal and core interest within the realm of AI content
generation. Amid the swift advancements of both open-source (such as Stable Diffusion) …
generation. Amid the swift advancements of both open-source (such as Stable Diffusion) …
Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
With the rapid advancement of Multi-modal Large Language Models (MLLMs), MLLM-based
Image Quality Assessment (IQA) methods have shown promising performance in linguistic …
Image Quality Assessment (IQA) methods have shown promising performance in linguistic …
Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics
Human perception of similarity across uni-and multimodal inputs is highly complex, making it
challenging to develop automated metrics that accurately mimic it. General purpose vision …
challenging to develop automated metrics that accurately mimic it. General purpose vision …
Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs
With the rising interest in research on Large Multi-modal Models (LMMs) for video
understanding, many studies have emphasized general video comprehension capabilities …
understanding, many studies have emphasized general video comprehension capabilities …