Towards open-ended visual quality comparison

H Wu, H Zhu, Z Zhang, E Zhang, C Chen, L Liao… - … on Computer Vision, 2024 - Springer
Comparative settings (eg. pairwise choice, listwise ranking) have been adopted by a wide
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …

A comprehensive study of multimodal large language models for image quality assessment

T Wu, K Ma, J Liang, Y Yang, L Zhang - European Conference on …, 2024 - Springer
Abstract While Multimodal Large Language Models (MLLMs) have experienced significant
advancement in visual understanding and reasoning, their potential to serve as powerful …

Quality assessment in the era of large models: A survey

Z Zhang, Y Zhou, C Li, B Zhao, X Liu, G Zhai - arxiv preprint arxiv …, 2024 - arxiv.org
Quality assessment, which evaluates the visual quality level of multimedia experiences, has
garnered significant attention from researchers and has evolved substantially through …

Descriptive image quality assessment in the wild

Z You, J Gu, Z Li, X Cai, K Zhu, C Dong… - arxiv preprint arxiv …, 2024 - arxiv.org
With the rapid advancement of Vision Language Models (VLMs), VLM-based Image Quality
Assessment (IQA) seeks to describe image quality linguistically to align with human …

Towards dimension-enriched underwater image quality assessment

Q Jiang, X Yi, L Ouyang, J Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The absorption and scattering of light in the water medium naturally impair the quality of
underwater images, leading to multiple degradation effects including color casts, reduced …

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

A Ghildyal, Y Chen, S Zadtootaghaj, N Barman… - arxiv preprint arxiv …, 2024 - arxiv.org
The advent of AI has influenced many aspects of human life, from self-driving cars and
intelligent chatbots to text-based image and video generation models capable of creating …

T2i-scorer: Quantitative evaluation on text-to-image generation via fine-tuned large multi-modal models

H Wu, X Wu, C Li, Z Zhang, C Chen, X Liu… - Proceedings of the …, 2024 - dl.acm.org
Text-to-image (T2I) generation is a pivotal and core interest within the realm of AI content
generation. Amid the swift advancements of both open-source (such as Stable Diffusion) …

Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Z You, X Cai, J Gu, T Xue, C Dong - arxiv preprint arxiv:2501.11561, 2025 - arxiv.org
With the rapid advancement of Multi-modal Large Language Models (MLLMs), MLLM-based
Image Quality Assessment (IQA) methods have shown promising performance in linguistic …

Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics

S Ghazanfari, S Garg, N Flammarion… - arxiv preprint arxiv …, 2024 - arxiv.org
Human perception of similarity across uni-and multimodal inputs is highly complex, making it
challenging to develop automated metrics that accurately mimic it. General purpose vision …

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Z Zhang, Z Jia, H Wu, C Li, Z Chen, Y Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org
With the rising interest in research on Large Multi-modal Models (LMMs) for video
understanding, many studies have emphasized general video comprehension capabilities …