- Academic Search

H Wu, H Zhu, Z Zhang, E Zhang, C Chen, L Liao… - … on Computer Vision, 2024 - Springer

Comparative settings (eg. pairwise choice, listwise ranking) have been adopted by a wide
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …

Enregistrer Citer Cité 36 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A comprehensive study of multimodal large language models for image quality assessment

T Wu, K Ma, J Liang, Y Yang, L Zhang - European Conference on …, 2024 - Springer

Abstract While Multimodal Large Language Models (MLLMs) have experienced significant
advancement in visual understanding and reasoning, their potential to serve as powerful …

Enregistrer Citer Cité 23 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Quality assessment in the era of large models: A survey

Z Zhang, Y Zhou, C Li, B Zhao, X Liu, G Zhai - arxiv preprint arxiv …, 2024 - arxiv.org

Quality assessment, which evaluates the visual quality level of multimedia experiences, has
garnered significant attention from researchers and has evolved substantially through …

Enregistrer Citer Cité 8 fois Autres articles Version HTML

[Free GPT-4]

[PDF] arxiv.org

Descriptive image quality assessment in the wild

Z You, J Gu, Z Li, X Cai, K Zhu, C Dong… - arxiv preprint arxiv …, 2024 - arxiv.org

With the rapid advancement of Vision Language Models (VLMs), VLM-based Image Quality
Assessment (IQA) seeks to describe image quality linguistically to align with human …

Enregistrer Citer Cité 10 fois Autres articles Les 2 versions Free GPT-4 Version HTML

Towards dimension-enriched underwater image quality assessment

Q Jiang, X Yi, L Ouyang, J Zhou… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

The absorption and scattering of light in the water medium naturally impair the quality of
underwater images, leading to multiple degradation effects including color casts, reduced …

Enregistrer Citer Cité 5 fois Autres articles

[Free GPT-4]

[PDF] arxiv.org

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

A Ghildyal, Y Chen, S Zadtootaghaj, N Barman… - arxiv preprint arxiv …, 2024 - arxiv.org

The advent of AI has influenced many aspects of human life, from self-driving cars and
intelligent chatbots to text-based image and video generation models capable of creating …

Enregistrer Citer Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] acm.org

T2i-scorer: Quantitative evaluation on text-to-image generation via fine-tuned large multi-modal models

H Wu, X Wu, C Li, Z Zhang, C Chen, X Liu… - Proceedings of the …, 2024 - dl.acm.org

Text-to-image (T2I) generation is a pivotal and core interest within the realm of AI content
generation. Amid the swift advancements of both open-source (such as Stable Diffusion) …

Enregistrer Citer Cité 1 fois Autres articles Les 2 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Z You, X Cai, J Gu, T Xue, C Dong - arxiv preprint arxiv:2501.11561, 2025 - arxiv.org

With the rapid advancement of Multi-modal Large Language Models (MLLMs), MLLM-based
Image Quality Assessment (IQA) methods have shown promising performance in linguistic …

Enregistrer Citer Autres articles Version HTML

[Free GPT-4]

[PDF] arxiv.org

Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics

S Ghazanfari, S Garg, N Flammarion… - arxiv preprint arxiv …, 2024 - arxiv.org

Human perception of similarity across uni-and multimodal inputs is highly complex, making it
challenging to develop automated metrics that accurately mimic it. General purpose vision …

Enregistrer Citer Autres articles Les 2 versions Free GPT-4 Version HTML

[Free GPT-4]

[PDF] arxiv.org

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs

Z Zhang, Z Jia, H Wu, C Li, Z Chen, Y Zhou… - arxiv preprint arxiv …, 2024 - arxiv.org

With the rising interest in research on Large Multi-modal Models (LMMs) for video
understanding, many studies have emphasized general video comprehension capabilities …

Enregistrer Citer Autres articles Les 3 versions Free GPT-4 Version HTML

Créer l'alerte

Citer

Recherche avancée

Enregistré dans Ma bibliothèque

2AFC prompting of large multimodal models for image quality assessment

Towards open-ended visual quality comparison

A comprehensive study of multimodal large language models for image quality assessment

Quality assessment in the era of large models: A survey

Descriptive image quality assessment in the wild

Towards dimension-enriched underwater image quality assessment

Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities

T2i-scorer: Quantitative evaluation on text-to-image generation via fine-tuned large multi-modal models

Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Towards Unified Benchmark and Models for Multi-Modal Perceptual Metrics

Q-Bench-Video: Benchmarking the Video Quality Understanding of LMMs