Perceptual video quality assessment: A survey

X Min, H Duan, W Sun, Y Zhu, G Zhai - Science China Information …, 2024 - Springer
Perceptual video quality assessment plays a vital role in the field of video processing due to
the existence of quality degradations introduced in various stages of video signal …

Cambrian-1: A fully open, vision-centric exploration of multimodal llms

S Tong, E Brown, P Wu, S Woo, M Middepogu… - arxiv preprint arxiv …, 2024 - arxiv.org
We introduce Cambrian-1, a family of multimodal LLMs (MLLMs) designed with a vision-
centric approach. While stronger language models can enhance multimodal capabilities, the …

Towards open-ended visual quality comparison

H Wu, H Zhu, Z Zhang, E Zhang, C Chen, L Liao… - … on Computer Vision, 2024 - Springer
Comparative settings (eg. pairwise choice, listwise ranking) have been adopted by a wide
range of subjective studies for image quality assessment (IQA), as it inherently standardizes …

A comprehensive study of multimodal large language models for image quality assessment

T Wu, K Ma, J Liang, Y Yang, L Zhang - European Conference on …, 2024 - Springer
Abstract While Multimodal Large Language Models (MLLMs) have experienced significant
advancement in visual understanding and reasoning, their potential to serve as powerful …

[HTML][HTML] LLMs and generative agent-based models for complex systems research

Y Lu, A Aleta, C Du, L Shi, Y Moreno - Physics of Life Reviews, 2024 - Elsevier
Abstract The advent of Large Language Models (LLMs) has significantly transformed
research across natural and social sciences, offering new paradigms for understanding …

Depicting beyond scores: Advancing image quality assessment through multi-modal language models

Z You, Z Li, J Gu, Z Yin, T Xue, C Dong - European Conference on …, 2024 - Springer
We introduce a Depict ed image Q uality A ssessment method (DepictQA), overcoming the
constraints of traditional score-based methods. DepictQA allows for detailed, language …

Q-align: Teaching lmms for visual scoring via discrete text-defined levels

H Wu, Z Zhang, W Zhang, C Chen, L Liao, C Li… - arxiv preprint arxiv …, 2023 - arxiv.org
The explosion of visual content available online underscores the requirement for an
accurate machine assessor to robustly evaluate scores across diverse types of visual …

Aesexpert: Towards multi-modality foundation model for image aesthetics perception

Y Huang, X Sheng, Z Yang, Q Yuan, Z Duan… - Proceedings of the …, 2024 - dl.acm.org
The highly abstract nature of image aesthetics perception (IAP) poses a significant
challenge for current multimodal large language models (MLLMs). The lack of human …

Aigc-vqa: A holistic perception metric for aigc video quality assessment

Y Lu, X Li, B Li, Z Yu, F Guan, X Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
With the development of generative models such as the diffusion model and auto-regressive
model AI-generated content (AIGC) is experiencing an explosive growth. Moreover existing …

Promptiqa: Boosting the performance and generalization for no-reference image quality assessment via prompts

Z Chen, H Qin, J Wang, C Yuan, B Li, W Hu… - European Conference on …, 2024 - Springer
Due to the diversity of assessment requirements in various application scenarios for the IQA
task, existing IQA methods struggle to directly adapt to these varied requirements after …