Quality assessment in the era of large models: A survey

Z Zhang, Y Zhou, C Li, B Zhao, X Liu, G Zhai - arxiv preprint arxiv …, 2024 - arxiv.org
Quality assessment, which evaluates the visual quality level of multimedia experiences, has
garnered significant attention from researchers and has evolved substantially through …

Misc: Ultra-low bitrate image semantic compression driven by large multimodal model

C Li, G Lu, D Feng, H Wu, Z Zhang, X Liu… - … on Image Processing, 2024 - ieeexplore.ieee.org
With the evolution of storage and communication protocols, ultra-low bitrate image
compression has become a highly demanding topic. However, all existing compression …

Subjective and objective quality-of-experience assessment for 3d talking heads

Y Zhou, Z Zhang, W Sun, X Liu, X Min… - Proceedings of the 32nd …, 2024 - dl.acm.org
In recent years, immersive communication has emerged as a compelling alternative to
traditional video communication methods. One prospective avenue for immersive …

Geometry-Aware Video Quality Assessment for Dynamic Digital Human

Z Zhang, Y Zhou, W Sun, X Min… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Dynamic Digital Humans (DDHs) are 3D digital models that are animated using predefined
motions and are inevitably bothered by noise/shift during the generation process and …

Q-Refine: A Perceptual Quality Refiner for AI-Generated Image

C Li, H Wu, Z Zhang, H Hao, K Zhang, L Bai… - arxiv preprint arxiv …, 2024 - arxiv.org
With the rapid evolution of the Text-to-Image (T2I) model in recent years, their unsatisfactory
generation result has become a challenge. However, uniformly refining AI-Generated …

Occupancy Map Guided Attributes Artifacts Removal for Video-Based Point Cloud Compression

P Chen, S Wang, Z Li - ACM Transactions on Multimedia Computing …, 2024 - dl.acm.org
Point clouds offer realistic 3D representations of objects and scenes at the expense of large
data volumes. To represent such data compactly in real-world applications, Video-Based …

MEMO-Bench: A Multiple Benchmark for Text-to-Image and Multimodal Large Language Models on Human Emotion Analysis

Y Zhou, Z Zhang, J Cao, J Jia, Y Jiang, F Wen… - arxiv preprint arxiv …, 2024 - arxiv.org
Artificial Intelligence (AI) has demonstrated significant capabilities in various fields, and in
areas such as human-computer interaction (HCI), embodied intelligence, and the design …

Learning Disentangled Representations for Perceptual Point Cloud Quality Assessment via Mutual Information Minimization

Z Shan, Y Zhang, Y Liu, Y Xu - arxiv preprint arxiv:2411.07936, 2024 - arxiv.org
No-Reference Point Cloud Quality Assessment (NR-PCQA) aims to objectively assess the
human perceptual quality of point clouds without relying on pristine-quality point clouds for …

Paps-ovqa: Projection-aware patch sampling for omnidirectional video quality assessment

C Li, Z Zhang, H Wu, K Zhang, L Bai… - … on Circuits and …, 2024 - ieeexplore.ieee.org
In immersive multimedia systems, the perceptual quality model of omnidirectional video is
indispensable. However, to cope with its resolution that is several times higher than ordinary …

Simple Baselines for Projection-based Full-reference and No-reference Point Cloud Quality Assessment

Z Zhang, Y Zhou, W Sun, X Min, G Zhai - arxiv preprint arxiv:2310.17147, 2023 - arxiv.org
Point clouds are widely used in 3D content representation and have various applications in
multimedia. However, compression and simplification processes inevitably result in the loss …