MIT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning

L Li, Y Yin, S Li, L Chen, P Wang, S Ren, M Li… - arxiv preprint arxiv …, 2023 - arxiv.org
Instruction tuning has significantly advanced large language models (LLMs) such as
ChatGPT, enabling them to align with human instructions across diverse tasks. However …

A survey of multimodal large language model from a data-centric perspective

T Bai, H Liang, B Wan, Y Xu, X Li, S Li, L Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Multimodal large language models (MLLMs) enhance the capabilities of standard large
language models by integrating and processing data from multiple modalities, including text …

Blindly assess quality of in-the-wild videos via quality-aware pre-training and motion perception

B Li, W Zhang, M Tian, G Zhai… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Perceptual quality assessment of the videos acquired in the wilds is of vital importance for
quality assurance of video services. The inaccessibility of reference videos with pristine …

DeepWSD: Projecting degradations in perceptual space to wasserstein distance in deep feature space

X Liao, B Chen, H Zhu, S Wang, M Zhou… - Proceedings of the 30th …, 2022 - dl.acm.org
Existing deep learning-based full-reference IQA (FR-IQA) models usually predict the image
quality in a deterministic way by explicitly comparing the features, gauging how severely …

Locally adaptive structure and texture similarity for image quality assessment

K Ding, Y Liu, X Zou, S Wang, K Ma - Proceedings of the 29th ACM …, 2021 - dl.acm.org
The latest advances in full-reference image quality assessment (IQA) involve unifying
structure and texture similarity based on deep representations. The resulting Deep Image …

What factors affect multi-modal in-context learning? an in-depth exploration

L Qin, Q Chen, H Fei, Z Chen, M Li, W Che - arxiv preprint arxiv …, 2024 - arxiv.org
Recently, rapid advancements in Multi-Modal In-Context Learning (MM-ICL) have achieved
notable success, which is capable of achieving superior performance across various tasks …

No-reference bitstream-layer model for perceptual quality assessment of V-PCC encoded point clouds

Q Liu, H Su, T Chen, H Yuan… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
No-reference bitstream-layer models for point cloud quality assessment (PCQA) use the
information extracted from a bitstream for real-time and nonintrusive quality monitoring. We …

Task-specific normalization for continual learning of blind image quality models

W Zhang, K Ma, G Zhai, X Yang - IEEE Transactions on Image …, 2024 - ieeexplore.ieee.org
In this paper, we present a simple yet effective continual learning method for blind image
quality assessment (BIQA) with improved quality prediction accuracy, plasticity-stability trade …

A bayesian quality-of-experience model for adaptive streaming videos

Z Duanmu, W Liu, D Chen, Z Li, Z Wang… - ACM Transactions on …, 2023 - dl.acm.org
The fundamental conflict between the enormous space of adaptive streaming videos and the
limited capacity for subjective experiment casts significant challenges to objective Quality-of …

Adaptive structure and texture similarity metric for image quality assessment and optimization

K Ding, R Zhong, Z Wang, Y Yu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Objective Image Quality Assessment (IQA) aims to design computational models that can
automatically predict the perceived quality of images. The state-of-the-art full-reference IQA …