MIT: A Large-Scale Dataset towards Multi-Modal Multilingual Instruction Tuning
Instruction tuning has significantly advanced large language models (LLMs) such as
ChatGPT, enabling them to align with human instructions across diverse tasks. However …
ChatGPT, enabling them to align with human instructions across diverse tasks. However …
A survey of multimodal large language model from a data-centric perspective
Multimodal large language models (MLLMs) enhance the capabilities of standard large
language models by integrating and processing data from multiple modalities, including text …
language models by integrating and processing data from multiple modalities, including text …
Blindly assess quality of in-the-wild videos via quality-aware pre-training and motion perception
Perceptual quality assessment of the videos acquired in the wilds is of vital importance for
quality assurance of video services. The inaccessibility of reference videos with pristine …
quality assurance of video services. The inaccessibility of reference videos with pristine …
DeepWSD: Projecting degradations in perceptual space to wasserstein distance in deep feature space
Existing deep learning-based full-reference IQA (FR-IQA) models usually predict the image
quality in a deterministic way by explicitly comparing the features, gauging how severely …
quality in a deterministic way by explicitly comparing the features, gauging how severely …
Locally adaptive structure and texture similarity for image quality assessment
The latest advances in full-reference image quality assessment (IQA) involve unifying
structure and texture similarity based on deep representations. The resulting Deep Image …
structure and texture similarity based on deep representations. The resulting Deep Image …
What factors affect multi-modal in-context learning? an in-depth exploration
Recently, rapid advancements in Multi-Modal In-Context Learning (MM-ICL) have achieved
notable success, which is capable of achieving superior performance across various tasks …
notable success, which is capable of achieving superior performance across various tasks …
No-reference bitstream-layer model for perceptual quality assessment of V-PCC encoded point clouds
No-reference bitstream-layer models for point cloud quality assessment (PCQA) use the
information extracted from a bitstream for real-time and nonintrusive quality monitoring. We …
information extracted from a bitstream for real-time and nonintrusive quality monitoring. We …
Task-specific normalization for continual learning of blind image quality models
In this paper, we present a simple yet effective continual learning method for blind image
quality assessment (BIQA) with improved quality prediction accuracy, plasticity-stability trade …
quality assessment (BIQA) with improved quality prediction accuracy, plasticity-stability trade …
A bayesian quality-of-experience model for adaptive streaming videos
The fundamental conflict between the enormous space of adaptive streaming videos and the
limited capacity for subjective experiment casts significant challenges to objective Quality-of …
limited capacity for subjective experiment casts significant challenges to objective Quality-of …
Adaptive structure and texture similarity metric for image quality assessment and optimization
Objective Image Quality Assessment (IQA) aims to design computational models that can
automatically predict the perceived quality of images. The state-of-the-art full-reference IQA …
automatically predict the perceived quality of images. The state-of-the-art full-reference IQA …