Concept-skill transferability-based data selection for large vision-language models

J Lee, B Li, SJ Hwang - arxiv preprint arxiv:2406.10995, 2024 - arxiv.org
Instruction tuning, or supervised finetuning on extensive task-specific data, is necessary for
Large Vision-Language Models (LVLMs) to generalize well across a broad range of vision …

INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model

Y Ma, Z Wang, X Sun, W Lin, Q Zhou, J Ji… - arxiv preprint arxiv …, 2024 - arxiv.org
With advancements in data availability and computing resources, Multimodal Large
Language Models (MLLMs) have showcased capabilities across various fields. However …

ICONS: Influence Consensus for Vision-Language Data Selection

X Wu, M **a, R Shao, Z Deng, PW Koh… - arxiv preprint arxiv …, 2024 - arxiv.org
Visual Instruction Tuning typically requires a large amount of vision-language training data.
This data often containing redundant information that increases computational costs without …

Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities

Q Dai, D Zhang, JW Ma, H Peng - arxiv preprint arxiv:2501.12147, 2025 - arxiv.org
Selecting appropriate training data is crucial for effective instruction fine-tuning of large
language models (LLMs), which aims to (1) elicit strong capabilities, and (2) achieve …

TAROT: Targeted Data Selection via Optimal Transport

L Feng, F Nie, Y Liu, A Alahi - arxiv preprint arxiv:2412.00420, 2024 - arxiv.org
We propose TAROT, a targeted data selection framework grounded in optimal transport
theory. Previous targeted data selection methods primarily rely on influence-based greedy …

Mastering Collaborative Multi-modal Data Selection: A Focus on Informativeness, Uniqueness, and Representativeness

Q Yu, Z Shen, Z Yue, Y Wu, W Zhang, Y Li, J Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Instruction tuning fine-tunes pre-trained Multi-modal Large Language Models (MLLMs) to
handle real-world tasks. However, the rapid expansion of visual instruction datasets …

Beyond Similarity: A Gradient-based Graph Method for Instruction Tuning Data Selection

Y Zhao, L Du, X Ding, Y Ouyang, H Wang… - arxiv preprint arxiv …, 2025 - arxiv.org
Large language models (LLMs) have shown great potential across various industries due to
their remarkable ability to generalize through instruction tuning. However, the limited …