Data-efficient Fine-tuning for LLM-based Recommendation

X Lin, W Wang, Y Li, S Yang, F Feng, Y Wei… - Proceedings of the 47th …, 2024 - dl.acm.org
Leveraging Large Language Models (LLMs) for recommendation has recently garnered
considerable attention, where fine-tuning plays a key role in LLMs' adaptation. However, the …

Ideal: Influence-driven selective annotations empower in-context learners in large language models

S Zhang, X **a, Z Wang, LH Chen, J Liu, Q Wu… - arxiv preprint arxiv …, 2023 - arxiv.org
In-context learning is a promising paradigm that utilizes in-context examples as prompts for
the predictions of large language models. These prompts are crucial for achieving strong …

Spanning training progress: Temporal dual-depth scoring (tdds) for enhanced dataset pruning

X Zhang, J Du, Y Li, W **e… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Dataset pruning aims to construct a coreset capable of achieving performance comparable
to the original full dataset. Most existing dataset pruning methods rely on snapshot-based …

Refined Coreset Selection: Towards Minimal Coreset Size under Model Performance Constraints

X **a, J Liu, S Zhang, Q Wu, H Wei… - Forty-first International …, 2024 - openreview.net
Coreset selection is powerful in reducing computational costs and accelerating data
processing for deep learning algorithms. It strives to identify a small subset from large-scale …

Coreset selection with prioritized multiple objectives

X **a, J Liu, S Zhang, Q Wu, T Liu - arxiv preprint arxiv:2311.08675, 2023 - arxiv.org
Coreset selection is powerful in reducing computational costs and accelerating data
processing for deep learning algorithms. It strives to identify a small subset from large-scale …

Mind the Boundary: Coreset Selection via Reconstructing the Decision Boundary

S Yang, Z Cao, S Guo, R Zhang, P Luo… - … on Machine Learning, 2024 - openreview.net
Existing paradigms of pushing the state of the art require exponentially more training data in
many fields. Coreset selection seeks to mitigate this growing demand by identifying the most …

Effective pruning of web-scale datasets based on complexity of concept clusters

A Abbas, E Rusak, K Tirumala, W Brendel… - arxiv preprint arxiv …, 2024 - arxiv.org
Utilizing massive web-scale datasets has led to unprecedented performance gains in
machine learning models, but also imposes outlandish compute requirements for their …

Efficient architecture search via bi-level data pruning

C Tu, P Ye, W Lin, H Ye, C Yu, T Chen… - … on Circuits and …, 2024 - ieeexplore.ieee.org
Improving the efficiency of Neural Architecture Search (NAS) is a challenging but significant
task that has received much attention. Previous studies mainly adopt the Differentiable …

Are Sparse Neural Networks Better Hard Sample Learners?

Q **ao, B Wu, L Yin, CN Gadzinski, T Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
While deep learning has demonstrated impressive progress, it remains a daunting
challenge to learn from hard samples as these samples are usually noisy and intricate …

Dynamic data pruning for automatic speech recognition

Q **ao, P Ma, A Fernandez-Lopez, B Wu, L Yin… - arxiv preprint arxiv …, 2024 - arxiv.org
The recent success of Automatic Speech Recognition (ASR) is largely attributed to the ever-
growing amount of training data. However, this trend has made model training prohibitively …