How much are llms contaminated? a comprehensive survey and the llmsanitize library

M Ravaut, B Ding, F Jiao, H Chen, X Li, R Zhao… - arxiv preprint arxiv …, 2024 - arxiv.org
With the rise of Large Language Models (LLMs) in recent years, new opportunities are
emerging, but also new challenges, and contamination is quickly becoming critical …

A survey on data quality dimensions and tools for machine learning

Y Zhou, F Tu, K Sha, J Ding, H Chen - arxiv preprint arxiv:2406.19614, 2024 - arxiv.org
Machine learning (ML) technologies have become substantial in practically all aspects of
our society, and data quality (DQ) is critical for the performance, fairness, robustness, safety …

Large language model-brained gui agents: A survey

C Zhang, S He, J Qian, B Li, L Li, S Qin, Y Kang… - arxiv preprint arxiv …, 2024 - arxiv.org
GUIs have long been central to human-computer interaction, providing an intuitive and
visually-driven way to access and interact with digital systems. The advent of LLMs …

See what llms cannot answer: A self-challenge framework for uncovering llm weaknesses

Y Chen, Y Liu, J Yan, X Bai, M Zhong, Y Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
The impressive performance of Large Language Models (LLMs) has consistently surpassed
numerous human-designed benchmarks, presenting new challenges in assessing the …

The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective

Z Qin, D Chen, W Zhang, L Yao, Y Huang… - arxiv preprint arxiv …, 2024 - arxiv.org
The rapid development of large language models (LLMs) has been witnessed in recent
years. Based on the powerful LLMs, multi-modal LLMs (MLLMs) extend the modality from …

On pre-training of multimodal language models customized for chart understanding

WC Fan, YC Chen, M Liu, L Yuan, L Sigal - arxiv preprint arxiv …, 2024 - arxiv.org
Recent studies customizing Multimodal Large Language Models (MLLMs) for domain-
specific tasks have yielded promising results, especially in the field of scientific chart …

Large language models make sample-efficient recommender systems

J Lin, X Dai, R Shan, B Chen, R Tang, Y Yu… - Frontiers of Computer …, 2025 - Springer
Conclusion This letter investigates the sample efficiency property of recommender systems
enhanced by large language models. We propose a simple yet effective framework (ie …

Llms for generating and evaluating counterfactuals: A comprehensive study

P Youssef, C Seifert, J Schlötterer - Findings of the Association …, 2024 - aclanthology.org
As NLP models become more complex, understanding their decisions becomes more
crucial. Counterfactuals (CFs), where minimal changes to inputs flip a model's prediction …

Political-llm: Large language models in political science

L Li, J Li, C Chen, F Gui, H Yang, C Yu, Z Wang… - arxiv preprint arxiv …, 2024 - arxiv.org
In recent years, large language models (LLMs) have been widely adopted in political
science tasks such as election prediction, sentiment analysis, policy impact assessment, and …

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

F Xu, Q Hao, Z Zong, J Wang, Y Zhang, J Wang… - arxiv preprint arxiv …, 2025 - arxiv.org
Language has long been conceived as an essential tool for human reasoning. The
breakthrough of Large Language Models (LLMs) has sparked significant research interest in …