Attention heads of large language models: A survey

Z Zheng, Y Wang, Y Huang, S Song, M Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
Since the advent of ChatGPT, Large Language Models (LLMs) have excelled in various
tasks but remain as black-box systems. Consequently, the reasoning bottlenecks of LLMs …

Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models

Q Ren, J Zeng, Q He, J Liang, Y **ao, W Zhou… - arxiv preprint arxiv …, 2025 - arxiv.org
It is crucial for large language models (LLMs) to follow instructions that involve multiple
constraints. However, soft constraints are semantically related and difficult to verify through …

Position: LLMs Can be Good Tutors in Foreign Language Education

J Ye, S Wang, D Zou, Y Yan, K Wang, HT Zheng… - arxiv preprint arxiv …, 2025 - arxiv.org
While recent efforts have begun integrating large language models (LLMs) into foreign
language education (FLE), they often rely on traditional approaches to learning tasks without …

Vulnerability Mitigation for Safety-Aligned Language Models via Debiasing

TQ Tran, A Wachi, R Sato, T Tanabe… - arxiv preprint arxiv …, 2025 - arxiv.org
Safety alignment is an essential research topic for real-world AI applications. Despite the
multifaceted nature of safety and trustworthiness in AI, current safety alignment methods …

Zero-Shot Strategies for Length-Controllable Summarization

F Retkowski, A Waibel - arxiv preprint arxiv:2501.00233, 2024 - arxiv.org
Large language models (LLMs) struggle with precise length control, particularly in zero-shot
settings. We conduct a comprehensive study evaluating LLMs' length control capabilities …

Large Language Model Evaluation Criteria Framework in Healthcare: Fuzzy MCDM Approach

HM Alabool - SN Computer Science, 2025 - Springer
Abstract Large Language Models (LLMs) gained notable popularity in academia and
industry. It has unprecedented features and performance in many applications. LLMs are …

A Comprehensive Evaluation of Cognitive Biases in LLMs

S Malberg, R Poletukhin, CM Schuster… - arxiv preprint arxiv …, 2024 - arxiv.org
We present a large-scale evaluation of 30 cognitive biases in 20 state-of-the-art large
language models (LLMs) under various decision-making scenarios. Our contributions …

Towards AI- Law: A Roadmap to Trustworthy AGI

Y Chao, L Chaochao, W Yingchun, Z Bowen - arxiv preprint arxiv …, 2024 - arxiv.org
Ensuring Artificial General Intelligence (AGI) reliably avoids harmful behaviors is a critical
challenge, especially for systems with high autonomy or in safety-critical domains. Despite …

Instruction Tuning for Story Understanding and Generation with Weak Supervision

Y Yuan, H Chen, C Ng - arxiv preprint arxiv:2501.15574, 2025 - arxiv.org
Story understanding and generation have long been a challenging task in natural language
processing (NLP), especially when dealing with various levels of instruction specificity. In …

[PDF][PDF] How to Write Effective Prompts for Screening Biomedical Literature Using Large Language Models

MT Colangelo, S Guizzardi, M Meleti, E Calciolari… - 2025 - preprints.org
Large language models (LLMs) have emerged as powerful tools for (semi-) automating the
initial screening of abstracts in systematic reviews, offering the potential to significantly …