Controllable text generation for large language models: A survey

X Liang, H Wang, Y Wang, S Song, J Yang… - arxiv preprint arxiv …, 2024 - arxiv.org
In Natural Language Processing (NLP), Large Language Models (LLMs) have demonstrated
high text generation quality. However, in real-world applications, LLMs must meet …

Data augmentation using llms: Data perspectives, learning paradigms and challenges

B Ding, C Qin, R Zhao, T Luo, X Li… - Findings of the …, 2024 - aclanthology.org
In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has
emerged as a pivotal technique for enhancing model performance by diversifying training …

Finding visual task vectors

A Hojel, Y Bai, T Darrell, A Globerson, A Bar - European Conference on …, 2024 - Springer
Visual Prompting is a technique for teaching models to perform a visual task via in-context
examples, without any additional training. In this work, we analyze the activations of MAE …

Inference scaling for long-context retrieval augmented generation

Z Yue, H Zhuang, A Bai, K Hui, R Jagerman… - arxiv preprint arxiv …, 2024 - arxiv.org
The scaling of inference computation has unlocked the potential of long-context large
language models (LLMs) across diverse settings. For knowledge-intensive tasks, the …

Configurable foundation models: Building llms from a modular perspective

C **ao, Z Zhang, C Song, D Jiang, F Yao, X Han… - arxiv preprint arxiv …, 2024 - arxiv.org
Advancements in LLMs have recently unveiled challenges tied to computational efficiency
and continual scalability due to their requirements of huge parameters, making the …

Large language models for social networks: Applications, challenges, and solutions

J Zeng, R Huang, W Malik, L Yin, B Babic… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) are transforming the way people generate, explore, and
engage with content. We study how we can develop LLM applications for online social …

In-context learning for zero-shot medical report generation

R Liu, M Li, S Zhao, L Chen, X Chang… - Proceedings of the 32nd …, 2024 - dl.acm.org
Medical report generation (MRG) has emerged as a pivotal research topic in the medical
multi-modal field, given its potential to alleviate the heavy workloads of radiologists …

Nothing in excess: Mitigating the exaggerated safety for llms via safety-conscious activation steering

Z Cao, Y Yang, H Zhao - arxiv preprint arxiv:2408.11491, 2024 - arxiv.org
Safety alignment is indispensable for Large language models (LLMs) to defend threats from
malicious instructions. However, recent researches reveal safety-aligned LLMs prone to …

Towards tracing trustworthiness dynamics: Revisiting pre-training period of large language models

C Qian, J Zhang, W Yao, D Liu, Z Yin, Y Qiao… - arxiv preprint arxiv …, 2024 - arxiv.org
Ensuring the trustworthiness of large language models (LLMs) is crucial. Most studies
concentrate on fully pre-trained LLMs to better understand and improve LLMs' …

Automated radiotherapy treatment planning guided by GPT-4Vision

S Liu, O Pastor-Serrano, Y Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Radiotherapy treatment planning is a time-consuming and potentially subjective process
that requires the iterative adjustment of model parameters to balance multiple conflicting …