The rise and potential of large language model based agents: A survey

Z **, W Chen, X Guo, W He, Y Ding, B Hong… - Science China …, 2025 - Springer
For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …

Recent advances in deep learning based dialogue systems: A systematic survey

J Ni, T Young, V Pandelea, F Xue… - Artificial intelligence review, 2023 - Springer
Dialogue systems are a popular natural language processing (NLP) task as it is promising in
real-life applications. It is also a complicated task since many NLP tasks deserving study are …

Agentbench: Evaluating llms as agents

X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) are becoming increasingly smart and autonomous,
targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has …

Language models that seek for knowledge: Modular search & generation for dialogue and prompt completion

K Shuster, M Komeili, L Adolphs, S Roller… - arxiv preprint arxiv …, 2022 - arxiv.org
Language models (LMs) have recently been shown to generate more factual responses by
employing modularity (Zhou et al., 2021) in combination with retrieval (Adolphs et al., 2021) …

Evaluating human-language model interaction

M Lee, M Srivastava, A Hardy, J Thickstun… - arxiv preprint arxiv …, 2022 - arxiv.org
Many real-world applications of language models (LMs), such as writing assistance and
code autocomplete, involve human-LM interaction. However, most benchmarks are non …

Opt-iml: Scaling language model instruction meta learning through the lens of generalization

S Iyer, XV Lin, R Pasunuru, T Mihaylov, D Simig… - arxiv preprint arxiv …, 2022 - arxiv.org
Recent work has shown that fine-tuning large pre-trained language models on a collection
of tasks described via instructions, aka instruction-tuning, improves their zero and few-shot …

Mindagent: Emergent gaming interaction

R Gong, Q Huang, X Ma, H Vo, Z Durante… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have the capacity of performing complex scheduling in a
multi-agent system and can coordinate these agents into completing sophisticated tasks that …

RedditBias: A real-world resource for bias evaluation and debiasing of conversational language models

S Barikeri, A Lauscher, I Vulić, G Glavaš - arxiv preprint arxiv:2106.03521, 2021 - arxiv.org
Text representation models are prone to exhibit a range of societal biases, reflecting the non-
controlled and biased nature of the underlying pretraining data, which consequently leads to …

Towards debiasing sentence representations

PP Liang, IM Li, E Zheng, YC Lim… - arxiv preprint arxiv …, 2020 - arxiv.org
As natural language processing methods are increasingly deployed in real-world scenarios
such as healthcare, legal systems, and social science, it becomes necessary to recognize …

[ALINTI][C] Recipes for building an open-domain chatbot

S Roller - arxiv preprint arxiv:2004.13637, 2020