The rise and potential of large language model based agents: A survey

Z Xi, W Chen, X Guo, W He, Y Ding, B Hong… - Science China …, 2025 - Springer
For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …

A survey on large language model based autonomous agents

L Wang, C Ma, X Feng, Z Zhang, H Yang… - Frontiers of Computer …, 2024 - Springer
Autonomous agents have long been a research focus in academic and industry
communities. Previous research often focuses on training agents with limited knowledge …

AgentCF: Collaborative learning with autonomous language agents for recommender systems

J Zhang, Y Hou, R Xie, W Sun, J McAuley… - Proceedings of the …, 2024 - dl.acm.org
Recently, there has been an emergence of employing LLM-powered agents as believable
human proxies, based on their remarkable decision-making capability. However, existing …

LongBench: A bilingual, multitask benchmark for long context understanding

Y Bai, X Lv, J Zhang, H Lyu, J Tang, Z Huang… - arXiv preprint arXiv …, 2023 - arxiv.org
Although large language models (LLMs) demonstrate impressive performance for many
language tasks, most of them can only handle texts a few thousand tokens long, limiting their …

TravelPlanner: A benchmark for real-world planning with language agents

J Xie, K Zhang, J Chen, T Zhu, R Lou, Y Tian… - arXiv preprint arXiv …, 2024 - arxiv.org
Planning has been part of the core pursuit for artificial intelligence since its conception, but
earlier AI agents mostly focused on constrained settings because many of the cognitive …

InfiAgent-DABench: Evaluating agents on data analysis tasks

X Hu, Z Zhao, S Wei, Z Chai, Q Ma, G Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce InfiAgent-DABench, the first benchmark specifically designed to
evaluate LLM-based agents on data analysis tasks. These tasks require agents to end-to …

Large language models are semi-parametric reinforcement learning agents

D Zhang, L Chen, S Zhang, H Xu… - Advances in Neural …, 2024 - proceedings.neurips.cc
Inspired by the insights in cognitive science with respect to human memory and reasoning
mechanism, a novel evolvable LLM-based (Large Language Model) agent framework is …

The What, Why, and How of Context Length Extension Techniques in Large Language Models--A Detailed Survey

S Pawar, SM Tonmoy, SM Zaman, V Jain… - arXiv preprint arXiv …, 2024 - arxiv.org
The advent of Large Language Models (LLMs) represents a notable breakthrough in Natural
Language Processing (NLP), contributing to substantial progress in both text …

LV-Eval: A balanced long-context benchmark with 5 length levels up to 256k

T Yuan, X Ning, D Zhou, Z Yang, S Li, M Zhuang… - arXiv preprint arXiv …, 2024 - arxiv.org
State-of-the-art large language models (LLMs) are now claiming remarkable supported
context lengths of 256k or even more. In contrast, the average context lengths of mainstream …

AgentLens: Visual Analysis for Agent Behaviors in LLM-based Autonomous Systems

J Lu, B Pan, J Chen, Y Feng, J Hu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Recently, Large Language Model based Autonomous System (LLMAS) has gained great
popularity for its potential to simulate complicated behaviors of human societies. One of its …