A comprehensive overview of large language models

H Naveed, AU Khan, S Qiu, M Saqib, S Anwar… - arxiv preprint arxiv …, 2023 - arxiv.org
Large Language Models (LLMs) have recently demonstrated remarkable capabilities in
natural language processing tasks and beyond. This success of LLMs has led to a large …

A survey on large language model based autonomous agents

L Wang, C Ma, X Feng, Z Zhang, H Yang… - Frontiers of Computer …, 2024 - Springer
Autonomous agents have long been a research focus in academic and industry
communities. Previous research often focuses on training agents with limited knowledge …

Ultrafeedback: Boosting language models with high-quality feedback

G Cui, L Yuan, N Ding, G Yao, W Zhu, Y Ni, G **e, Z Liu… - 2023 - openreview.net
Reinforcement learning from human feedback (RLHF) has become a pivot technique in
aligning large language models (LLMs) with human preferences. In RLHF practice …

Expel: Llm agents are experiential learners

A Zhao, D Huang, Q Xu, M Lin, YJ Liu… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
The recent surge in research interest in applying large language models (LLMs) to decision-
making tasks has flourished by leveraging the extensive world knowledge embedded in …

Language agent tree search unifies reasoning acting and planning in language models

A Zhou, K Yan, M Shlapentokh-Rothman… - arxiv preprint arxiv …, 2023 - arxiv.org
While large language models (LLMs) have demonstrated impressive performance on a
range of decision-making tasks, they rely on simple acting processes and fall short of broad …

Glore: When, where, and how to improve llm reasoning via global and local refinements

A Havrilla, S Raparthy, C Nalmpantis… - arxiv preprint arxiv …, 2024 - arxiv.org
State-of-the-art language models can exhibit impressive reasoning refinement capabilities
on math, science or coding tasks. However, recent work demonstrates that even the best …

Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

Z Zhang, Y Yao, A Zhang, X Tang, X Ma, Z He… - arxiv preprint arxiv …, 2023 - arxiv.org
Large language models (LLMs) have dramatically enhanced the field of language
intelligence, as demonstrably evidenced by their formidable empirical performance across a …

Text2reward: Automated dense reward function generation for reinforcement learning

T **e, S Zhao, CH Wu, Y Liu, Q Luo, V Zhong… - arxiv preprint arxiv …, 2023 - arxiv.org
Designing reward functions is a longstanding challenge in reinforcement learning (RL); it
requires specialized knowledge or domain data, leading to high costs for development. To …

Fincon: A synthesized llm multi-agent system with conceptual verbal reinforcement for enhanced financial decision making

Y Yu, Z Yao, H Li, Z Deng, Y Cao, Z Chen… - arxiv preprint arxiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated notable potential in conducting complex
tasks and are increasingly utilized in various financial applications. However, high-quality …

Towards end-to-end embodied decision making via multi-modal large language model: Explorations with gpt4-vision and beyond

L Chen, Y Zhang, S Ren, H Zhao, Z Cai… - arxiv preprint arxiv …, 2023 - arxiv.org
In this study, we explore the potential of Multimodal Large Language Models (MLLMs) in
improving embodied decision-making processes for agents. While Large Language Models …