- Academic Search

Z **, W Chen, X Guo, W He, Y Ding, B Hong… - Science China …, 2025 - Springer

For a long time, researchers have sought artificial intelligence (AI) that matches or exceeds
human intelligence. AI agents, which are artificial entities capable of sensing the …

Save Cite Cited by 742 Related articles All 4 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Large language models for software engineering: A systematic literature review

X Hou, Y Zhao, Y Liu, Z Yang, K Wang, L Li… - ACM Transactions on …, 2024 - dl.acm.org

Large Language Models (LLMs) have significantly impacted numerous domains, including
Software Engineering (SE). Many recent publications have explored LLMs applied to …

Save Cite Cited by 509 Related articles All 8 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] springer.com

A survey on large language model based autonomous agents

L Wang, C Ma, X Feng, Z Zhang, H Yang… - Frontiers of Computer …, 2024 - Springer

Autonomous agents have long been a research focus in academic and industry
communities. Previous research often focuses on training agents with limited knowledge …

Save Cite Cited by 967 Related articles All 4 versions Free GPT-4 DeepSeek

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Trustllm: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu, Q Zhang, Y Li… - arxiv preprint arxiv …, 2024 - arxiv.org

Large language models (LLMs), exemplified by ChatGPT, have gained considerable
attention for their excellent natural language processing capabilities. Nonetheless, these …

Save Cite Cited by 251 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Autogen: Enabling next-gen llm applications via multi-agent conversation framework

Q Wu, G Bansal, J Zhang, Y Wu, S Zhang, E Zhu… - arxiv preprint arxiv …, 2023 - arxiv.org

This technical report presents AutoGen, a new framework that enables development of LLM
applications using multiple agents that can converse with each other to solve tasks. AutoGen …

Save Cite Cited by 742 Related articles All 4 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Ultrafeedback: Boosting language models with high-quality feedback

G Cui, L Yuan, N Ding, G Yao, W Zhu, Y Ni, G **e, Z Liu… - 2023 - openreview.net

Reinforcement learning from human feedback (RLHF) has become a pivot technique in
aligning large language models (LLMs) with human preferences. In RLHF practice …

Save Cite Cited by 244 Related articles All 2 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[HTML] mlr.press

[HTML][HTML] Position: TrustLLM: Trustworthiness in large language models

Y Huang, L Sun, H Wang, S Wu… - International …, 2024 - proceedings.mlr.press

Large language models (LLMs) have gained considerable attention for their excellent
natural language processing capabilities. Nonetheless, these LLMs present many …

Save Cite Cited by 46 Related articles Cached

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Rolellm: Benchmarking, eliciting, and enhancing role-playing abilities of large language models

ZM Wang, Z Peng, H Que, J Liu, W Zhou, Y Wu… - arxiv preprint arxiv …, 2023 - arxiv.org

The advent of Large Language Models (LLMs) has paved the way for complex tasks such as
role-playing, which enhances user interactions by enabling models to imitate various …

Save Cite Cited by 154 Related articles All 2 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] openreview.net

Chateval: Towards better llm-based evaluators through multi-agent debate

CM Chan, W Chen, Y Su, J Yu, W Xue, S Zhang… - arxiv preprint arxiv …, 2023 - arxiv.org

Text evaluation has historically posed significant challenges, often demanding substantial
labor and time cost. With the emergence of large language models (LLMs), researchers …

Save Cite Cited by 344 Related articles All 3 versions Free GPT-4 DeepSeek View as HTML

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

On generative agents in recommendation

A Zhang, Y Chen, L Sheng, X Wang… - Proceedings of the 47th …, 2024 - dl.acm.org

Recommender systems are the cornerstone of today's information dissemination, yet a
disconnect between offline metrics and online performance greatly hinders their …

Save Cite Cited by 94 Related articles All 2 versions Free GPT-4 DeepSeek

Cite

Advanced search

Saved to My library

The rise and potential of large language model based agents: A survey

Large language models for software engineering: A systematic literature review

A survey on large language model based autonomous agents

Trustllm: Trustworthiness in large language models

Autogen: Enabling next-gen llm applications via multi-agent conversation framework

Ultrafeedback: Boosting language models with high-quality feedback

[HTML][HTML] Position: TrustLLM: Trustworthiness in large language models

Rolellm: Benchmarking, eliciting, and enhancing role-playing abilities of large language models

Chateval: Towards better llm-based evaluators through multi-agent debate

On generative agents in recommendation