Communicative agents for software development

C Qian, X Cong, C Yang, W Chen, Y Su… - arXiv preprint arXiv …, 2023 - openreview.net
Software engineering is a domain characterized by intricate decision-making processes,
often relying on nuanced intuition and consultation. Recent advancements in deep learning …

ChatDev: Communicative agents for software development

C Qian, W Liu, H Liu, N Chen, Y Dang, J Li… - Proceedings of the …, 2024 - aclanthology.org
Software development is a complex task that necessitates cooperation among multiple
members with diverse skills. Numerous studies used deep learning to improve specific …

Understanding the weakness of large language model agents within a complex Android environment

M Xing, R Zhang, H Xue, Q Chen, F Yang… - Proceedings of the 30th …, 2024 - dl.acm.org
Large language models (LLMs) have empowered intelligent agents to execute intricate
tasks within domain-specific software such as browsers and games. However, when applied …

Ferret-UI 2: Mastering universal user interface understanding across platforms

Z Li, K You, H Zhang, D Feng, H Agrawal, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Building a generalist model for user interface (UI) understanding is challenging due to
various foundational issues, such as platform diversity, resolution variation, and data …

Open models, closed minds? On agents capabilities in mimicking human personalities through open large language models

L La Cava, A Tagarelli - arXiv preprint arXiv:2401.07115, 2024 - arxiv.org
The emergence of unveiling human-like behaviors in Large Language Models (LLMs) has
led to a closer connection between NLP and human psychology. Scholars have been …

A dynamic LLM-powered agent network for task-oriented agent collaboration

Z Liu, Y Zhang, P Li, Y Liu, D Yang - First Conference on Language …, 2024 - openreview.net
Recent studies show that collaborating multiple large language model (LLM) powered
agents is a promising way for task solving. However, current approaches are constrained by …

SteP: Stacked LLM policies for web actions

P Sodhi, SRK Branavan, Y Artzi… - First Conference on …, 2024 - openreview.net
Performing tasks on the web presents fundamental challenges to large language models
(LLMs), including combinatorially large open-world tasks and variations across web …

Large language model-brained GUI agents: A survey

C Zhang, S He, J Qian, B Li, L Li, S Qin, Y Kang… - arXiv preprint arXiv …, 2024 - arxiv.org
GUIs have long been central to human-computer interaction, providing an intuitive and
visually-driven way to access and interact with digital systems. The advent of LLMs …

G-Designer: Architecting multi-agent communication topologies via graph neural networks

G Zhang, Y Yue, X Sun, G Wan, M Yu, J Fang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent advancements in large language model (LLM)-based agents have demonstrated
that collective intelligence can significantly surpass the capabilities of individual agents …

DocBench: A benchmark for evaluating LLM-based document reading systems

A Zou, W Yu, H Zhang, K Ma, D Cai, Z Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, there has been a growing interest among large language model (LLM) developers
in LLM-based document reading systems, which enable users to upload their own …