Large Language Model-Brained GUI Agents: A Survey

C Zhang, S He, J Qian, B Li, L Li, S Qin, Y Kang… - arXiv preprint arXiv …, 2024 - arxiv.org
GUIs have long been central to human-computer interaction, providing an intuitive and
visually-driven way to access and interact with digital systems. The advent of LLMs …

Large Action Models: From Inception to Implementation

L Wang, F Yang, C Zhang, J Lu, J Qian, S He… - arXiv preprint arXiv …, 2024 - arxiv.org
As AI continues to advance, there is a growing demand for systems that go beyond
language-based assistance and move toward intelligent agents capable of performing real …

Enabling Autonomic Microservice Management through Self-Learning Agents

F Yu, F Yang, X Qin, Z Zhang, J Zhang, Q Lin… - arXiv preprint arXiv …, 2025 - arxiv.org
The increasing complexity of modern software systems necessitates robust autonomic self-
management capabilities. While Large Language Models (LLMs) demonstrate potential in …

OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use

X Hu, T Xiong, B Yi, Z Wei, R Xiao, Y Chen, J Ye, M Tao… - 2024 - preprints.org
The dream to create AI assistants as capable and versatile as the fictional JARVIS from Iron
Man has long captivated imaginations. With the evolution of (multimodal) large language …

Every Software as an Agent: Blueprint and Case Study

M Xu - arXiv preprint arXiv:2502.04747, 2025 - arxiv.org
The rise of (multimodal) large language models (LLMs) has shed light on software agent--
where software can understand and follow user instructions in natural language. However …