Large language model-brained gui agents: A survey

C Zhang, S He, J Qian, B Li, L Li, S Qin, Y Kang… - arxiv preprint arxiv …, 2024 - arxiv.org
GUIs have long been central to human-computer interaction, providing an intuitive and
visually-driven way to access and interact with digital systems. The advent of LLMs …

Large language models empowered personalized web agents

H Cai, Y Li, W Wang, F Zhu, X Shen, W Li… - arxiv preprint arxiv …, 2024 - arxiv.org
Web agents have emerged as a promising direction to automate Web task completion based
on user instructions, significantly enhancing user experience. Recently, Web agents have …

Gui agents: A survey

D Nguyen, J Chen, Y Wang, G Wu, N Park, Z Hu… - arxiv preprint arxiv …, 2024 - arxiv.org
Graphical User Interface (GUI) agents, powered by Large Foundation Models, have
emerged as a transformative approach to automating human-computer interaction. These …

[PDF][PDF] Os agents: A survey on mllm-based agents for general computing devices use

X Hu, T **ong, B Yi, Z Wei, R **ao, Y Chen, J Ye, M Tao… - 2024 - preprints.org
The dream to create AI assistants as capable and versatile as the fictional JARVIS from Iron
Man has long captivated imaginations. With the evolution of (multimodal) large language …