Empowering llm to use smartphone for intelligent task automation
Mobile task automation is an attractive technique that aims to enable voice-based hands-
free user interaction with smartphones. However, existing approaches suffer from poor …
free user interaction with smartphones. However, existing approaches suffer from poor …
Personal llm agents: Insights and survey about the capability, efficiency and security
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …
been one of the key technologies that researchers and engineers have focused on, aiming …
Foundations and recent trends in multimodal mobile agents: A survey
Mobile agents are essential for automating tasks in complex and dynamic mobile
environments. As foundation models evolve, the demands for agents that can adapt in real …
environments. As foundation models evolve, the demands for agents that can adapt in real …
Large language model-based agents for software engineering: A survey
The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI
agents, ie, LLM-based agents. Compared to standalone LLMs, LLM-based agents …
agents, ie, LLM-based agents. Compared to standalone LLMs, LLM-based agents …
AssistGUI: Task-Oriented PC Graphical User Interface Automation
Abstract Graphical User Interface (GUI) automation holds significant promise for assisting
users with complex tasks thereby boosting human productivity. Existing works leveraging …
users with complex tasks thereby boosting human productivity. Existing works leveraging …
Mobile-bench: An evaluation benchmark for llm-based mobile agents
S Deng, W Xu, H Sun, W Liu, T Tan, J Liu, A Li… - arxiv preprint arxiv …, 2024 - arxiv.org
With the remarkable advancements of large language models (LLMs), LLM-based agents
have become a research hotspot in human-computer interaction. However, there is a …
have become a research hotspot in human-computer interaction. However, there is a …
Large multimodal agents: A survey
Large language models (LLMs) have achieved superior performance in powering text-
based AI agents, endowing them with decision-making and reasoning abilities akin to …
based AI agents, endowing them with decision-making and reasoning abilities akin to …
Explore, select, derive, and recall: Augmenting llm with human-like memory for mobile task automation
The advent of large language models (LLMs) has opened up new opportunities in the field
of mobile task automation. Their superior language understanding and reasoning …
of mobile task automation. Their superior language understanding and reasoning …
Assistgui: Task-oriented desktop graphical user interface automation
Graphical User Interface (GUI) automation holds significant promise for assisting users with
complex tasks, thereby boosting human productivity. Existing works leveraging Large …
complex tasks, thereby boosting human productivity. Existing works leveraging Large …
Guardian: A Runtime Framework for LLM-Based UI Exploration
Tests for feature-based UI testing have been indispensable for ensuring the quality of mobile
applications (apps for short). The high manual labor costs to create such tests have led to a …
applications (apps for short). The high manual labor costs to create such tests have led to a …