A survey on evaluation of multimodal large language models

J Huang, J Zhang - arxiv preprint arxiv:2408.15769, 2024 - arxiv.org
Multimodal Large Language Models (MLLMs) mimic human perception and reasoning
system by integrating powerful Large Language Models (LLMs) with various modality …

A survey on multimodal benchmarks: In the era of large ai models

L Li, G Chen, H Shi, J **ao, L Chen - arxiv preprint arxiv:2409.18142, 2024 - arxiv.org
The rapid evolution of Multimodal Large Language Models (MLLMs) has brought substantial
advancements in artificial intelligence, significantly enhancing the capability to understand …

Autoglm: Autonomous foundation agents for guis

X Liu, B Qin, D Liang, G Dong, H Lai, H Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org
We present AutoGLM, a new series in the ChatGLM family, designed to serve as foundation
agents for autonomous control of digital devices through Graphical User Interfaces (GUIs) …

Lifelong Learning of Large Language Model based Agents: A Roadmap

J Zheng, C Shi, X Cai, Q Li, D Zhang, C Li, D Yu… - arxiv preprint arxiv …, 2025 - arxiv.org
Lifelong learning, also known as continual or incremental learning, is a crucial component
for advancing Artificial General Intelligence (AGI) by enabling systems to continuously adapt …

Transforming the Hybrid Cloud for Emerging AI Workloads

D Chen, A Youssef, R Pendse, A Schleife… - arxiv preprint arxiv …, 2024 - arxiv.org
This white paper, developed through close collaboration between IBM Research and UIUC
researchers within the IIDAI Institute, envisions transforming hybrid cloud systems to meet …

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Z Wang, H Xu, J Wang, X Zhang, M Yan… - arxiv preprint arxiv …, 2025 - arxiv.org
Smartphones have become indispensable in modern life, yet navigating complex tasks on
mobile devices often remains frustrating. Recent advancements in large multimodal model …

Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset

AG Gabriel, AA Ahmad, SK Jeyakumar - arxiv preprint arxiv:2410.22457, 2024 - arxiv.org
Advancements in Large Language Models (LLMs) are revolutionizing the development of
autonomous agentic systems by enabling dynamic, context-aware task decomposition and …

[PDF][PDF] Continuous or Discrete, That Is the Question: A Survey on Large Multi-Modal Models from the Perspective of Input-Output Space Extension

Z Li, J Zhang, D Wang, Y Wang, X Huang, Z Wei - 2024 - preprints.org
With the success of large language models (LLMs) driving progress towards general-
purpose AI, there has been a growing focus on extending these models to multi-modal …