Google Scholar

A survey on evaluation of multimodal large language models

J Huang, J Zhang - arxiv preprint arxiv:2408.15769, 2024 - arxiv.org

Multimodal Large Language Models (MLLMs) mimic human perception and reasoning
system by integrating powerful Large Language Models (LLMs) with various modality …

Cited by 20

[PDF] arxiv.org

A survey on multimodal benchmarks: In the era of large ai models

L Li, G Chen, H Shi, J Xiao, L Chen - arxiv preprint arxiv:2409.18142, 2024 - arxiv.org

The rapid evolution of Multimodal Large Language Models (MLLMs) has brought substantial
advancements in artificial intelligence, significantly enhancing the capability to understand …

Cited by 4

[PDF] arxiv.org

Autoglm: Autonomous foundation agents for guis

X Liu, B Qin, D Liang, G Dong, H Lai, H Zhang… - arxiv preprint arxiv …, 2024 - arxiv.org

We present AutoGLM, a new series in the ChatGLM family, designed to serve as foundation
agents for autonomous control of digital devices through Graphical User Interfaces (GUIs) …

Cited by 4

[PDF] arxiv.org

Lifelong Learning of Large Language Model based Agents: A Roadmap

J Zheng, C Shi, X Cai, Q Li, D Zhang, C Li, D Yu… - arxiv preprint arxiv …, 2025 - arxiv.org

Lifelong learning, also known as continual or incremental learning, is a crucial component
for advancing Artificial General Intelligence (AGI) by enabling systems to continuously adapt …

[PDF] arxiv.org

Transforming the Hybrid Cloud for Emerging AI Workloads

D Chen, A Youssef, R Pendse, A Schleife… - arxiv preprint arxiv …, 2024 - arxiv.org

This white paper, developed through close collaboration between IBM Research and UIUC
researchers within the IIDAI Institute, envisions transforming hybrid cloud systems to meet …

[PDF] arxiv.org

Mobile-Agent-E: Self-Evolving Mobile Assistant for Complex Tasks

Z Wang, H Xu, J Wang, X Zhang, M Yan… - arxiv preprint arxiv …, 2025 - arxiv.org

Smartphones have become indispensable in modern life, yet navigating complex tasks on
mobile devices often remains frustrating. Recent advancements in large multimodal model …

Cited by 3

[PDF] arxiv.org

Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset

AG Gabriel, AA Ahmad, SK Jeyakumar - arxiv preprint arxiv:2410.22457, 2024 - arxiv.org

Advancements in Large Language Models (LLMs) are revolutionizing the development of
autonomous agentic systems by enabling dynamic, context-aware task decomposition and …

[PDF] preprints.org

[PDF] Continuous or Discrete, That Is the Question: A Survey on Large Multi-Modal Models from the Perspective of Input-Output Space Extension

Z Li, J Zhang, D Wang, Y Wang, X Huang, Z Wei - 2024 - preprints.org

With the success of large language models (LLMs) driving progress towards general-
purpose AI, there has been a growing focus on extending these models to multi-modal …

Visualagentbench: Towards large multimodal models as visual foundation agents
