Google Tudós

J Huang, J Zhang - arxiv preprint arxiv:2408.15769, 2024 - arxiv.org

Multimodal Large Language Models (MLLMs) mimic human perception and reasoning
system by integrating powerful Large Language Models (LLMs) with various modality …

Mentés Hivatkozás Idézetek száma: 17 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] aclanthology.org

Large Language Models Can Be Contextual Privacy Protection Learners

Y **ao, Y **, Y Bai, Y Wu, X Yang, X Luo… - Proceedings of the …, 2024 - aclanthology.org

Abstract The proliferation of Large Language Models (LLMs) has driven considerable
interest in fine-tuning them with domain-specific data to create specialized language …

Mentés Hivatkozás Idézetek száma: 24 Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] acm.org

Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey

J Kuang, Y Shen, J **e, H Luo, Z Xu, R Li, Y Li… - ACM Computing …, 2024 - dl.acm.org

Visual Question Answering (VQA) is a challenge task that combines natural language
processing and computer vision techniques and gradually becomes a benchmark test task …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 3 változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

A survey on multimodal benchmarks: In the era of large ai models

L Li, G Chen, H Shi, J **ao, L Chen - arxiv preprint arxiv:2409.18142, 2024 - arxiv.org

The rapid evolution of Multimodal Large Language Models (MLLMs) has brought substantial
advancements in artificial intelligence, significantly enhancing the capability to understand …

Mentés Hivatkozás Idézetek száma: 4 Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models

J Liu, Y Li, B **ao, Y Jian, Z Qin, T Shao… - arxiv preprint arxiv …, 2024 - arxiv.org

There have been recent efforts to extend the Chain-of-Thought (CoT) paradigm to
Multimodal Large Language Models (MLLMs) by finding visual clues in the input scene …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches

A Mumuni, F Mumuni - arxiv preprint arxiv:2501.03151, 2025 - arxiv.org

Generative artificial intelligence (AI) systems based on large-scale pretrained foundation
models (PFMs) such as vision-language models, large language models (LLMs), diffusion …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

AI Benchmarks and Datasets for LLM Evaluation

T Ivanov, V Penchev - arxiv preprint arxiv:2412.01020, 2024 - arxiv.org

LLMs demand significant computational resources for both pre-training and fine-tuning,
requiring distributed computing capabilities due to their large model sizes\cite …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 2 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

Z Wang, C Chen, F Luo, Y Dong, Y Zhang, Y Xu… - arxiv preprint arxiv …, 2024 - arxiv.org

Active perception, a crucial human capability, involves setting a goal based on the current
understanding of the environment and performing actions to achieve that goal. Despite …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

[Free GPT-4]
[DeepSeek]

[PDF] bas.bg

LLM Logical Reasoning Related to Aesthetic Universals

D Baeva, G Ivanova - Proceedings of the Bulgarian Academy of …, 2024 - proceedings.bas.bg

The recent surge in popularity of LLMs has led to increased interest in them and to extensive
research and evaluation of their reasoning abilities. Induction, deduction, and abduction …

Mentés Hivatkozás Kapcsolódó cikkek Mind a(z) 3 változat HTML-változat

Értesítés létrehozása

Hivatkozás

Speciális keresés

Mentve a Saját könyvtárba

Logicvista: Multimodal llm logical reasoning benchmark in visual contexts

A survey on evaluation of multimodal large language models

Large Language Models Can Be Contextual Privacy Protection Learners

Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey

A survey on multimodal benchmarks: In the era of large ai models

Enhancing Visual Reasoning with Autonomous Imagination in Multimodal Large Language Models

Large language models for artificial general intelligence (AGI): A survey of foundational principles and approaches

AI Benchmarks and Datasets for LLM Evaluation

ActiView: Evaluating Active Perception Ability for Multimodal Large Language Models

LLM Logical Reasoning Related to Aesthetic Universals