Google Scholar

Mm-llms: Recent advances in multimodal large language models

D Zhang, Y Yu, J Dong, C Li, D Su, C Chu… - arxiv preprint arxiv …, 2024 - arxiv.org

In the past year, MultiModal Large Language Models (MM-LLMs) have undergone
substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs …

Cited by 210

[PDF] springer.com

Large language models for generative information extraction: A survey

D Xu, W Chen, W Peng, C Zhang, T Xu, X Zhao… - Frontiers of Computer …, 2024 - Springer

Abstract Information Extraction (IE) aims to extract structural knowledge from plain natural
language texts. Recently, generative Large Language Models (LLMs) have demonstrated …

Cited by 133

[PDF] acm.org

A survey on hallucination in large language models: Principles, taxonomy, challenges, and open questions

L Huang, W Yu, W Ma, W Zhong, Z Feng… - ACM Transactions on …, 2024 - dl.acm.org

The emergence of large language models (LLMs) has marked a significant breakthrough in
natural language processing (NLP), fueling a paradigm shift in information acquisition …

Cited by 110

[PDF] arxiv.org

Adashield: Safeguarding multimodal large language models from structure-based attack via adaptive shield prompting

Y Wang, X Liu, Y Li, M Chen, C Xiao - European Conference on Computer …, 2024 - Springer

With the advent and widespread deployment of Multimodal Large Language Models
(MLLMs), the imperative to ensure their safety has become increasingly pronounced …

Cited by 35

[PDF] arxiv.org

Sora: A review on background, technology, limitations, and opportunities of large vision models

Y Liu, K Zhang, Y Li, Z Yan, C Gao, R Chen… - arxiv preprint arxiv …, 2024 - arxiv.org

Sora is a text-to-video generative AI model, released by OpenAI in February 2024. The
model is trained to generate videos of realistic or imaginative scenes from text instructions …

Cited by 225

[PDF] arxiv.org

BRAVE: Broadening the visual encoding of vision-language models

OF Kar, A Tonioni, P Poklukar, A Kulshrestha… - … on Computer Vision, 2024 - Springer

Vision-language models (VLMs) are typically composed of a vision encoder, e.g. CLIP, and a
language model (LM) that interprets the encoded features to solve downstream tasks …

Cited by 29

[PDF] frontiersin.org

Vision-language models for medical report generation and visual question answering: A review

I Hartsock, G Rasool - Frontiers in Artificial Intelligence, 2024 - frontiersin.org

Medical vision-language models (VLMs) combine computer vision (CV) and natural
language processing (NLP) to analyze visual and textual medical data. Our paper reviews …

Cited by 42

[PDF] arxiv.org

Unified hallucination detection for multimodal large language models

X Chen, C Wang, Y Xue, N Zhang, X Yang, Q Li… - arxiv preprint arxiv …, 2024 - arxiv.org

Despite significant strides in multimodal tasks, Multimodal Large Language Models (MLLMs)
are plagued by the critical issue of hallucination. The reliable detection of such …

Cited by 42

FineMatch: Aspect-Based Fine-Grained Image and Text Mismatch Detection and Correction

H Hua, J Shi, K Kafle, S Jenni, D Zhang… - … on Computer Vision, 2024 - Springer

Recent progress in large-scale pre-training has led to the development of advanced
vision-language models (VLMs) with remarkable proficiency in comprehending and generating …

Cited by 14

[PDF] arxiv.org

Hal-eval: A universal and fine-grained hallucination evaluation framework for large vision language models

C Jiang, H Jia, M Dong, W Ye, H Xu, M Yan… - Proceedings of the …, 2024 - dl.acm.org

Large Vision-Language Models (LVLMs) exhibit remarkable capabilities but struggle
with "hallucinations", inconsistencies between images and their descriptions. Previous …

Cited by 14

A survey on hallucination in large vision-language models

Mm-llms: Recent advances in multimodal large language models
