Retrieving multimodal information for augmented generation: A survey

R Zhao, H Chen, W Wang, F Jiao, XL Do, C Qin… - arXiv preprint arXiv …, 2023 - arxiv.org
As Large Language Models (LLMs) have become popular, an important trend has emerged of
using multimodality to augment the LLMs' generation ability, which enables LLMs to better …

Auslan-Daily: Australian sign language translation for daily communication and news

X Shen, S Yuan, H Sheng, H Du… - Advances in Neural …, 2024 - proceedings.neurips.cc
Sign language translation (SLT) aims to convert a continuous sign language video clip into a
spoken language. Considering different geographic regions generally have their own native …

Improving personalized explanation generation through visualization

S Geng, Z Fu, Y Ge, L Li, G De Melo… - Proceedings of the 60th …, 2022 - aclanthology.org
In modern recommender systems, there are usually comments or reviews from users that
justify their ratings for different items. Trained on such textual corpus, explainable …

Learning to imagine: Visually-augmented natural language generation

T Tang, Y Chen, Y Du, J Li, WX Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
People often imagine relevant scenes to aid in the writing process. In this work, we aim to
utilize visual information for composition in the same manner as humans. We propose a …

AutoGraph: Enabling visual context via graph alignment in open domain multi-modal dialogue generation

D Zhao, D Han, Y Yuan, B Ning, M Li, Z He… - Proceedings of the 32nd …, 2024 - dl.acm.org
Open-domain multi-modal dialogue systems heavily rely on visual information to generate
contextually relevant responses. Existing open-domain multi-modal dialog generation …

ZRIGF: An innovative multimodal framework for zero-resource image-grounded dialogue generation

B Zhang, J Wang, H Ma, B Xu, H Lin - Proceedings of the 31st ACM …, 2023 - dl.acm.org
Image-grounded dialogue systems benefit greatly from integrating visual information,
resulting in high-quality response generation. However, current models struggle to …

Distilling implicit multimodal knowledge into large language models for zero-resource dialogue generation

B Zhang, H Ma, J Ding, J Wang, B Xu, H Lin - Information Fusion, 2025 - Elsevier
Integrating multimodal knowledge into large language models (LLMs) represents a
significant advancement in dialogue generation capabilities. However, the effective …

Identifying untrustworthy samples: Data filtering for open-domain dialogues with Bayesian optimization

L Shen, H Zhan, X Shen, H Chen, X Zhao… - Proceedings of the 30th …, 2021 - dl.acm.org
The ability to reply with a related, fluent, and informative response is an indispensable
requirement for building high-quality conversational agents. In order to generate better …

Think beyond words: Exploring context-relevant visual commonsense for diverse dialogue generation

Y Liu, L Li, B Zhang, Q Huang - Findings of the Association for …, 2022 - aclanthology.org
Commonsense knowledge has been widely considered for building intelligent open-domain
dialogue agents, aiming to generate meaningful and diverse responses. Previous works in …

ReSee: Responding through seeing fine-grained visual knowledge in open-domain dialogue

H Tu, Y Li, F Mi, Z Yang - arXiv preprint arXiv:2305.13602, 2023 - arxiv.org
Incorporating visual knowledge into text-only dialogue systems has become a promising
direction for imitating the way humans think, imagine, and communicate. However, existing …