- Academic Search

S Wiegreffe, A Marasović - ar** language-image pre-training for unified vision-language understanding and generation

J Li, D Li, C **ong, S Hoi - International conference on …, 2022 - proceedings.mlr.press

Abstract Vision-Language Pre-training (VLP) has advanced the performance for many vision-
language tasks. However, most existing pre-trained models only excel in either …

Zapisz Cytuj Cytowane przez 4215 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Large language models are visual reasoning coordinators

L Chen, B Li, S Shen, J Yang, C Li… - Advances in …, 2024 - proceedings.neurips.cc

Visual reasoning requires multimodal perception and commonsense cognition of the world.
Recently, multiple vision-language models (VLMs) have been proposed with excellent …

Zapisz Cytuj Cytowane przez 35 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Align before fuse: Vision and language representation learning with momentum distillation

J Li, R Selvaraju, A Gotmare, S Joty… - Advances in neural …, 2021 - proceedings.neurips.cc

Large-scale vision and language representation learning has shown promising
improvements on various vision-language tasks. Most existing methods employ a …

Zapisz Cytuj Cytowane przez 2090 Powiązane artykuły Wszystkie wersje 10 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] mdpi.com

Sentence representation method based on multi-layer semantic network

W Zheng, X Liu, L Yin - Applied sciences, 2021 - mdpi.com

With the development of artificial intelligence, more and more people hope that computers
can understand human language through natural language technology, learn to think like …

Zapisz Cytuj Cytowane przez 153 Powiązane artykuły Wszystkie wersje 9 Kopia

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Measuring association between labels and free-text rationales

S Wiegreffe, A Marasović, NA Smith - arxiv preprint arxiv:2010.12762, 2020 - arxiv.org

In interpretable NLP, we require faithful rationales that reflect the model's decision-making
process for an explained instance. While prior work focuses on extractive rationales (a …

Zapisz Cytuj Cytowane przez 174 Powiązane artykuły Wszystkie wersje 5 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] neurips.cc

Probing inter-modality: Visual parsing with self-attention for vision-and-language pre-training

H Xue, Y Huang, B Liu, H Peng, J Fu… - Advances in Neural …, 2021 - proceedings.neurips.cc

Abstract Vision-Language Pre-training (VLP) aims to learn multi-modal representations from
image-text pairs and serves for downstream vision-language tasks in a fine-tuning fashion …

Zapisz Cytuj Cytowane przez 93 Powiązane artykuły Wszystkie wersje 9 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] thecvf.com

Nlx-gpt: A model for natural language explanations in vision and vision-language tasks

F Sammani, T Mukherjee… - proceedings of the …, 2022 - openaccess.thecvf.com

Natural language explanation (NLE) models aim at explaining the decision-making process
of a black box system via generating natural language sentences which are human-friendly …

Zapisz Cytuj Cytowane przez 70 Powiązane artykuły Wszystkie wersje 8 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Rigorously assessing natural language explanations of neurons

J Huang, A Geiger, K D'Oosterlinck, Z Wu… - arxiv preprint arxiv …, 2023 - arxiv.org

Natural language is an appealing medium for explaining how large language models
process and store information, but evaluating the faithfulness of such explanations is …

Zapisz Cytuj Cytowane przez 29 Powiązane artykuły Wszystkie wersje 4 Wersja HTML

[Free GPT-4]
[DeepSeek]

[PDF] arxiv.org

Natural language rationales with full-stack visual reasoning: From pixels to semantic frames to commonsense graphs

A Marasović, C Bhagavatula, JS Park, RL Bras… - arxiv preprint arxiv …, 2020 - arxiv.org

Natural language rationales could provide intuitive, higher-level explanations that are easily
understandable by humans, complementing the more broadly studied lower-level …

Zapisz Cytuj Cytowane przez 69 Powiązane artykuły Wszystkie wersje 3 Wersja HTML

Utwórz alert

Cytuj

Szukanie zaawansowane

Zapisano w Mojej bibliotece

e-snli-ve: Corrected visual-textual entailment with natural language explanations

Teach me to explain: A review of datasets for explainable natural language processing

Large language models are visual reasoning coordinators

Align before fuse: Vision and language representation learning with momentum distillation

Sentence representation method based on multi-layer semantic network

Measuring association between labels and free-text rationales

Probing inter-modality: Visual parsing with self-attention for vision-and-language pre-training

Nlx-gpt: A model for natural language explanations in vision and vision-language tasks

Rigorously assessing natural language explanations of neurons

Natural language rationales with full-stack visual reasoning: From pixels to semantic frames to commonsense graphs