Google Scholar

Multimodal learning with transformers: A survey

P Xu, X Zhu, DA Clifton - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Transformer is a promising neural network learner, and has achieved great success in
various machine learning tasks. Thanks to the recent prevalence of multimodal applications …

Cited by 644

[PDF] arxiv.org

How to reuse and compose knowledge for a lifetime of tasks: A survey on continual learning and functional composition

JA Mendez, E Eaton - arXiv preprint arXiv:2207.07730, 2022 - arxiv.org

A major goal of artificial intelligence (AI) is to create an agent capable of acquiring a general
understanding of the world. Such an agent would require the ability to continually …

Cited by 33

[PDF] ecva.net

Dynamically transformed instance normalization network for generalizable person re-identification

B Jiao, L Liu, L Gao, G Lin, L Yang, S Zhang… - European conference on …, 2022 - Springer

Existing person re-identification methods often suffer significant performance degradation on
unseen domains, which fuels interest in domain generalizable person re-identification (DG …

Cited by 42

Interpretability for reliable, efficient, and self-cognitive DNNs: From theories to applications

X Kang, J Guo, B Song, B Cai, H Sun, Z Zhang - Neurocomputing, 2023 - Elsevier

In recent years, remarkable achievements have been made in artificial intelligence tasks
and applications based on deep neural networks (DNNs), especially in the fields of vision …

Cited by 6

[PDF] cell.com

CX-ToM: Counterfactual explanations with theory-of-mind for enhancing human trust in image recognition models

AR Akula, K Wang, C Liu, S Saba-Sadiya, H Lu… - iScience, 2022 - cell.com

We propose CX-ToM, short for counterfactual explanations with theory-of-mind, a new
explainable AI (XAI) framework for explaining decisions made by a deep convolutional …

Cited by 55

[PDF] arxiv.org

Knowledge-augmented deep learning and its applications: A survey

Z Cui, T Gao, K Talamadupula… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Deep learning models, though having achieved great success in many different fields over
the past years, are usually data-hungry, fail to perform well on unseen samples, and lack …

Cited by 22

[PDF] arxiv.org

Reconstructing action-conditioned human-object interactions using commonsense knowledge priors

X Wang, G Li, YL Kuo, M Kocabas… - … Conference on 3D …, 2022 - ieeexplore.ieee.org

We present a method for inferring diverse 3D models of human-object interactions from
images. Reasoning about how humans interact with objects in complex scenes from a single …

Cited by 23

[PDF] openreview.net

Eqa-mx: Embodied question answering using multimodal expression

MM Islam, A Gladstone, R Islam… - The Twelfth International …, 2023 - openreview.net

Humans predominantly use verbal utterances and nonverbal gestures (e.g., eye gaze and
pointing gestures) in their natural interactions. For instance, pointing gestures and verbal …

Cited by 8

[PDF] ecva.net

Compositional Substitutivity of Visual Reasoning for Visual Question Answering

C Li, Z Li, C Jing, Y Wu, M Zhai, Y Jia - European Conference on Computer …, 2024 - Springer

Compositional generalization has received much attention in vision-and-language and
visual reasoning recently. Substitutivity, the capability to generalize to novel compositions …

Cited by 2

[PDF] aaai.org

Patron: perspective-aware multitask model for referring expression grounding using embodied multimodal cues

MM Islam, A Gladstone, T Iqbal - … of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

Humans naturally use referring expressions with verbal utterances and nonverbal gestures
to refer to objects and events. As these referring expressions can be interpreted differently …

Cited by 4

Robust visual reasoning via language guided neural module networks

Multimodal learning with transformers: A survey