Driving with LLMs: Fusing object-level vector modality for explainable autonomous driving
Large Language Models (LLMs) have shown promise in the autonomous driving sector,
particularly in generalization and interpretability. We introduce a unique object-level …
Gradient-based visual explanation for transformer-based CLIP
Significant progress has been achieved on the improvement and downstream usages of the
Contrastive Language-Image Pre-training (CLIP) vision-language model, while less …
ToxiSpanSE: An explainable toxicity detection in code review comments
Background: The existence of toxic conversations in open-source platforms can degrade
relationships among software developers and may negatively impact software product …
Eatformer: Improving vision transformer inspired by evolutionary algorithm
Motivated by biological evolution, this paper explains the rationality of Vision Transformer by
analogy with the proven practical evolutionary algorithm (EA) and derives that both have …
Token Transformation Matters: Towards Faithful Post-hoc Explanation for Vision Transformer
While Transformers have rapidly gained popularity in various computer vision applications,
post-hoc explanations of their internal mechanisms remain largely unexplored. Vision …
A simple interpretable transformer for fine-grained image classification and analysis
We present a novel usage of Transformers to make image classification interpretable. Unlike
mainstream classifiers that wait until the last fully connected layer to incorporate class …
SkipPLUS: Skip the First Few Layers to Better Explain Vision Transformers
Despite their remarkable performance, the explainability of Vision Transformers (ViTs)
remains a challenge. While forward attention-based token attribution techniques have …
Reduction of class activation uncertainty with background information
HM Kabir - arXiv preprint arXiv:2305.03238, 2023 - arxiv.org
Multitask learning is a popular approach to training high-performing neural networks with
improved generalization. In this paper, we propose a background class to achieve improved …
Sparse-Tuning: Adapting vision transformers with efficient fine-tuning and inference
Parameter-efficient fine-tuning (PEFT) has emerged as a popular solution for adapting pre-
trained Vision Transformer (ViT) models to downstream applications. While current PEFT …
On the Faithfulness of Vision Transformer Explanations
To interpret Vision Transformers, post-hoc explanations assign salience scores to
input pixels, providing human-understandable heatmaps. However, whether these …