Leveraging Scale-aware Representations for improved Concept-Representation Alignment in ViTs

S Sinha, G **ong, A Zhang - arxiv preprint arxiv:2501.09221, 2025 - arxiv.org
Vision Transformers (ViTs) are increasingly being adopted in various sensitive vision
applications-like medical diagnosis, facial recognition, etc. To improve the interpretability of …

Structural Causality-based Generalizable Concept Discovery Models

S Sinha, G **ong, A Zhang - arxiv preprint arxiv:2410.15491, 2024 - arxiv.org
The rising need for explainable deep neural network architectures has utilized semantic
concepts as explainable units. Several approaches utilizing disentangled representation …