Do Vision-Language Pretrained Models Learn Composable Primitive Concepts? T Yun, U Bhalla, E Pavlick, C Sun Transactions on Machine Learning Research, 2022 | 40* | 2022 |
Interpreting CLIP with Sparse Linear Concept Embeddings (SpLiCE) U Bhalla, A Oesterling, S Srinivas, FP Calmon, H Lakkaraju Advances in Neural Information Processing Systems 37, 2024 | 24 | 2024 |
Discriminative Feature Attributions: Bridging Post Hoc Explainability and Inherent Interpretability U Bhalla, S Srinivas, H Lakkaraju Advances in Neural Information Processing Systems 36, 2023 | 14* | 2023 |
Operationalizing the Blueprint for an AI Bill of Rights: Recommendations for Practitioners, Researchers, and Policy Makers A Oesterling, U Bhalla, S Venkatasubramanian, H Lakkaraju arXiv preprint arXiv:2407.08689, 2024 | 2 | 2024 |
Towards Unifying Interpretability and Control: Evaluation via Intervention U Bhalla, S Srinivas, A Ghandeharioun, H Lakkaraju arXiv preprint arXiv:2411.04430, 2024 | 1 | 2024 |
Building Bridges, Not Walls--Advancing Interpretability by Unifying Feature, Data, and Model Component Attribution S Zhang, T Han, U Bhalla, H Lakkaraju arXiv preprint arXiv:2501.18887, 2025 | | 2025 |
All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models C Badrinath, U Bhalla, A Oesterling, S Srinivas, H Lakkaraju ICML 2024 Workshops, 2024 | | 2024 |
Analysis of Accuracy and Precision of Recommended Protocols for Dynamic Susceptibility Contrast MRI for Brain Metastases NB Semmineh, U Bhalla, LC Bell, AM Stokes, MD Lee, L Hu, ... | | |