Satyapriya Krishna
Verified email at g.harvard.edu
Title
Cited by
Year
BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation
J Dhamala, T Sun, V Kumar, S Krishna, Y Pruksachatkun, KW Chang, ...
ACM FAccT Conference 2021, 2021
Cited by: 375, Year: 2021
The Disagreement Problem in Explainable Machine Learning: A Practitioner's Perspective
S Krishna, T Han, A Gu, J Pombra, S Jabbari, S Wu, H Lakkaraju
Transactions on Machine Learning Research, 2024
Cited by: 245, Year: 2024
OpenXAI: Towards a Transparent Evaluation of Model Explanations
C Agarwal, S Krishna, E Saxena, M Pawelczyk, N Johnson, I Puri, M Zitnik, ...
Advances in neural information processing systems 35, 15784-15799, 2022
Cited by: 180, Year: 2022
Explaining machine learning models with interactive natural language conversations using TalkToModel
D Slack, S Krishna, H Lakkaraju, S Singh
Nature Machine Intelligence, 1-11, 2023
Cited by: 106*, Year: 2023
Black-Box Access is Insufficient for Rigorous AI Audits
S Casper, C Ezell, C Siegmann, N Kolt, TL Curtis, B Bucknall, A Haupt, ...
ACM FAccT Conference 2024, 2024
Cited by: 78, Year: 2024
Post Hoc Explanations of Language Models Can Improve Language Models
S Krishna, J Ma, D Slack, A Ghandeharioun, S Singh, H Lakkaraju
Advances in Neural Information Processing Systems 36, 2023
Cited by: 64, Year: 2023
Rethinking Stability for Attribution-based Explanations
C Agarwal, N Johnson, M Pawelczyk, S Krishna, E Saxena, M Zitnik, ...
ICLR 2022 Workshop on PAIR^2Struct: Privacy, Accountability …, 2022
Cited by: 58, Year: 2022
Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal
U Gupta, J Dhamala, V Kumar, A Verma, Y Pruksachatkun, S Krishna, ...
Findings of the Association for Computational Linguistics: ACL 2022, 2022
Cited by: 50, Year: 2022
ADePT: Auto-encoder based Differentially Private Text Transformation
S Krishna, R Gupta, C Dupuy
Proceedings of the 16th Conference of the European Chapter of the …, 2021
Cited by: 49, Year: 2021
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence
B Peng, D Goldstein, Q Anthony, A Albalak, E Alcaide, S Biderman, ...
arXiv preprint arXiv:2404.05892 3, 2024
Cited by: 45, Year: 2024
Does Robustness Improve Fairness? Approaching Fairness with Word Substitution Robustness Methods for Text Classification
S Krishna*, Y Pruksachatkun*, J Dhamala, R Gupta, KW Chang
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, 2021
Cited by: 34*, Year: 2021
Croissant: A Metadata Format for ML-Ready Datasets
Published at the NeurIPS 2024 Datasets and Benchmark Track. A shorter …, 2024
Cited by: 31*, Year: 2024
Are large language models post hoc explainers?
N Kroeger, D Ley, S Krishna, C Agarwal, H Lakkaraju
Cited by: 29, Year: 2023
The disagreement problem in explainable machine learning: A practitioner’s perspective, 2022
S Krishna, T Han, A Gu, J Pombra, S Jabbari, S Wu, H Lakkaraju
arXiv preprint arXiv:2202.01602
Cited by: 16
Towards Bridging the Gaps between the Right to Explanation and the Right to be Forgotten
S Krishna, J Ma, H Lakkaraju
The Fortieth International Conference on Machine Learning (ICML), 2023
Cited by: 15, Year: 2023
The disagreement problem in explainable machine learning: A practitioner’s perspective. arXiv
S Krishna, T Han, A Gu, J Pombra, S Jabbari, S Wu, H Lakkaraju
arXiv preprint arXiv:2202.01602 10, 2022
Cited by: 14, Year: 2022
Measuring Fairness of Text Classifiers via Prediction Sensitivity
S Krishna, R Gupta, A Verma, J Dhamala, Y Pruksachatkun, KW Chang
Proceedings of the 60th Annual Meeting of the Association for Computational …, 2022
Cited by: 11, Year: 2022
Operationalizing a Threat Model for Red-Teaming Large Language Models (LLMs)
A Verma, S Krishna, S Gehrmann, M Seshadri, A Pradhan, T Ault, ...
arXiv preprint arXiv:2407.14937, 2024
Cited by: 9, Year: 2024
Understanding the Effects of Iterative Prompting on Truthfulness
S Krishna, C Agarwal, H Lakkaraju
Forty-first International Conference on Machine Learning, 2024
Cited by: 9, Year: 2024
On the Intersection of Self-Correction and Trust in Language Models
S Krishna
arXiv preprint arXiv:2311.02801, 2023
Cited by: 8, Year: 2023