Title | Authors | Venue | Cited by | Year |
A survey on bias and fairness in machine learning | N Mehrabi, F Morstatter, N Saxena, K Lerman, A Galstyan | ACM computing surveys (CSUR) 54 (6), 1-35, 2021 | 5610 | 2021 |
Exacerbating Algorithmic Bias through Fairness Attacks | N Mehrabi, M Naveed, F Morstatter, A Galstyan | Proceedings of the AAAI Conference on Artificial Intelligence, 2021 | 90 | 2021 |
Man is to person as woman is to location: Measuring gender bias in named entity recognition | N Mehrabi, T Gowda, F Morstatter, N Peng, A Galstyan | Proceedings of the 31st ACM conference on Hypertext and Social Media, 231-232, 2020 | 72 | 2020 |
FLIRT: Feedback loop in-context red teaming | N Mehrabi, P Goyal, C Dupuy, Q Hu, S Ghosh, R Zemel, KW Chang, ... | EMNLP 2024, 2023 | 47 | 2023 |
DynamicGEM: A library for dynamic graph embedding methods | P Goyal, SR Chhetri, N Mehrabi, E Ferrara, A Canedo | arXiv preprint arXiv:1811.10734, 2018 | 45 | 2018 |
Lawyers are Dishonest? Quantifying Representational Harms in Commonsense Knowledge Resources | N Mehrabi, P Zhou, F Morstatter, J Pujara, X Ren, A Galstyan | Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 43 | 2021 |
Debiasing community detection: the importance of lowly connected nodes | N Mehrabi, F Morstatter, N Peng, A Galstyan | Proceedings of the 2019 IEEE/ACM international conference on advances in …, 2019 | 41 | 2019 |
Attributing fair decisions with attention interventions | N Mehrabi, U Gupta, F Morstatter, GV Steeg, A Galstyan | Proceedings of the 2nd Workshop on Trustworthy Natural Language Processing …, 2021 | 35 | 2021 |
Robust Conversational Agents against Imperceptible Toxicity Triggers | N Mehrabi, A Beirami, F Morstatter, A Galstyan | Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 31 | 2022 |
Is the elephant flying? Resolving ambiguities in text-to-image generative models | N Mehrabi, P Goyal, A Verma, J Dhamala, V Kumar, Q Hu, KW Chang, ... | ACL 2023, 2022 | 16* | 2022 |
On the steerability of large language models toward data-driven personas | J Li, N Mehrabi, C Peris, P Goyal, KW Chang, A Galstyan, R Zemel, ... | NAACL 2024, 2023 | 14 | 2023 |
Towards multi-objective statistically fair federated learning | N Mehrabi, C de Lichy, J McKay, C He, W Campbell | Trustable, Verifiable and Auditable Federated Learning @ AAAI 2022, 2022 | 13 | 2022 |
Statistical equity: A fairness classification objective | N Mehrabi, Y Huang, F Morstatter | arXiv preprint arXiv:2005.07293, 2020 | 11 | 2020 |
Prompt Perturbation Consistency Learning for Robust Language Models | Y Qiang, S Nandi, N Mehrabi, GV Steeg, A Kumar, A Rumshisky, ... | EACL 2024 Findings, 2024 | 9 | 2024 |
Are you talking to ['xem'] or ['x','em']? On Tokenization and Addressing Misgendering in LLMs with Pronoun Tokenization Parity | A Ovalle, N Mehrabi, P Goyal, J Dhamala, KW Chang, R Zemel, ... | NAACL 2024 Findings, 2023 | 9 | 2023 |
The leaky pipeline in physics publishing | CO Ross, A Gupta, N Mehrabi, G Muric, K Lerman | arXiv preprint arXiv:2010.08912, 2020 | 6 | 2020 |
Tokenization matters: Navigating data-scarce tokenization for gender inclusive language technologies | A Ovalle, N Mehrabi, P Goyal, J Dhamala, KW Chang, R Zemel, ... | Findings of the Association for Computational Linguistics: NAACL 2024, 1739-1756, 2024 | 5 | 2024 |
JAB: Joint adversarial prompting and belief augmentation | N Mehrabi, P Goyal, A Ramakrishna, J Dhamala, S Ghosh, R Zemel, ... | R0-FoMo @ NeurIPS 2023, 2023 | 5 | 2023 |
Data advisor: Dynamic data curation for safety alignment of large language models | F Wang, N Mehrabi, P Goyal, R Gupta, KW Chang, A Galstyan | arXiv preprint arXiv:2410.05269, 2024 | 4 | 2024 |
Where Does Bias in Common Sense Knowledge Models Come From? | S Melotte, F Ilievski, L Zhang, A Malte, N Mutha, F Morstatter, N Mehrabi | IEEE Internet Computing 26 (4), 12-20, 2022 | 4 | 2022 |