Zidi Xiong
Verified email at g.harvard.edu - Homepage
Title · Cited by · Year
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models.
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
NeurIPS, 2023
Cited by 383 · 2023
Badchain: Backdoor chain-of-thought prompting for large language models
Z Xiang, F Jiang, Z Xiong, B Ramasubramanian, R Poovendran, B Li
arXiv preprint arXiv:2401.12242, 2024
Cited by 57 · 2024
Rigorllm: Resilient guardrails for large language models against undesired content
Z Yuan, Z Xiong, Y Zeng, N Yu, R Jia, D Song, B Li
arXiv preprint arXiv:2403.13031, 2024
Cited by 30 · 2024
Umd: Unsupervised model detection for x2x backdoor attacks
Z Xiang, Z Xiong, B Li
International Conference on Machine Learning, 38013-38038, 2023
Cited by 15 · 2023
GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning
Z Xiang, L Zheng, Y Li, J Hong, Q Li, H Xie, J Zhang, Z Xiong, C Xie, ...
arXiv preprint arXiv:2406.09187, 2024
Cited by 10 · 2024
CBD: A certified backdoor detector based on local dominant probability
Z Xiang, Z Xiong, B Li
Advances in Neural Information Processing Systems 36, 2024
Cited by 9 · 2024
Label-smoothed backdoor attack
M Peng, Z Xiong, M Sun, P Li
arXiv preprint arXiv:2202.11203, 2022
Cited by 9 · 2022
DecodingTrust: A comprehensive assessment of trustworthiness in GPT models. arXiv
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
arXiv preprint arXiv:2306.11698, 2024
Cited by 8 · 2024
DecodingTrust: A Comprehensive Assessment of Trustworthiness in GPT Models (2023)
B Wang, W Chen, H Pei, C Xie, M Kang, C Zhang, C Xu, Z Xiong, R Dutta, ...
Cited by 7 · 2023
Backdoor chain-of-thought prompting for large language models
Z Xiang, F Jiang, Z Xiong, B Ramasubramanian, R Poovendran, B Li
NeurIPS Workshops, 2023
Cited by 5 · 2023
Rethinking the Necessity of Labels in Backdoor Removal
Z Xiong, D Wu, Y Wang, Y Wang
ICLR 2023 Workshop on Backdoor Attacks and Defenses in Machine Learning, 2023
Cited by 1 · 2023