Yi Zeng

Geciteerd door

	Alles	Sinds 2020
Citaties	2605	2585
h-index	21	21
i10-index	30	30

1400

700

350

1050

201920202021202220232024202517 56 141 292 567 1346 175

Openbare toegang

Alles bekijken

11 artikelen

6 artikelen

beschikbaar

niet beschikbaar

Op basis van financieringsmachtigingen

Medeauteurs

Ruoxi JiaAssistant Professor, Virginia TechGeverifieerd e-mailadres voor vt.edu
Meikang Qiu, ACM Distinguished Scie...Professor, Highly Cited Researcher, IEEE CS Distinguished Contributor, Augusta University, GAGeverifieerd e-mailadres voor augusta.edu
Lingjuan LyuSonyGeverifieerd e-mailadres voor sony.com
Han Qiu (邱寒)Tsinghua UniversityGeverifieerd e-mailadres voor tsinghua.edu.cn
Peter HendersonPrinceton UniversityGeverifieerd e-mailadres voor princeton.edu
Tianwei ZhangNanyang Technological UniversityGeverifieerd e-mailadres voor ntu.edu.sg
Bo LiUniversity of Illinois at Urbana–ChampaignGeverifieerd e-mailadres voor illinois.edu
Ming JinVirginia TechGeverifieerd e-mailadres voor vt.edu
Prateek MittalProfessor, Princeton UniversityGeverifieerd e-mailadres voor princeton.edu
Dawn SongProfessor of Computer Science, UC BerkeleyGeverifieerd e-mailadres voor cs.berkeley.edu
Pin-Yu ChenPrincipal Research Scientist, IBM Research AI; MIT-IBM Watson AI Lab; RPI-IBM AIRCGeverifieerd e-mailadres voor ibm.com
Weiyan ShiNortheastern UniversityGeverifieerd e-mailadres voor northeastern.edu
Diyi YangStanford UniversityGeverifieerd e-mailadres voor stanford.edu
Percy LiangAssociate Professor of Computer Science, Stanford UniversityGeverifieerd e-mailadres voor cs.stanford.edu
Yangsibo HuangGoogleGeverifieerd e-mailadres voor google.com
Z. Morley Mao (茅斫青)University of MichiganGeverifieerd e-mailadres voor umich.edu
Jingwen ZhangAssociate Professor, Department of Communication, Department of Public Health Sciences, UCDGeverifieerd e-mailadres voor ucdavis.edu
Arvind NarayananProfessor, Princeton UniversityGeverifieerd e-mailadres voor cs.princeton.edu
Ning YuNetflix Eyeline StudiosGeverifieerd e-mailadres voor scanlinevfx.com
Cho-Jui HsiehUniversity of California, Los AngelesGeverifieerd e-mailadres voor cs.ucla.edu

Volgen

Yi Zeng

PhD Candidate, Virginia Tech

Geverifieerd e-mailadres voor vt.edu - Homepage

AI Security AI Safety Deep Learning Responsible AI


Titel Sorteren op citaties Sorteren op jaar Sorteren op titel	Geciteerd door Geciteerd door	Jaar
Fine-tuning aligned language models compromises safety, even when users do not intend to! X Qi, Y Zeng, T Xie, PY Chen, R Jia, P Mittal, P Henderson ICLR 2024 Oral (top 1.2%), 2024	431	2024
: A Deep Learning Based Network Encrypted Traffic Classification and Intrusion Detection Framework Y Zeng, H Gu, W Wei, Y Guo IEEE Access 7, 45182-45190, 2019	270	2019
Rethinking the Backdoor Attacks' Triggers: A Frequency Perspective Y Zeng, W Park, ZM Mao, R Jia International Conference on Computer Vision (ICCV), 2021, 2021	257	2021
Deepsweep: An evaluation framework for mitigating DNN backdoor attacks using data augmentation H Qiu, Y Zeng, S Guo, T Zhang, M Qiu, B Thuraisingham Proceedings of the 2021 ACM Asia Conference on Computer and Communications …, 2021	236*	2021
Adversarial Unlearning of Backdoors via Implicit Hypergradient Y Zeng, S Chen, W Park, ZM Mao, M Jin, R Jia The Tenth International Conference on Learning Representations (ICLR 2022), 2021	198	2021
Narcissus: A practical clean-label backdoor attack with limited information Y Zeng, M Pan, HA Just, L Lyu, M Qiu, R Jia ACM SIGSAC Conference on Computer and Communications Security (CCS), 2023	190	2023
How johnny can persuade llms to jailbreak them: Rethinking persuasion to challenge ai safety by humanizing llms Y Zeng, H Lin, J Zhang, D Yang, R Jia, W Shi ACL 2024 (Best Social Impact Award), 2024	188	2024
Cater: Intellectual property protection on text generation apis via conditional watermarks X He, Q Xu, Y Zeng, L Lyu, F Wu, J Li, R Jia Advances in Neural Information Processing Systems 35, 5431-5445, 2022	81	2022
A data augmentation-based defense method against adversarial attacks in neural networks Y Zeng, H Qiu, G Memmi, M Qiu Algorithms and Architectures for Parallel Processing: 20th International …, 2020	78	2020
DeepVCM: A deep learning based intrusion detection method in VANET Y Zeng, M Qiu, D Zhu, Z Xue, J Xiong, M Liu 2019 IEEE 5th intl conference on big data security on cloud (BigDataSecurity …, 2019	72	2019
LAVA: Data Valuation without Pre-Specified Learning Algorithms HA Just, F Kang, JT Wang, Y Zeng, M Ko, M Jin, R Jia The Eleventh International Conference on Learning Representations (ICLR 2023), 2023	60	2023
Senior2local: A machine learning based intrusion detection method for vanets Y Zeng, M Qiu, Z Ming, M Liu Smart Computing and Communication: Third International Conference, SmartCom …, 2018	59	2018
Fine-tuning Is Not Enough: A Simple yet Effective Watermark Removal Attack for DNN Models S Guo, T Zhang, H Qiu, Y Zeng, T Xiang, Y Liu International Joint Conference on Artificial Intelligence (IJCAI), 2021, 2021	58*	2021
An efficient preprocessing-based approach to mitigate advanced adversarial attacks H Qiu, Y Zeng, Q Zheng, S Guo, T Zhang, H Li IEEE Transactions on Computers 73 (3), 645-655, 2021	46*	2021
A safe harbor for ai evaluation and red teaming S Longpre, S Kapoor, K Klyman, A Ramaswami, R Bommasani, ... ICML 2024, 2024	33	2024
Sorry-bench: Systematically evaluating large language model safety refusal behaviors T Xie, X Qi, Y Zeng, Y Huang, UM Sehwag, K Huang, L He, B Wei, D Li, ... ICLR 2025, 2025	31	2025
Introducing v0. 5 of the ai safety benchmark from mlcommons B Vidgen, A Agrawal, AM Ahmed, V Akinwande, N Al-Nuaimi, N Alfaraj, ... arXiv preprint arXiv:2404.12241, 2024	31	2024
RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content Z Yuan, Z Xiong, Y Zeng, N Yu, R Jia, D Song, B Li ICML 2024, 2024	30	2024
META-SIFT: How to Sift Out a Clean Data Subset in the Presence of Data Poisoning? Y Zeng, M Pan, H Jahagirdar, M Jin, L Lyu, R Jia USENIX Security Symposium, 2023, 2023	29*	2023
ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms M Pan, Y Zeng, L Lyu, X Lin, R Jia USENIX Security Symposium, 2023, 2023	28	2023

Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.

Artikelen 1–20

Citaties per jaar

Dubbele citaties

Samengevoegde citaties

Medeauteurs toevoegenMedeauteurs

Volgen

Geciteerd door

Medeauteurs