Yihan Wang
Verified email at ucla.edu
Title
Cited by
Year
Automatic perturbation analysis for scalable certified robustness and beyond
K Xu, Z Shi, H Zhang, Y Wang, KW Chang, M Huang, B Kailkhura, X Lin, ...
Advances in Neural Information Processing Systems 33, 1129-1141, 2020
Cited by: 305; Year: 2020
Fast and complete: Enabling complete neural network verification with rapid and massively parallel incomplete verifiers
K Xu, H Zhang, S Wang, Y Wang, S Jana, X Lin, CJ Hsieh
arXiv preprint arXiv:2011.13824, 2020
Cited by: 218; Year: 2020
Fast certified robust training with short warmup
Z Shi*, Y Wang*, H Zhang, J Yi, CJ Hsieh
Advances in Neural Information Processing Systems 34, 18335-18349, 2021
Cited by: 76; Year: 2021
Red teaming language model detectors with language models
Z Shi*, Y Wang*, F Yin*, X Chen, KW Chang, CJ Hsieh
Transactions of the Association for Computational Linguistics 12, 174-189, 2024
Cited by: 64; Year: 2024
Efficiently computing local Lipschitz constants of neural networks via bound propagation
Z Shi, Y Wang, H Zhang, JZ Kolter, CJ Hsieh
Advances in Neural Information Processing Systems 35, 2350-2364, 2022
Cited by: 39; Year: 2022
A branch and bound framework for stronger adversarial attacks of ReLU networks
H Zhang, S Wang, K Xu, Y Wang, S Jana, CJ Hsieh, Z Kolter
International Conference on Machine Learning, 26591-26604, 2022
Cited by: 33; Year: 2022
On ℓp-norm Robustness of Ensemble Stumps and Trees
Y Wang, H Zhang, H Chen, D Boning, CJ Hsieh
arXiv preprint arXiv:2008.08755, 2020
Cited by: 25*; Year: 2020
Defending LLMs against jailbreaking attacks via backtranslation
Y Wang*, Z Shi*, A Bai, CJ Hsieh
arXiv preprint arXiv:2402.16459, 2024
Cited by: 21; Year: 2024
Universality and limitations of prompt tuning
Y Wang, J Chauhan, W Wang, CJ Hsieh
Advances in Neural Information Processing Systems 36, 2024
Cited by: 18; Year: 2024
Two-stage LLM Fine-tuning with Less Specialization and More Generalization
Y Wang, S Si, D Li, M Lukasik, F Yu, CJ Hsieh, IS Dhillon, S Kumar
arXiv preprint arXiv:2211.00635, 2022
Cited by: 17*; Year: 2022
On the convergence of certified robust training with interval bound propagation
Y Wang, Z Shi, Q Gu, CJ Hsieh
arXiv preprint arXiv:2203.08961, 2022
Cited by: 10; Year: 2022
Improving the generation quality of watermarked large language models via word importance scoring
Y Li*, Y Wang*, Z Shi, CJ Hsieh
arXiv preprint arXiv:2311.09668, 2023
Cited by: 3; Year: 2023
Evaluating Worst Case Adversarial Weather Perturbations Robustness
Y Wang, Y Ba, HC Zhang, H Zhang, A Kadambi, S Soatto, A Wong, ...
NeurIPS ML Safety Workshop, 2022
Cited by: 3; Year: 2022
On the loss of context-awareness in general instruction fine-tuning
Y Wang, A Bai, N Peng, CJ Hsieh
arXiv preprint arXiv:2411.02688, 2024
Cited by: 1; Year: 2024