Følg
Jiancong Xiao
Tittel
Sitert av
Sitert av
År
Stability analysis and generalization bounds of adversarial training
J Xiao, Y Fan, R Sun, J Wang, ZQ Luo
NeurIPS 2022, Spotlight, 2022
422022
Adversarial rademacher complexity of deep neural networks
J Xiao, Y Fan, R Sun, ZQ Luo
arXiv preprint arXiv:2211.14966, 2022
192022
Understanding adversarial robustness against on-manifold adversarial examples
J Xiao, L Yang, Y Fan, J Wang, ZQ Luo
Pattern Recognition 159, 111071, 2022
142022
PAC-bayesian spectrally-normalized bounds for adversarially robust generalization
J Xiao, R Sun, ZQ Luo
NeurIPS 2023, 2023
102023
Improving Adversarial Training for Multiple Perturbations through the Lens of Uniform Stability
J Xiao, Z Qin, Y Fan, B Wu, J Wang, ZQ Luo
ICML 2023 AdvML-Frontiers Workshop, 2023
10*2023
On the algorithmic bias of aligning large language models with rlhf: Preference collapse and matching regularization
J Xiao, Z Li, X Xie, E Getzen, C Fang, Q Long, WJ Su
arXiv preprint arXiv:2405.16455, 2024
82024
Smoothed-sgdmax: A stability-inspired algorithm to improve adversarial generalization
J Xiao, J Zhang, ZQ Luo, AE Ozdaglar
NeurIPS 2022 ML Safety Workshop, 2022
72022
Pac-bayesian adversarially robust generalization bounds for deep neural networks
J Xiao, R Sun, ZQ Luo
The Second Workshop on New Frontiers in Adversarial Machine Learning, 2023
52023
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity
Z Li, C Chen, T Xu, Z Qin, J Xiao, R Sun, ZQ Luo
ICLR 2025, 2024
4*2024
Bridging the Gap: Rademacher Complexity in Robust and Standard Generalization
J Xiao, R Sun, Q Long, WJ Su
COLT 2024, 2024
4*2024
Uniformly Stable Algorithms for Adversarial Training and Beyond
J Xiao, J Zhang, ZQ Luo, A Ozdaglar
ICML 2024, 2024
32024
Fine-Tuning Attention Modules Only: Enhancing Weight Disentanglement in Task Arithmetic
R Jin, B Hou, J Xiao, WJ Su, L Shen
ICLR 2025, 2024
3*2024
Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment
M Wang, C Ma, Q Chen, L Meng, Y Han, J Xiao, Z Zhang, J Huo, WJ Su, ...
arXiv preprint arXiv:2410.16714, 2024
2024
Magnetic Mirror Descent Self-play Preference Optimization
M Wang, C Ma, Q Chen, L Meng, Y Han, J Xiao, Z Zhang, J Huo, WJ Su, ...
ICLR 2025, 2024
2024
Understanding Adversarially Robust Generalization: A Learning Theory Perspective
J Xiao
The Chinese University of Hong Kong, Shenzhen, 2023
2023
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–15