Følg
Xiaoying Zhang
Xiaoying Zhang
Bytedance Inc.
Verifisert e-postadresse på bytedance.com
Tittel
Sitert av
Sitert av
År
Trustworthy llms: a survey and guideline for evaluating large language models' alignment
Y Liu, Y Yao, JF Ton, X Zhang, R Guo, H Cheng, Y Klochkov, MF Taufiq, ...
arXiv preprint arXiv:2308.05374, 2023
2892023
Conversational contextual bandit: Algorithm and application
X Zhang, H Xie, H Li, J CS Lui
Proceedings of the web conference 2020, 662-672, 2020
1082020
Modeling the assimilation-contrast effects in online product rating systems: Debiasing and recommendations
X Zhang, J Zhao, JCS Lui
Proceedings of the Eleventh ACM Conference on Recommender Systems, 98-106, 2017
492017
Debiasing recommendation by learning identifiable latent confounders
Q Zhang, X Zhang, Y Liu, H Wang, M Gao, J Zhang, R Guo
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
182023
Sybil detection in social-activity networks: Modeling, algorithms and evaluations
X Zhang, H Xie, JCS Lui
2018 IEEE 26th International Conference on Network Protocols (ICNP), 44-54, 2018
172018
Disentangled representation for diversified recommendations
X Zhang, H Wang, H Li
Proceedings of the Sixteenth ACM International Conference on Web Search and …, 2023
142023
Understanding assimilation-contrast effects in online rating systems: modelling, debiasing, and applications
X Zhang, H Xie, J Zhao, JCS Lui
ACM Transactions on Information Systems (TOIS) 38 (1), 1-25, 2019
142019
Overcoming reward overoptimization via adversarial policy optimization with lightweight uncertainty estimation
X Zhang, JF Ton, W Shen, H Wang, Y Liu
arXiv preprint arXiv:2403.05171, 2024
132024
Toward building conversational recommender systems: A contextual bandit approach
X Zhang, H Xie, H Li, JCS Lui
arXiv preprint arXiv:1906.01219, 2019
122019
Enhancing sybil detection via social-activity networks: A random walk approach
X Zhang, H Xie, P Yi, JCS Lui
IEEE Transactions on Dependable and Secure Computing 20 (2), 1213-1227, 2022
102022
Improving reinforcement learning from human feedback using contrastive rewards
W Shen, X Zhang, Y Yao, R Zheng, H Guo, Y Liu
arXiv preprint arXiv:2403.07708, 2024
82024
Uncertainty-aware instance reweighting for off-policy learning
X Zhang, J Chen, H Wang, H Xie, Y Liu, J Lui, H Li
Advances in Neural Information Processing Systems 36, 73691-73718, 2023
72023
Heterogeneous information assisted bandit learning: Theory and application
X Zhang, H Xie, JCS Lui
2021 IEEE 37th International Conference on Data Engineering (ICDE), 2135-2140, 2021
32021
Toward optimal llm alignments using two-player games
R Zheng, H Guo, Z Liu, X Zhang, Y Yao, X Xu, Z Wang, Z Xi, T Gui, ...
arXiv preprint arXiv:2406.10977, 2024
22024
Improving bandit learning via heterogeneous information networks: algorithms and applications
X Zhang, H Xie, JCS Lui
ACM Transactions on Knowledge Discovery from Data (TKDD) 16 (6), 1-25, 2022
12022
Retention Depolarization in Recommender System
X Zhang, H Wang, Y Liu
Proceedings of the ACM Web Conference 2024, 1126-1137, 2024
2024
Systemet kan ikke utføre handlingen. Prøv på nytt senere.
Artikler 1–16