Urmăriți
Jingwei Yi
Jingwei Yi
Adresă de e-mail confirmată pe mail.ustc.edu.cn - Pagina de pornire
Titlu
Citat de
Citat de
Anul
Defending ChatGPT against jailbreak attack via self-reminders
Y Xie*, J Yi*, J Shao, J Curl, L Lyu, Q Chen, X Xie, F Wu
Nature Machine Intelligence, 1-11, 2023
205*2023
Benchmarking and defending against indirect prompt injection attacks on large language models
J Yi*, Y Xie*, B Zhu, E Kiciman, G Sun, X Xie, F Wu
The 31th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
772023
Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation
J Yi, F Wu, C Wu, R Liu, G Sun, X Xie
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
732021
Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark
W Peng*, J Yi*, F Wu, S Wu, B Zhu, L Lyu, B Jiao, T Xu, G Sun, X Xie
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
54*2023
Tiny-newsrec: Effective and efficient plm-based news recommendation
Y Yu, F Wu, C Wu, J Yi, Q Liu
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2021
36*2021
On the Vulnerability of Safety Alignment in Open-Access LLMs
J Yi*, R Ye*, Q Chen, BB Zhu, S Chen, D Lian, G Sun, X Xie, F Wu
Findings of the Association for Computational Linguistics: ACL 2024, 2023
23*2023
UA-FedRec: Untargeted Attack on Federated News Recommendation
J Yi, F Wu, B Zhu, J Yao, Z Tao, G Sun, X Xie
The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022
162022
Debiasedrec: Bias-aware user modeling and click prediction for personalized news recommendation
J Yi, F Wu, C Wu, Q Li, G Sun, X Xie
arXiv preprint arXiv:2104.07360, 2021
142021
Control Risk for Potential Misuse of Artificial Intelligence in Science
J He*, W Feng*, Y Min*, J Yi*, K Tang, S Li, J Zhang, K Chen, W Zhou, ...
arXiv preprint arXiv:2312.06632, 2023
82023
Non-iid always bad? semi-supervised heterogeneous federated learning with local knowledge enhancement
C Zhang, F Wu, J Yi, D Xu, Y Yu, J Wang, Y Wang, T Xu, X Xie, E Chen
Proceedings of the 32nd ACM International Conference on Information and …, 2023
82023
Effective and Efficient Query-aware Snippet Extraction for Web Search
J Yi, F Wu, C Wu, X Huang, B Jiao, G Sun, X Xie
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
32022
Robust Quantity-Aware Aggregation for Federated Learning
J Yi, F Wu, H Zhang, B Zhu, G Sun, X Xie
arXiv preprint arXiv:2205.10848, 2022
32022
ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models
J Yi, J Yin, J Xu, P Bao, Y Wang, W Fan, H Wang
arXiv preprint arXiv:2501.12418, 2025
2025
Elephant in the Room: Unveiling the Impact of Reward Model Quality in Alignment
Y Liu, X Yi, X Chen, J Yao, J Yi, D Zan, Z Liu, X Xie, TY Ho
arXiv preprint arXiv:2409.19024, 2024
2024
Measuring Human Contribution in AI-Assisted Content Generation
Y Xie, T Qi, J Yi, R Whalen, J Huang, Q Ding, Y Xie, X Xie, F Wu
arXiv preprint arXiv:2408.14792, 2024
2024
Elephant in the Room: Unveiling the Pitfalls of Human Proxies in Alignment
Y Liu, X Yi, X Chen, J Yao, J Yi, D Zan, Z Liu, X Xie, TY Ho
Sistemul nu poate realiza operația în acest moment. Încercați din nou mai târziu.
Articole 1–16