Obserwuj
Jingwei Yi
Tytuł
Cytowane przez
Cytowane przez
Rok
Defending ChatGPT against jailbreak attack via self-reminders
Y Xie*, J Yi*, J Shao, J Curl, L Lyu, Q Chen, X Xie, F Wu
Nature Machine Intelligence, 1-11, 2023
174*2023
Efficient-FedRec: Efficient Federated Learning Framework for Privacy-Preserving News Recommendation
J Yi, F Wu, C Wu, R Liu, G Sun, X Xie
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
732021
Benchmarking and defending against indirect prompt injection attacks on large language models
J Yi*, Y Xie*, B Zhu, E Kiciman, G Sun, X Xie, F Wu
The 31th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
642023
Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark
W Peng*, J Yi*, F Wu, S Wu, B Zhu, L Lyu, B Jiao, T Xu, G Sun, X Xie
Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023
53*2023
Tiny-newsrec: Effective and efficient plm-based news recommendation
Y Yu, F Wu, C Wu, J Yi, Q Liu
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2021
37*2021
On the Vulnerability of Safety Alignment in Open-Access LLMs
J Yi*, R Ye*, Q Chen, BB Zhu, S Chen, D Lian, G Sun, X Xie, F Wu
Findings of the Association for Computational Linguistics: ACL 2024, 2023
20*2023
UA-FedRec: Untargeted Attack on Federated News Recommendation
J Yi, F Wu, B Zhu, J Yao, Z Tao, G Sun, X Xie
The 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022
182022
Debiasedrec: Bias-aware user modeling and click prediction for personalized news recommendation
J Yi, F Wu, C Wu, Q Li, G Sun, X Xie
arXiv preprint arXiv:2104.07360, 2021
132021
Control Risk for Potential Misuse of Artificial Intelligence in Science
J He*, W Feng*, Y Min*, J Yi*, K Tang, S Li, J Zhang, K Chen, W Zhou, ...
arXiv preprint arXiv:2312.06632, 2023
82023
Non-IID always Bad? Semi-Supervised Heterogeneous Federated Learning with Local Knowledge Enhancement
C Zhang, F Wu, J Yi, D Xu, Y Yu, J Wang, Y Wang, T Xu, X Xie, E Chen
Proceedings of the 32nd ACM International Conference on Information and …, 2023
82023
Effective and Efficient Query-aware Snippet Extraction for Web Search
J Yi, F Wu, C Wu, X Huang, B Jiao, G Sun, X Xie
Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022
32022
Robust Quantity-Aware Aggregation for Federated Learning
J Yi, F Wu, H Zhang, B Zhu, G Sun, X Xie
arXiv preprint arXiv:2205.10848, 2022
32022
ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models
J Yi, J Yin, J Xu, P Bao, Y Wang, W Fan, H Wang
arXiv preprint arXiv:2501.12418, 2025
2025
Elephant in the Room: Unveiling the Impact of Reward Model Quality in Alignment
Y Liu, X Yi, X Chen, J Yao, J Yi, D Zan, Z Liu, X Xie, TY Ho
arXiv preprint arXiv:2409.19024, 2024
2024
Measuring Human Contribution in AI-Assisted Content Generation
Y Xie, T Qi, J Yi, R Whalen, J Huang, Q Ding, Y Xie, X Xie, F Wu
arXiv preprint arXiv:2408.14792, 2024
2024
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–15