Changnan Xiao
Bytedance
Verified email at bytedance.com
Title
Cited by
Year
A theoretical study on solving continual learning
G Kim, C Xiao, T Konishi, Z Ke, B Liu
Advances in neural information processing systems 35, 5065-5079, 2022
Cited by: 89; Year: 2022
Learnability and algorithm for continual learning
G Kim, C Xiao, T Konishi, B Liu
International Conference on Machine Learning, 16877-16896, 2023
Cited by: 36; Year: 2023
Continual learning based on ood detection and task masking
G Kim, S Esmaeilpour, C Xiao, B Liu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 29; Year: 2022
Open-world continual learning: Unifying novelty detection and continual learning
G Kim, C Xiao, T Konishi, Z Ke, B Liu
Artificial Intelligence 338, 104237, 2025
Cited by: 15; Year: 2025
Generalized data distribution iteration
J Fan, C Xiao
arXiv preprint arXiv:2206.03192, 2022
Cited by: 15; Year: 2022
Gdi: Rethinking what makes reinforcement learning different from supervised learning
J Fan, C Xiao, Y Huang
arXiv preprint arXiv:2106.06232, 2021
Cited by: 14; Year: 2021
Mastering strategy card game (Hearthstone) with improved techniques
C Xiao, Y Zhang, X Huang, Q Huang, J Chen
2023 IEEE Conference on Games (CoG), 1-8, 2023
Cited by: 12; Year: 2023
Mastering strategy card game (legends of code and magic) via end-to-end policy and optimistic smooth fictitious play
W Xi, Y Zhang, C Xiao, X Huang, S Deng, H Liang, J Chen, P Sun
arXiv preprint arXiv:2303.04096, 2023
Cited by: 11; Year: 2023
Conditions for length generalization in learning reasoning skills
C Xiao, B Liu
arXiv preprint arXiv:2311.16173, 2023
Cited by: 7; Year: 2023
A theory for length generalization in learning to reason
C Xiao, B Liu
arXiv preprint arXiv:2404.00560, 2024
Cited by: 5; Year: 2024
An entropy regularization free mechanism for policy-based reinforcement learning
C Xiao, H Shi, J Fan, S Deng
arXiv preprint arXiv:2106.00707, 2021
Cited by: 5; Year: 2021
CASA: A bridge between gradient of policy improvement and policy evaluation
C Xiao, H Shi, J Fan, S Deng
CoRR abs/2105.03923, 2021. URL https://arxiv.org/abs/2105.03923
Cited by: 3; Year: 2021
ParliRobo: Participant lightweight ai robots for massively multiplayer online games (MMOGs)
J Zheng, C Xiao, M Li, Z Li, F Qian, W Liu, X Wu
Proceedings of the 31st ACM International Conference on Multimedia, 9093-9102, 2023
Cited by: 1; Year: 2023
Hierarchical meta reinforcement learning for multi-task environments
D Zhao, Y Huang, C Xiao, Y Li, S Deng
Cited by: 1; Year: 2021
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration
C Xiao, H Shi, J Fan, S Deng, H Yin
arXiv preprint arXiv:2105.03923, 2021
Year: 2021
Solving Continual Learning via Problem Decomposition
G Kim, C Xiao, T Konishi, Z Ke, B Liu