Changnan Xiao

Навело

	Све	Од 2020
Наводи	243	243
h-индекс	8	8
i10-индекс	8	8

160

120

202120222023202420253 11 57 143 24

Јавни приступ

Прикажи све

3 чланка

0 чланака

доступно

није доступно

На основу услова финансирања

Прати

Changnan Xiao

Bytedance

Верификована је имејл адреса на bytedance.com

Machine Learning Reinforcement Learning


Наслов Сортирај по наводима Сортирај по години Сортирај по наслову	Навело Навело	Година
A theoretical study on solving continual learning G Kim, C Xiao, T Konishi, Z Ke, B Liu Advances in neural information processing systems 35, 5065-5079, 2022	89	2022
Learnability and algorithm for continual learning G Kim, C Xiao, T Konishi, B Liu International Conference on Machine Learning, 16877-16896, 2023	36	2023
Continual learning based on ood detection and task masking G Kim, S Esmaeilpour, C Xiao, B Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	29	2022
Open-world continual learning: Unifying novelty detection and continual learning G Kim, C Xiao, T Konishi, Z Ke, B Liu Artificial Intelligence 338, 104237, 2025	15	2025
Generalized data distribution iteration J Fan, C Xiao arXiv preprint arXiv:2206.03192, 2022	15	2022
Gdi: Rethinking what makes reinforcement learning different from supervised learning J Fan, C Xiao, Y Huang arXiv preprint arXiv:2106.06232, 2021	14	2021
Mastering strategy card game (Hearthstone) with improved techniques C Xiao, Y Zhang, X Huang, Q Huang, J Chen 2023 IEEE Conference on Games (CoG), 1-8, 2023	12	2023
Mastering strategy card game (legends of code and magic) via end-to-end policy and optimistic smooth fictitious play W Xi, Y Zhang, C Xiao, X Huang, S Deng, H Liang, J Chen, P Sun arXiv preprint arXiv:2303.04096, 2023	11	2023
Conditions for length generalization in learning reasoning skills C Xiao, B Liu arXiv preprint arXiv:2311.16173, 2023	7	2023
A theory for length generalization in learning to reason C Xiao, B Liu arXiv preprint arXiv:2404.00560, 2024	5	2024
An entropy regularization free mechanism for policy-based reinforcement learning C Xiao, H Shi, J Fan, S Deng arXiv preprint arXiv:2106.00707, 2021	5	2021
CASA: A bridge between gradient of policy improvement and policy evaluation C Xiao, H Shi, J Fan, S Deng CoRR, abs/2105.03923, 2021a. URL https://arxiv. org/abs/2105.03923, 2021	3	2021
ParliRobo: Participant lightweight ai robots for massively multiplayer online games (MMOGs) J Zheng, C Xiao, M Li, Z Li, F Qian, W Liu, X Wu Proceedings of the 31st ACM International Conference on Multimedia, 9093-9102, 2023	1	2023
Hierarchical meta reinforcement learning for multi-task environments D Zhao, Y Huang, C Xiao, Y Li, S Deng	1	2021
CASA: Bridging the Gap between Policy Improvement and Policy Evaluation with Conflict Averse Policy Iteration C Xiao, H Shi, J Fan, S Deng, H Yin arXiv preprint arXiv:2105.03923, 2021		2021
Solving Continual Learning via Problem Decomposition G Kim, C Xiao, T Konishi, Z Ke, B Liu

Систем тренутно не може да изврши ову радњу. Пробајте поново касније.

Чланци 1–16

Годишњи број навода

Дупли наводи

Обједињени наводи

Додавање коаутораКоаутори

Прати

Навело