Kaixin Wang
Microsoft Research
Verified email at microsoft.com
Title
Cited by
Year
PANet: Few-shot image semantic segmentation with prototype alignment
K Wang, JH Liew, Y Zou, D Zhou, J Feng
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 1435, Year: 2019
Understanding and Resolving Performance Degradation in Deep Graph Convolutional Networks
K Zhou, Y Dong, K Wang, WS Lee, B Hooi, H Xu, J Feng
Proceedings of the 30th ACM International Conference on Information …, 2021
Cited by: 147*, Year: 2021
Improving generalization in reinforcement learning with mixture regularization
K Wang, B Kang, J Shao, J Feng
Advances in Neural Information Processing Systems 33, 7968-7978, 2020
Cited by: 143, Year: 2020
Towards Better Laplacian Representation in Reinforcement Learning with Generalized Graph Drawing
K Wang, K Zhou, Q Zhang, J Shao, B Hooi, J Feng
International Conference on Machine Learning, 11003-11012, 2021
Cited by: 23, Year: 2021
Efficient Value Iteration for s-rectangular Robust Markov Decision Processes
N Kumar, K Wang, KY Levy, S Mannor
Forty-first International Conference on Machine Learning
Cited by: 22*
Neural epitome search for architecture-agnostic network compression
D Zhou, X Jin, Q Hou, K Wang, J Yang, J Feng
arXiv preprint arXiv:1907.05642, 2019
Cited by: 19*, Year: 2019
Revisiting Intrinsic Reward for Exploration in Procedurally Generated Environments
K Wang, K Zhou, B Kang, J Feng, S Yan
The Eleventh International Conference on Learning Representations
Cited by: 15*
Relational reasoning via set transformers: Provable efficiency and applications to MARL
F Zhang, B Liu, K Wang, V Tan, Z Yang, Z Wang
Advances in Neural Information Processing Systems 35, 35825-35838, 2022
Cited by: 11, Year: 2022
How Far is Video Generation from World Model: A Physical Law Perspective
B Kang, Y Yue, R Lu, Z Lin, Y Zhao, K Wang, G Huang, J Feng
arXiv preprint arXiv:2411.02385, 2024
Cited by: 8, Year: 2024
The geometry of robust value functions
K Wang, N Kumar, K Zhou, B Hooi, J Feng, S Mannor
International Conference on Machine Learning, 22727-22751, 2022
Cited by: 7, Year: 2022
Policy Gradient for Reinforcement Learning with General Utilities
N Kumar, K Wang, K Levy, S Mannor
arXiv preprint arXiv:2210.00991, 2022
Cited by: 3, Year: 2022
Improving Token-Based World Models with Parallel Observation Prediction
L Cohen, K Wang, B Kang, S Mannor
arXiv preprint arXiv:2402.05643, 2024
Cited by: 2, Year: 2024
Jointly Modelling Uncertainty and Diversity for Active Molecular Property Prediction
K Zhou, K Wang, J Tang, J Feng, B Hooi, P Zhao, T Xu, X Wang
Learning on Graphs Conference, 29:1-29:21, 2022
Cited by: 2, Year: 2022
Reachability-Aware Laplacian Representation in Reinforcement Learning
K Wang, K Zhou, J Feng, B Hooi, X Wang
arXiv preprint arXiv:2210.13153, 2022
Cited by: 2, Year: 2022
Tyger: Task-Type-Generic Active Learning for Molecular Property Prediction
K Zhou, K Wang, J Feng, J Tang, T Xu, X Wang
arXiv preprint arXiv:2205.11279, 2022
Cited by: 2, Year: 2022
Bring Your Own (Non-Robust) Algorithm to Solve Robust MDPs by Estimating The Worst Kernel
U Gadot, K Wang, N Kumar, KY Levy, S Mannor
Forty-first International Conference on Machine Learning
Cited by: 2*
PPG reloaded: an empirical study on what matters in phasic policy gradient
K Wang, D Zhou, J Feng, S Mannor
Cited by: 1, Year: 2023
Q-Learning for Lp Robust Markov Decision Processes
N Kumar, K Wang, K Levy, S Mannor
Year: 2022
Implicit Curriculum in Procgen Made Explicit
Z Tan, K Wang, X Wang
The Thirty-eighth Annual Conference on Neural Information Processing Systems
Learning the Uncertainty Set in Robust Markov Decision Process
N Kumar, K Wang, U Gadot, KY Levy, S Mannor
The Second Tiny Papers Track at ICLR 2024