Volgen
Xiangyu Zhu
Xiangyu Zhu
Institute for AI Industry Research, Tsinghua University
Geverifieerd e-mailadres voor air.tsinghua.edu.cn
Titel
Geciteerd door
Geciteerd door
Jaar
Constraints penalized q-learning for safe offline reinforcement learning
H Xu, X Zhan, X Zhu
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 8753-8760, 2022
922022
Deepthermal: Combustion optimization for thermal power generating units using offline reinforcement learning
X Zhan, H Xu, Y Zhang, Y Huo, X Zhu, H Yin, Y Zheng
Proceedings of the AAAI Conference on Artificial Intelligence 36 (4), 4680-4688, 2022
792022
Model-based offline planning with trajectory pruning
X Zhan, X Zhu, H Xu
Proceedings of the Thirty-First International Joint Conference on Artificial …, 2022
332022
When data geometry meets deep function: Generalizing offline reinforcement learning
J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang
The Eleventh International Conference on Learning Representations, 2023
252023
Three-layer graph framework with the sumD feature for alpha matting
C Li, P Wang, X Zhu, H Pi
Computer Vision and Image Understanding 162, 34-45, 2017
252017
ECoalVis: visual analysis of control strategies in coal-fired power plants
S Liu, D Weng, Y Tian, Z Deng, H Xu, X Zhu, H Yin, X Zhan, Y Wu
IEEE transactions on visualization and computer graphics 29 (1), 1091-1101, 2022
142022
Distance-sensitive offline reinforcement learning
J Li, X Zhan, H Xu, X Zhu, J Liu, YQ Zhang
arXiv preprint arXiv:2205.11027 3, 2022
142022
Adaptive propagation matting based on transparency of image
X Zhu, P Wang, Z Huang
Multimedia Tools and Applications 77, 19089-19112, 2018
72018
H2O+: an improved framework for hybrid offline-and-online RL with dynamics gaps
H Niu, T Ji, B Liu, H Zhao, X Zhu, J Zheng, P Huang, G Zhou, J Hu, X Zhan
arXiv preprint arXiv:2309.12716, 2023
42023
TESLA: Thermally Safe, Load-Aware, and Energy-Efficient Cooling Control System for Data Centers
H Geng, Y Sun, Y Li, J Leng, X Zhu, X Zhan, Y Li, F Zhao, Y Liu
Proceedings of the 53rd International Conference on Parallel Processing, 939-949, 2024
22024
Data Center Cooling System Optimization Using Offline Reinforcement Learning
X Zhan, X Zhu, P Cheng, X Hu, Z He, H Geng, J Leng, H Zheng, C Liu, ...
arXiv preprint arXiv:2501.15085, 2025
2025
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–11