关注
Weichao Mao
Weichao Mao
在 illinois.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Provably efficient reinforcement learning in decentralized general-sum Markov games
W Mao, T Başar
Dynamic Games and Applications 13 (1), 165-186, 2023
852023
On improving model-free algorithms for decentralized multi-agent reinforcement learning
W Mao, L Yang, K Zhang, T Basar
International Conference on Machine Learning, 15007-15049, 2022
78*2022
Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
W Mao, K Zhang, R Zhu, D Simchi-Levi, T Basar
International Conference on Machine Learning, 7447-7458, 2021
51*2021
Pricing for revenue maximization in IoT data markets: An information design perspective
W Mao, Z Zheng, F Wu
IEEE INFOCOM 2019-IEEE Conference on Computer Communications, 1837-1845, 2019
462019
{AWARE}: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud Systems
H Qiu, W Mao, C Wang, H Franke, A Youssef, ZT Kalbarczyk, T Başar, ...
2023 USENIX Annual Technical Conference (USENIX ATC 23), 387-402, 2023
362023
Reinforcement learning for resource management in multi-tenant serverless platforms
H Qiu, W Mao, A Patke, C Wang, H Franke, ZT Kalbarczyk, T Başar, ...
Proceedings of the 2nd European Workshop on Machine Learning and Systems, 20-28, 2022
272022
Information State Embedding in Partially Observable Cooperative Multi-Agent Reinforcement Learning
W Mao, K Zhang, E Miehling, T Başar
2020 59th IEEE Conference on Decision and Control (CDC), 6124-6131, 2020
272020
POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis
W Mao, K Zhang, Q Xie, T Başar
Advances in Neural Information Processing Systems 33, 2020
252020
A mean-field game approach to cloud resource management with function approximation
W Mao, H Qiu, C Wang, H Franke, Z Kalbarczyk, R Iyer, T Basar
Advances in Neural Information Processing Systems 35, 36243-36258, 2022
242022
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction
H Qiu, W Mao, A Patke, S Cui, S Jha, C Wang, H Franke, ZT Kalbarczyk, ...
arXiv preprint arXiv:2404.08509, 2024
222024
Model-Free Nonstationary Reinforcement Learning: Near-Optimal Regret and Applications in Multiagent Reinforcement Learning and Inventory Control
W Mao, K Zhang, R Zhu, D Simchi-Levi, T Başar
Management Science, 2024
20*2024
SIMPPO: a scalable and incremental online learning framework for serverless resource management
H Qiu, W Mao, A Patke, C Wang, H Franke, ZT Kalbarczyk, T Başar, ...
Proceedings of the 13th Symposium on Cloud Computing, 306-322, 2022
202022
Adjusting Matching Algorithm to Adapt to Workload Fluctuations in Content-based Publish/Subscribe Systems
S Qian, W Mao, J Cao, F Le Mouël, M Li
IEEE INFOCOM 2019-IEEE Conference on Computer Communications, 1936-1944, 2019
202019
Online Pricing for Revenue Maximization with Unknown Time Discounting Valuations.
W Mao, Z Zheng, F Wu, G Chen
IJCAI, 440-446, 2018
162018
Power-aware Deep Learning Model Serving with {μ-Serve}
H Qiu, W Mao, A Patke, S Cui, S Jha, C Wang, H Franke, Z Kalbarczyk, ...
2024 USENIX Annual Technical Conference (USENIX ATC 24), 75-93, 2024
112024
A fast and anti-matchability matching algorithm for content-based publish/subscribe systems
S Qian, J Cao, W Mao, Y Zhu, J Yu, M Li, J Wang
Computer Networks 149, 213-225, 2019
102019
Challenges and Opportunities in IoT Data Markets
Z Zheng, W Mao, F Wu, G Chen
Proceedings of the Fourth International Workshop on Social Sensing, 1-2, 2019
82019
Controlgym: Large-scale control environments for benchmarking reinforcement learning algorithms
X Zhang, W Mao, S Mowlavi, M Benosman, T Başar
6th Annual Learning for Dynamics & Control Conference, 181-196, 2024
7*2024
Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks
W Mao, R Desai, ML Iuzzolino, N Kamra
arXiv preprint arXiv:2302.05330, 2023
62023
FLASH: Fast Model Adaptation in ML-Centric Cloud Platforms
H Qiu, W Mao, A Patke, S Cui, C Wang, H Franke, Z Kalbarczyk, T Basar, ...
Proceedings of Machine Learning and Systems 6, 524-544, 2024
52024
系统目前无法执行此操作,请稍后再试。
文章 1–20