A Dual-Agent Scheduler for Distributed Deep Learning Jobs on Public Cloud via Reinforcement Learning

M **ng, H Mao, S Yin, L Pan, Z Zhang, Z **ao… - Proceedings of the 29th …, 2023 - dl.acm.org
Public cloud GPU clusters are becoming emerging platforms for training distributed deep
learning jobs. Under this training paradigm, the job scheduler is a crucial component to …

Adversarial Distillation Based on Slack Matching and Attribution Region Alignment

S Yin, Z **ao, M Song, J Long - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Adversarial distillation (AD) is a highly effective method for enhancing the robustness of
small models. Contrary to expectations a high-performing teacher model does not always …

Embracing Adaptation: An Effective Dynamic Defense Strategy Against Adversarial Examples

S Yin, K Yao, Z **ao, J Long - … of the 32nd ACM International Conference …, 2024 - dl.acm.org
Existing adversarial example defense methods are static, meaning they remain unchanged
once training is completed, regardless of how attack methods change. Consequently, static …

SPRING: Improving the Throughput of Sharding Blockchain via Deep Reinforcement Learning Based State Placement

P Li, M Song, M **ng, Z **ao, Q Ding, S Guan… - Proceedings of the ACM …, 2024 - dl.acm.org
Sharding provides an opportunity to overcome the inherent scalability challenges of the
blockchain, which is the infrastructure for the next generation of the Web. In a sharding …

[HTML][HTML] Optimization of High-Performance Computing Job Scheduling Based on Offline Reinforcement Learning

S Li, W Dai, Y Chen, B Liang - Applied Sciences, 2024 - mdpi.com
In large-scale, distributed high-performance computing systems, the increasing complexity
of job scheduling has expanded along with the growth of computational resources and job …

A Spatio-Temporal Diffusion Model for Missing and Real-Time Financial Data Inference

Y Fang, R Liu, H Huang, P Zhao, Q Wu - Proceedings of the 33rd ACM …, 2024 - dl.acm.org
Missing values and unreleased figures are common but highly important for backtesting and
real-time analysis in the financial industry, yet underexploited in the existing literature. In this …

Optimizing communication in deep reinforcement learning with **ngTian

L Pan, J Qian, W **a, H Mao, J Yao, P Li… - Proceedings of the 23rd …, 2022 - dl.acm.org
Deep Reinforcement Learning (DRL) achieves great success in various domains.
Communication in today's DRL algorithms takes non-negligible time compared to the …