A Dual-Agent Scheduler for Distributed Deep Learning Jobs on Public Cloud via Reinforcement Learning
Public cloud GPU clusters are becoming emerging platforms for training distributed deep
learning jobs. Under this training paradigm, the job scheduler is a crucial component to …
learning jobs. Under this training paradigm, the job scheduler is a crucial component to …
Adversarial Distillation Based on Slack Matching and Attribution Region Alignment
Adversarial distillation (AD) is a highly effective method for enhancing the robustness of
small models. Contrary to expectations a high-performing teacher model does not always …
small models. Contrary to expectations a high-performing teacher model does not always …
Embracing Adaptation: An Effective Dynamic Defense Strategy Against Adversarial Examples
Existing adversarial example defense methods are static, meaning they remain unchanged
once training is completed, regardless of how attack methods change. Consequently, static …
once training is completed, regardless of how attack methods change. Consequently, static …
SPRING: Improving the Throughput of Sharding Blockchain via Deep Reinforcement Learning Based State Placement
Sharding provides an opportunity to overcome the inherent scalability challenges of the
blockchain, which is the infrastructure for the next generation of the Web. In a sharding …
blockchain, which is the infrastructure for the next generation of the Web. In a sharding …
[HTML][HTML] Optimization of High-Performance Computing Job Scheduling Based on Offline Reinforcement Learning
S Li, W Dai, Y Chen, B Liang - Applied Sciences, 2024 - mdpi.com
In large-scale, distributed high-performance computing systems, the increasing complexity
of job scheduling has expanded along with the growth of computational resources and job …
of job scheduling has expanded along with the growth of computational resources and job …
A Spatio-Temporal Diffusion Model for Missing and Real-Time Financial Data Inference
Missing values and unreleased figures are common but highly important for backtesting and
real-time analysis in the financial industry, yet underexploited in the existing literature. In this …
real-time analysis in the financial industry, yet underexploited in the existing literature. In this …
Optimizing communication in deep reinforcement learning with **ngTian
Deep Reinforcement Learning (DRL) achieves great success in various domains.
Communication in today's DRL algorithms takes non-negligible time compared to the …
Communication in today's DRL algorithms takes non-negligible time compared to the …