Dynamic GPU energy optimization for machine learning training workloads F Wang, W Zhang, S Lai, M Hao, Z Wang IEEE Transactions on Parallel and Distributed Systems 33 (11), 2943-2954, 2021 | 41 | 2021 |
Fine-Grained Powercap Allocation for Power-Constrained Systems Based on Multi-Objective Machine Learning M Hao, W Zhang, Y Wang, G Lu, F Wang, AV Vasilakos IEEE Transactions on Parallel and Distributed Systems 32 (7), 1789-1801, 2021 | 27 | 2021 |
Online power management for multi-cores: A reinforcement learning based approach Y Wang, W Zhang, M Hao, Z Wang IEEE Transactions on Parallel and Distributed Systems 33 (4), 751-764, 2021 | 27 | 2021 |
Predicting HPC parallel program performance based on LLVM compiler W Zhang, M Hao, M Snir Cluster Computing 20, 1179-1192, 2017 | 20 | 2017 |
Automatic generation of benchmarks for I/O-intensive parallel applications M Hao, W Zhang, Y Zhang, M Snir, LT Yang Journal of Parallel and Distributed Computing 124, 1-13, 2019 | 16 | 2019 |
Automatic translation of data parallel programs for heterogeneous parallelism through OpenMP offloading F Wang, W Zhang, H Guo, M Hao, G Lu, Z Wang The Journal of Supercomputing 77 (5), 4957-4987, 2021 | 6 | 2021 |
DRLCAP: Runtime GPU Frequency Capping With Deep Reinforcement Learning Y Wang, M Hao, H He, W Zhang, Q Tang, X Sun, Z Wang IEEE Transactions on Sustainable Computing 9 (5), 712-726, 2024 | 4 | 2024 |
Communication optimization for RDMA-based science data transmission tools W Zhang, M Hao, Z Xu The Journal of Supercomputing 72, 3312-3327, 2016 | 4 | 2016 |
An efficient personalized federated learning approach in heterogeneous environments: a reinforcement learning perspective H Yang, J Li, M Hao, W Zhang, H He, AK Sangaiah Scientific Reports 14 (1), 28877, 2024 | 2 | 2024 |
Model-free GPU online energy optimization F Wang, M Hao, W Zhang, Z Wang IEEE Transactions on Sustainable Computing 9 (2), 141-154, 2023 | 2 | 2023 |
Multi-parameter performance modeling based on machine learning with basic block features M Hao, W Zhang, Y Wang, D Li, W Xia, H Wang, C Lou 2019 IEEE Intl Conf on Parallel & Distributed Processing with Applications …, 2019 | 2 | 2019 |
Research of adaptive virtual machine memory scheduling algorithm in cloud computing environment W Desheng, Z Weizhe, HAO Meng, LU Gangzhao, BAI Enci Journal of Frontiers of Computer Science & Technology 11 (1), 70, 2017 | 2 | 2017 |
ProfPred: A compiler-level IR based performance prediction framework for MPI industrial applications M Hao, W Zhang, LT Yang 2019 IEEE 21st International Conference on High Performance Computing and …, 2019 | 1 | 2019 |
Dynamic Power Management Through Multi-agent Deep Reinforcement Learning for Heterogeneous Systems Y Wang, W Zhang, M Hao, W Kong, Y Wen ACM Transactions on Architecture and Code Optimization, 2025 | | 2025 |
Optimizing depthwise separable convolution on DCU Z Liu, M Hao, W Zhang, G Lu, X Tian, S Yang, M Xie, J Dai, C Yuan, ... CCF Transactions on High Performance Computing, 1-19, 2024 | | 2024 |
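A survey of GPU acceleration techniques for deep learning in privacy-preserving computing environments Z Qin, H Yang, M Hao, H He, W Zhang Journal of Information Security Research (信息安全研究) 10 (7), 586, 2024 | | 2024 |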
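A survey of research on hardware acceleration for zero-knowledge proofs M Xie, M Hao, H Yang, H He, W Zhang Journal of Information Security Research (信息安全研究) 10 (7), 594, 2024 | | 2024 |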
Transplantation and Optimization of Graph Matching Algorithm Based on Domestic DCU Heterogeneous Platform M Hao, X Tian, G Lu, Y Liu, W Zhang, H He Computer Science 51 (4), 67-77, 2024 | | 2024 |
SEPPDL: A Secure and Efficient Privacy-Preserving Deep Learning Inference Framework for Autonomous Driving W Bobo, H Yang, M Hao, J Zhang, H He, W Zhang ACM Transactions on Autonomous and Adaptive Systems | | |