DVFS-Aware DNN Inference on GPUs: Latency Modeling and Performance Analysis

Y Han, Z Nan, S Zhou, Z Niu - arXiv preprint arXiv:2502.06295, 2025 - arxiv.org
The rapid development of deep neural networks (DNNs) is inherently accompanied by the
problem of high computational costs. To tackle this challenge, dynamic voltage frequency …