UNICO: Unified Hardware Software Co-Optimization for Robust Neural Network Acceleration

B Rashidi, C Gao, S Lu, Z Wang, C Zhou… - Proceedings of the 56th …, 2023 - dl.acm.org
Specialized hardware has become an indispensable component to deep neural network
(DNN) acceleration. To keep up with the rapid evolution of neural networks, holistic and …

Learning to Schedule Online Tasks with Bandit Feedback

Y Xu, S Wang, H Guo, X Liu, Z Shao - arXiv preprint arXiv:2402.16463, 2024 - arxiv.org
Online task scheduling serves an integral role for task-intensive applications in cloud
computing and crowdsourcing. Optimal scheduling can enhance system performance …

HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks

Z Zhang, B He, Z Zhang - … of the 51st International Conference on …, 2022 - dl.acm.org
To efficiently perform inference with neural networks, the underlying tensor programs require
sufficient tuning efforts before being deployed into production environments. Usually …

Exploring Compiler Optimization: A Survey of ML, DL and RL Techniques

C Mithul, DM Abdulla, MH Virinchi… - 2024 8th …, 2024 - ieeexplore.ieee.org
The past few years, traditional compiler optimization methods have been found to be further
enhanced by machine learning (ML), deep learning (DL) and reinforcement learning (RL) …

A Memory-Bounded Best-First Beam Search and Its Application to Scheduling Halide Programs

C Gao, J Chen, T Mo, T Sajed, S Jui, M Qin… - Proceedings of the …, 2022 - ojs.aaai.org
Beam search is a popular algorithm for solving real-world problems, especially where
search space is an enormously large tree but real-time solutions are most preferred. We …

UNICO: Efficient Unified Hardware-Software Co-Optimization For Deep Neural Networks

B Rashidi, C Gao, S Lu, W Zhisheng, L Wei, S JUI… - 2022 - openreview.net
Specialized hardware has become an indispensable component to deep neural network
acceleration. To keep up with the rapid evolution of neural networks, recently, holistic and …

A domain-extensible compiler with controllable automation of optimisations

T Koehler - arXiv preprint arXiv:2212.12035, 2022 - arxiv.org
In high performance domains like image processing, physics simulation or machine
learning, program performance is critical. Programmers called performance engineers are …