UNICO: Unified Hardware Software Co-Optimization for Robust Neural Network Acceleration
Specialized hardware has become an indispensable component to deep neural network
(DNN) acceleration. To keep up with the rapid evolution of neural networks, holistic and …
Learning to Schedule Online Tasks with Bandit Feedback
Online task scheduling serves an integral role for task-intensive applications in cloud
computing and crowdsourcing. Optimal scheduling can enhance system performance …
HARL: Hierarchical Adaptive Reinforcement Learning Based Auto Scheduler for Neural Networks
To efficiently perform inference with neural networks, the underlying tensor programs require
sufficient tuning efforts before being deployed into production environments. Usually …
Exploring Compiler Optimization: A Survey of ML, DL and RL Techniques
C Mithul, DM Abdulla, MH Virinchi… - 2024 8th …, 2024 - ieeexplore.ieee.org
Over the past few years, traditional compiler optimization methods have been found to be further
enhanced by machine learning (ML), deep learning (DL) and reinforcement learning (RL) …
A Memory-Bounded Best-First Beam Search and Its Application to Scheduling Halide Programs
Beam search is a popular algorithm for solving real-world problems---especially where
search space is an enormously large tree but real-time solutions are most preferred. We …
UNICO: Efficient Unified Hardware-Software Co-Optimization For Deep Neural Networks
B Rashidi, C Gao, S Lu, W Zhisheng, L Wei, S JUI… - 2022 - openreview.net
Specialized hardware has become an indispensable component to deep neural network
acceleration. To keep up with the rapid evolution of neural networks, recently, holistic and …
A domain-extensible compiler with controllable automation of optimisations
T Koehler - arXiv preprint arXiv:2212.12035, 2022 - arxiv.org
In high performance domains like image processing, physics simulation or machine
learning, program performance is critical. Programmers called performance engineers are …