Weight pruning is an effective model compression technique to tackle the challenges of
achieving real-time deep neural network (DNN) inference on mobile devices. However, prior …
achieving real-time deep neural network (DNN) inference on mobile devices. However, prior …