Adapting neural networks at runtime: Current trends in at-runtime optimizations for deep learning

M Sponner, B Waschneck, A Kumar - ACM Computing Surveys, 2024 - dl.acm.org
Adaptive optimization methods for deep learning adjust the inference task to the current
circumstances at runtime to improve the resource footprint while maintaining the model's …

Efficient spatially sparse inference for conditional gans and diffusion models

M Li, J Lin, C Meng, S Ermon… - Advances in neural …, 2022 - proceedings.neurips.cc
During image editing, existing deep generative models tend to re-synthesize the entire
output from scratch, including the unedited regions. This leads to a significant waste of …

Distrifusion: Distributed parallel inference for high-resolution diffusion models

M Li, T Cai, J Cao, Q Zhang, H Cai… - Proceedings of the …, 2024 - openaccess.thecvf.com
Diffusion models have achieved great success in synthesizing high-quality images.
However generating high-resolution images with diffusion models is still challenging due to …

Adaframe: Adaptive frame selection for fast video recognition

Z Wu, C **ong, CY Ma, R Socher… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present AdaFrame, a framework that adaptively selects relevant frames on a per-input
basis for fast video recognition. AdaFrame contains a Long Short-Term Memory network …

Efficient visual recognition: A survey on recent advances and brain-inspired methodologies

Y Wu, DH Wang, XT Lu, F Yang, M Yao… - Machine Intelligence …, 2022 - Springer
Visual recognition is currently one of the most important and active research areas in
computer vision, pattern recognition, and even the general field of artificial intelligence. It …

Torchsparse++: Efficient training and inference framework for sparse convolution on gpus

H Tang, S Yang, Z Liu, K Hong, Z Yu, X Li… - Proceedings of the 56th …, 2023 - dl.acm.org
Sparse convolution plays a pivotal role in emerging workloads, including point cloud
processing in AR/VR, autonomous driving, and graph understanding in recommendation …

Deltacnn: End-to-end cnn inference of sparse frame differences in videos

M Parger, C Tang, CD Twigg… - Proceedings of the …, 2022 - openaccess.thecvf.com
Convolutional neural network inference on video data requires powerful hardware for real-
time processing. Given the inherent coherence across consecutive frames, large parts of a …

A dynamic frame selection framework for fast video recognition

Z Wu, H Li, C **ong, YG Jiang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
We introduce AdaFrame, a conditional computation framework that adaptively selects
relevant frames on a per-input basis for fast video recognition. AdaFrame, which contains a …

[PDF][PDF] Extd: Extremely tiny face detector via iterative filter reuse

YJ Yoo, D Han, S Yun - arxiv preprint arxiv:1906.06579, 2019 - researchgate.net
In this paper, we propose a new multi-scale face detector having an extremely tiny number
of parameters (EXTD), less than 0.1 million, as well as achieving comparable performance …

LTC-SUM: Lightweight client-driven personalized video summarization framework using 2D CNN

G Mujtaba, A Malik, ES Ryu - IEEE access, 2022 - ieeexplore.ieee.org
This paper proposes a novel lightweight thumbnail container-based summarization (LTC-
SUM) framework for full feature-length videos. This framework generates a personalized …