A survey of resource-efficient llm and multimodal foundation models

M Xu, W Yin, D Cai, R Yi, D Xu, Q Wang, B Wu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large foundation models, including large language models (LLMs), vision transformers
(ViTs), diffusion, and LLM-based multimodal models, are revolutionizing the entire machine …

Melon: Breaking the memory wall for resource-efficient on-device machine learning

Q Wang, M Xu, C **, X Dong, J Yuan, X **… - Proceedings of the 20th …, 2022 - dl.acm.org
On-device learning is a promising technique for emerging privacy-preserving machine
learning paradigms. However, through quantitative experiments, we find that commodity …

Mandheling: Mixed-precision on-device dnn training with dsp offloading

D Xu, M Xu, Q Wang, S Wang, Y Ma, K Huang… - Proceedings of the 28th …, 2022 - dl.acm.org
This paper proposes Mandheling, the first system that enables highly resource-efficient on-
device training by orchestrating mixed-precision training with on-chip Digital Signal …

A comprehensive deep learning library benchmark and optimal library selection

Q Zhang, X Che, Y Chen, X Ma, M Xu… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Deploying deep learning (DL) on mobile devices has been a notable trend in recent years.
To support fast inference of on-device DL, DL libraries play a critical role as algorithms and …

[PDF][PDF] Mobile Foundation Model as Firmware

J Yuan, C Yang, D Cai, S Wang, X Yuan… - arxiv preprint arxiv …, 2023 - caidongqi.com
In today's landscape, smartphones have evolved into hubs for hosting a multitude of deep
learning models aimed at local execution. A key realization driving this work is the notable …

A probabilistic approach to blood glucose prediction in type 1 diabetes under meal uncertainties

S Langarica, M Rodriguez-Fernandez… - IEEE Journal of …, 2023 - ieeexplore.ieee.org
Currently, most reliable and commercialized artificial pancreas systems for type 1 diabetes
are hybrid closed-loop systems, which require the user to announce every meal and its size …

On-device training: A first overview on existing systems

S Zhu, T Voigt, F Rahimian, J Ko - ACM Transactions on Sensor …, 2024 - dl.acm.org
The recent breakthroughs in machine learning (ML) and deep learning (DL) have catalyzed
the design and development of various intelligent systems over wide application domains …

RoboCA3T: A Robot‐Inspired Computer‐Assisted adaptive autism therapy for improving joint attention and imitation skills through learning and computing …

Z Zahid, S Ali, S Shariq, Y Ayaz… - Journal of Computer …, 2024 - Wiley Online Library
Background This study presents a Robot‐Inspired Computer‐Assisted Adaptive Autism
Therapy (RoboCA3T) focusing on improving joint attention and imitation skills of children …

Federated neural architecture search

J Yuan, M Xu, Y Zhao, K Bian, G Huang, X Liu… - arxiv preprint arxiv …, 2020 - arxiv.org
To preserve user privacy while enabling mobile intelligence, techniques have been
proposed to train deep neural networks on decentralized data. However, training over …

Anatomizing Deep Learning Inference in Web Browsers

Q Wang, S Jiang, Z Chen, X Cao, Y Li, A Li… - ACM Transactions on …, 2024 - dl.acm.org
Web applications have increasingly adopted Deep Learning (DL) through in-browser
inference, wherein DL inference performs directly within Web browsers. The actual …