Machine learning for microcontroller-class hardware: A review

SS Saha, SS Sandha, M Srivastava - IEEE Sensors Journal, 2022 - ieeexplore.ieee.org
The advancements in machine learning (ML) opened a new opportunity to bring intelligence
to the low-end Internet-of-Things (IoT) nodes, such as microcontrollers. Conventional ML …

Training uncertainty-aware classifiers with conformalized deep learning

BS Einbinder, Y Romano, M Sesia… - Advances in Neural …, 2022 - proceedings.neurips.cc
Deep neural networks are powerful tools to detect hidden patterns in data and leverage
them to make predictions, but they are not designed to understand uncertainty and estimate …

Monitored distillation for positive congruent depth completion

TY Liu, P Agrawal, A Chen, BW Hong… - European Conference on …, 2022 - Springer
We propose a method to infer a dense depth map from a single image, its calibration, and
the associated sparse point cloud. In order to leverage existing models (teachers) that …

Good teachers explain: Explanation-enhanced knowledge distillation

A Parchami-Araghi, M Böhle, S Rao… - European Conference on …, 2024 - Springer
Abstract Knowledge Distillation (KD) has proven effective for compressing large teacher
models into smaller student models. While it is well known that student models can achieve …

AI model disgorgement: Methods and choices

A Achille, M Kearns, C Klingenberg… - Proceedings of the …, 2024 - National Acad Sciences
Over the past few years, machine learning models have significantly increased in size and
complexity, especially in the area of generative AI such as large language models. These …

Neurecover: Regression-controlled repair of deep neural networks with training history

S Tokui, S Tokumoto, A Yoshii… - … on Software Analysis …, 2022 - ieeexplore.ieee.org
Systematic techniques to improve quality of deep neural networks (DNNs) are critical given
the increasing demand for practical applications including safety-critical ones. The key …

Backward-compatible prediction updates: A probabilistic approach

F Träuble, J Von Kügelgen… - Advances in …, 2021 - proceedings.neurips.cc
When machine learning systems meet real world applications, accuracy is only one of
several requirements. In this paper, we assay a complementary perspective originating from …

Asymmetric feature fusion for image retrieval

H Wu, M Wang, W Zhou, Z Lu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
In asymmetric retrieval systems, models with different capacities are deployed on platforms
with different computational and storage resources. Despite the great progress, existing …

MUSCLE: A Model Update Strategy for Compatible LLM Evolution

J Echterhoff, F Faghri, R Vemulapalli, TY Hu… - arxiv preprint arxiv …, 2024 - arxiv.org
Large Language Models (LLMs) are regularly updated to enhance performance, typically
through changes in data or architecture. Within the update process, developers often …

Towards regression-free neural networks for diverse compute platforms

R Duggal, H Zhou, S Yang, J Fang, Y **ong… - European Conference on …, 2022 - Springer
Our work tackles the emergent problem of reducing predictive inconsistencies arising as
negative flips: test samples that are correctly predicted by a less accurate model, but …