Machine learning for microcontroller-class hardware: A review
The advancements in machine learning (ML) opened a new opportunity to bring intelligence
to the low-end Internet-of-Things (IoT) nodes, such as microcontrollers. Conventional ML …
to the low-end Internet-of-Things (IoT) nodes, such as microcontrollers. Conventional ML …
Training uncertainty-aware classifiers with conformalized deep learning
Deep neural networks are powerful tools to detect hidden patterns in data and leverage
them to make predictions, but they are not designed to understand uncertainty and estimate …
them to make predictions, but they are not designed to understand uncertainty and estimate …
Monitored distillation for positive congruent depth completion
We propose a method to infer a dense depth map from a single image, its calibration, and
the associated sparse point cloud. In order to leverage existing models (teachers) that …
the associated sparse point cloud. In order to leverage existing models (teachers) that …
Good teachers explain: Explanation-enhanced knowledge distillation
Abstract Knowledge Distillation (KD) has proven effective for compressing large teacher
models into smaller student models. While it is well known that student models can achieve …
models into smaller student models. While it is well known that student models can achieve …
AI model disgorgement: Methods and choices
Over the past few years, machine learning models have significantly increased in size and
complexity, especially in the area of generative AI such as large language models. These …
complexity, especially in the area of generative AI such as large language models. These …
Neurecover: Regression-controlled repair of deep neural networks with training history
Systematic techniques to improve quality of deep neural networks (DNNs) are critical given
the increasing demand for practical applications including safety-critical ones. The key …
the increasing demand for practical applications including safety-critical ones. The key …
Backward-compatible prediction updates: A probabilistic approach
When machine learning systems meet real world applications, accuracy is only one of
several requirements. In this paper, we assay a complementary perspective originating from …
several requirements. In this paper, we assay a complementary perspective originating from …
Asymmetric feature fusion for image retrieval
In asymmetric retrieval systems, models with different capacities are deployed on platforms
with different computational and storage resources. Despite the great progress, existing …
with different computational and storage resources. Despite the great progress, existing …
MUSCLE: A Model Update Strategy for Compatible LLM Evolution
Large Language Models (LLMs) are regularly updated to enhance performance, typically
through changes in data or architecture. Within the update process, developers often …
through changes in data or architecture. Within the update process, developers often …
Towards regression-free neural networks for diverse compute platforms
Our work tackles the emergent problem of reducing predictive inconsistencies arising as
negative flips: test samples that are correctly predicted by a less accurate model, but …
negative flips: test samples that are correctly predicted by a less accurate model, but …