How to certify machine learning based safety-critical systems? A systematic literature review

F Tambon, G Laberge, L An, A Nikanjam… - Automated Software …, 2022 - Springer
Abstract Context Machine Learning (ML) has been at the heart of many innovations over the
past years. However, including it in so-called “safety-critical” systems such as automotive or …

Trajectory-Prediction Techniques for Unmanned Aerial Vehicles (UAVs): A Comprehensive Survey

P Shukla, S Shukla, AK Singh - IEEE Communications Surveys …, 2024 - ieeexplore.ieee.org
Unmanned Aerial Vehicles (UAVs) have witnessed remarkable significance in diverse
sectors, ranging from environmental monitoring, infrastructure inspection, disaster response …

Online reinforcement learning for a continuous space system with experimental validation

O Dogru, N Wieczorek, K Velswamy, F Ibrahim… - Journal of Process …, 2021 - Elsevier
Reinforcement learning (RL) for continuous state/action space systems has remained a
challenge for nonlinear multivariate dynamical systems even at a simulation level …

Safe adaptive learning-based control for constrained linear quadratic regulators with regret guarantees

Y Li, S Das, J Shamma, N Li - arxiv preprint arxiv:2111.00411, 2021 - arxiv.org
We study the adaptive control of an unknown linear system with a quadratic cost function
subject to safety constraints on both the states and actions. The challenges of this problem …

Online adaptive policy selection in time-varying systems: No-regret via contractive perturbations

Y Lin, JA Preiss, E Anand, Y Li… - Advances in Neural …, 2024 - proceedings.neurips.cc
We study online adaptive policy selection in systems with time-varying costs and dynamics.
We develop the Gradient-based Adaptive Policy Selection (GAPS) algorithm together with a …

Bayesian inference for data-efficient, explainable, and safe robotic motion planning: A review

C Zhou, C Wang, H Hassan, H Shah, B Huang… - arxiv preprint arxiv …, 2023 - arxiv.org
Bayesian inference has many advantages in robotic motion planning over four perspectives:
The uncertainty quantification of the policy, safety (risk-aware) and optimum guarantees of …

KPC: Learning-based model predictive control with deterministic guarantees

ET Maddalena, P Scharnhorst… - … for Dynamics and …, 2021 - proceedings.mlr.press
Abstract We propose Kernel Predictive Control (KPC), a learning-based predictive control
strategy that enjoys deterministic guarantees of safety. Noise-corrupted samples of the …

Uncertain-aware safe exploratory planning using Gaussian process and neural control contraction metric

D Sun, MJ Khojasteh, S Shekhar… - Learning for Dynamics …, 2021 - proceedings.mlr.press
Robots operating in unstructured, complex, and changing real-world environments should
navigate and maintain safety while collecting data about its environment and updating its …

Non-asymptotic system identification for linear systems with nonlinear policies

Y Li, T Zhang, S Das, J Shamma, N Li - IFAC-PapersOnLine, 2023 - Elsevier
This paper considers a single-trajectory system identification problem for linear systems
under general nonlinear and/or time-varying policies with iid random excitation noises. The …

Regret analysis of learning-based mpc with partially-unknown cost function

I Dogan, ZJM Shen, A Aswani - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org
The exploration–exploitation tradeoff is an inherent challenge in data-driven adaptive
control. Though this tradeoff has been studied for multiarmed bandits (MABs) and …