Human-in-the-loop machine learning: a state of the art

E Mosqueira-Rey, E Hernández-Pereira… - Artificial Intelligence …, 2023 - Springer
Researchers are defining new types of interactions between humans and machine learning
algorithms generically called human-in-the-loop machine learning. Depending on who is in …

Long-term personalization of an in-home socially assistive robot for children with autism spectrum disorders

C Clabaugh, K Mahajan, S Jain, R Pakkar… - Frontiers in Robotics …, 2019 - frontiersin.org
Socially assistive robots (SAR) have shown great potential to augment the social and
educational development of children with autism spectrum disorders (ASD). As SAR …

Communicative learning: A unified learning formalism

L Yuan, SC Zhu - Engineering, 2023 - Elsevier
In this article, we propose a communicative learning (CL) formalism that unifies existing
machine learning paradigms, such as passive learning, active learning, algorithmic …

Interactive teaching algorithms for inverse reinforcement learning

P Kamalaruban, R Devidze, V Cevher… - arxiv preprint arxiv …, 2019 - arxiv.org
We study the problem of inverse reinforcement learning (IRL) with the added twist that the
learner is assisted by a helpful teacher. More formally, we tackle the following algorithmic …

Nonparametric iterative machine teaching

C Zhang, X Cao, W Liu, I Tsang… - … Conference on Machine …, 2023 - proceedings.mlr.press
In this paper, we consider the problem of Iterative Machine Teaching (IMT), where the
teacher provides examples to the learner iteratively such that the learner can achieve fast …

Reward poisoning in reinforcement learning: Attacks against unknown learners in unknown environments

A Rakhsha, X Zhang, X Zhu, A Singla - arxiv preprint arxiv:2102.08492, 2021 - arxiv.org
We study black-box reward poisoning attacks against reinforcement learning (RL), in which
an adversary aims to manipulate the rewards to mislead a sequence of RL agents with …

Teaching inverse reinforcement learners via features and demonstrations

L Haug, S Tschiatschek… - Advances in Neural …, 2018 - proceedings.neurips.cc
Learning near-optimal behaviour from an expert's demonstrations typically relies on the
assumption that the learner knows the features that the true reward function depends on. In …

Nonparametric teaching for multiple learners

C Zhang, X Cao, W Liu, I Tsang… - Advances in Neural …, 2023 - proceedings.neurips.cc
We study the problem of teaching multiple learners simultaneously in the nonparametric
iterative teaching setting, where the teacher iteratively provides examples to the learner for …

Learner-aware teaching: Inverse reinforcement learning with preferences and constraints

S Tschiatschek, A Ghosh, L Haug… - Advances in neural …, 2019 - proceedings.neurips.cc
Inverse reinforcement learning (IRL) enables an agent to learn complex behavior by
observing demonstrations from a (near-) optimal policy. The typical assumption is that the …

Locality sensitive teaching

Z Xu, B Chen, C Li, W Liu, L Song… - Advances in …, 2021 - proceedings.neurips.cc
The emergence of the Internet-of-Things (IoT) sheds light on applying the machine teaching
(MT) algorithms for online personalized education on home devices. This direction becomes …