Turnitin
降AI改写
早检测系统
早降重系统
Turnitin-UK版
万方检测-期刊版
维普编辑部版
Grammarly检测
Paperpass检测
checkpass检测
PaperYY检测
Human-in-the-loop machine learning: a state of the art
Researchers are defining new types of interactions between humans and machine learning
algorithms generically called human-in-the-loop machine learning. Depending on who is in …
algorithms generically called human-in-the-loop machine learning. Depending on who is in …
Long-term personalization of an in-home socially assistive robot for children with autism spectrum disorders
Socially assistive robots (SAR) have shown great potential to augment the social and
educational development of children with autism spectrum disorders (ASD). As SAR …
educational development of children with autism spectrum disorders (ASD). As SAR …
Communicative learning: A unified learning formalism
L Yuan, SC Zhu - Engineering, 2023 - Elsevier
In this article, we propose a communicative learning (CL) formalism that unifies existing
machine learning paradigms, such as passive learning, active learning, algorithmic …
machine learning paradigms, such as passive learning, active learning, algorithmic …
Interactive teaching algorithms for inverse reinforcement learning
We study the problem of inverse reinforcement learning (IRL) with the added twist that the
learner is assisted by a helpful teacher. More formally, we tackle the following algorithmic …
learner is assisted by a helpful teacher. More formally, we tackle the following algorithmic …
Nonparametric iterative machine teaching
In this paper, we consider the problem of Iterative Machine Teaching (IMT), where the
teacher provides examples to the learner iteratively such that the learner can achieve fast …
teacher provides examples to the learner iteratively such that the learner can achieve fast …
Reward poisoning in reinforcement learning: Attacks against unknown learners in unknown environments
We study black-box reward poisoning attacks against reinforcement learning (RL), in which
an adversary aims to manipulate the rewards to mislead a sequence of RL agents with …
an adversary aims to manipulate the rewards to mislead a sequence of RL agents with …
Teaching inverse reinforcement learners via features and demonstrations
L Haug, S Tschiatschek… - Advances in Neural …, 2018 - proceedings.neurips.cc
Learning near-optimal behaviour from an expert's demonstrations typically relies on the
assumption that the learner knows the features that the true reward function depends on. In …
assumption that the learner knows the features that the true reward function depends on. In …
Nonparametric teaching for multiple learners
We study the problem of teaching multiple learners simultaneously in the nonparametric
iterative teaching setting, where the teacher iteratively provides examples to the learner for …
iterative teaching setting, where the teacher iteratively provides examples to the learner for …
Learner-aware teaching: Inverse reinforcement learning with preferences and constraints
Inverse reinforcement learning (IRL) enables an agent to learn complex behavior by
observing demonstrations from a (near-) optimal policy. The typical assumption is that the …
observing demonstrations from a (near-) optimal policy. The typical assumption is that the …
Locality sensitive teaching
The emergence of the Internet-of-Things (IoT) sheds light on applying the machine teaching
(MT) algorithms for online personalized education on home devices. This direction becomes …
(MT) algorithms for online personalized education on home devices. This direction becomes …