Reinforcement Learning in Education: A Literature Review

B Fahad Mon, A Wasfi, M Hayajneh, A Slim, N Abu Ali - Informatics, 2023 - mdpi.com
The utilization of reinforcement learning (RL) within the field of education holds the potential
to bring about a significant shift in the way students approach and engage with learning and …

Reinforcement learning for education: Opportunities and challenges

A Singla, AN Rafferty, G Radanovic… - arxiv preprint arxiv …, 2021 - arxiv.org
This survey article has grown out of the RL4ED workshop organized by the authors at the
Educational Data Mining (EDM) 2021 conference. We organized this workshop as part of a …

Inference for batched bandits

K Zhang, L Janson, S Murphy - Advances in neural …, 2020 - proceedings.neurips.cc
As bandit algorithms are increasingly utilized in scientific studies and industrial applications,
there is an associated increasing need for reliable inference methods based on the resulting …

Statistical inference with m-estimators on adaptively collected data

K Zhang, L Janson, S Murphy - Advances in neural …, 2021 - proceedings.neurips.cc
Bandit algorithms are increasingly used in real-world sequential decision-making problems.
Associated with this is an increased desire to be able to use the resulting datasets to answer …

Simulated learners in educational technology: A systematic literature review and a turing-like test

T Käser, G Alexandron - International Journal of Artificial Intelligence in …, 2024 - Springer
Simulation is a powerful approach that plays a significant role in science and technology.
Computational models that simulate learner interactions and data hold great promise for …

Evaluating online bandit exploration in large-scale recommender system

H Guo, R Naeff, A Nikulkov, Z Zhu - arxiv preprint arxiv:2304.02572, 2023 - arxiv.org
Bandit learning has been an increasingly popular design choice for recommender system.
Despite the strong interest in bandit learning from the community, there remains multiple …

A reinforcement learning approach to adaptive remediation in online training

R Spain, J Rowe, A Smith, B Goldberg… - The Journal of …, 2022 - journals.sagepub.com
Advances in artificial intelligence (AI) and machine learning can be leveraged to tailor
training based on the goals, learning needs, and preferences of learners. A key component …

The MOOClet framework: unifying experimentation, dynamic improvement, and personalization in online courses

M Reza, J Kim, A Bhattacharjee, AN Rafferty… - Proceedings of the …, 2021 - dl.acm.org
How can educational platforms be instrumented to accelerate the use of research to improve
students' experiences? We show how modular components of any educational interface-eg …

Automatic interpretable personalized learning

E Prihar, A Haim, A Sales, N Heffernan - Proceedings of the Ninth ACM …, 2022 - dl.acm.org
Personalized learning stems from the idea that students benefit from instructional material
tailored to their needs. Many online learning platforms purport to implement some form of …

Efficient inference without trading-off regret in bandits: An allocation probability test for Thompson sampling

N Deliu, JJ Williams, SS Villar - arxiv preprint arxiv:2111.00137, 2021 - arxiv.org
Using bandit algorithms to conduct adaptive randomised experiments can minimise regret,
but it poses major challenges for statistical inference (eg, biased estimators, inflated type-I …