- Academic Search

S Reed, N De Freitas - arxiv preprint arxiv:1511.06279, 2015 - arxiv.org

We propose the neural programmer-interpreter (NPI): a recurrent and compositional neural
network that learns to represent and execute programs. NPI has three learnable …

Save Cite Cited by 548 Related articles All 10 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] ieee.org

Pac-Man Conquers Academia: Two Decades of Research Using a Classic Arcade Game

P Rohlfshagen, J Liu, D Perez-Liebana… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org

Pac-Man and its equally popular successor Ms. Pac-Man are often attributed to being the
frontrunners of the golden age of arcade video games. Their impact goes well beyond the …

Save Cite Cited by 64 Related articles All 3 versions Free GPT-4

[Free GPT-4]

[PDF] utexas.edu

[PDF][PDF] Reinforcement learning from simultaneous human and MDP reward.

WB Knox, P Stone - AAMAS, 2012 - cs.utexas.edu

As computational agents are increasingly used beyond research labs, their success will
depend on their ability to learn new skills and adapt to their dynamic, complex environments …

Save Cite Cited by 242 Related articles All 13 versions Free GPT-4 View as HTML

[Free GPT-4]

[PDF] github.io

Tsc-dl: Unsupervised trajectory segmentation of multi-modal surgical demonstrations with deep learning

A Murali, A Garg, S Krishnan… - … on robotics and …, 2016 - ieeexplore.ieee.org

The growth of robot-assisted minimally invasive surgery has led to sizable datasets of fixed-
camera video and kinematic recordings of surgical subtasks. Segmentation of these …

Save Cite Cited by 97 Related articles All 11 versions Free GPT-4

[Free GPT-4]

[PDF] arxiv.org

A conceptual framework for externally-influenced agents: An assisted reinforcement learning review

A Bignold, F Cruz, ME Taylor, T Brys, R Dazeley… - Journal of Ambient …, 2023 - Springer

A long-term goal of reinforcement learning agents is to be able to perform tasks in complex
real-world scenarios. The use of external information is one way of scaling agents to more …

Save Cite Cited by 34 Related articles All 11 versions Free GPT-4

[Free GPT-4]

[PDF] utexas.edu

Learning from human-generated reward

WB Knox - 2012 - repositories.lib.utexas.edu

Robots and other computational agents are increasingly becoming part of our daily lives.
They will need to be able to learn to perform new tasks, adapt to novel situations, and …

Save Cite Cited by 61 Related articles All 7 versions Free GPT-4 View as HTML

[Free GPT-4]

[HTML] nih.gov

Discovering multimodal behavior in Ms. Pac-Man through evolution of modular neural networks

J Schrum, R Miikkulainen - IEEE transactions on computational …, 2015 - ieeexplore.ieee.org

Ms. Pac-Man is a challenging video game in which multiple modes of behavior are required:
Ms. Pac-Man must escape ghosts when they are threats and catch them when they are …

Save Cite Cited by 40 Related articles All 6 versions Free GPT-4

[Free GPT-4]

[PDF] aaai.org

Learning via human feedback in continuous state and action spaces

NA Vien, W Ertel, TC Chung - Applied intelligence, 2013 - Springer

This paper considers the problem of extending Training an Agent Manually via Evaluative
Reinforcement (TAMER) in continuous state and action spaces. Investigative research using …

Save Cite Cited by 44 Related articles All 13 versions Free GPT-4

[Free GPT-4]

[PDF] researchgate.net

Performance evaluation of machine learning algorithms in Apache spark for intrusion detection

A Dobson, K Roy, X Yuan, J Xu - 2018 28th International …, 2018 - ieeexplore.ieee.org

As the Internet continues to get stronger, so does the potential risk of malicious users trying
to harm others. An intrusion detection system (IDS) can be used to alert the appropriate …

Save Cite Cited by 17 Related articles All 4 versions Free GPT-4

[Free GPT-4]

[PDF] github.io

[PDF][PDF] Efficient exploration in monte carlo tree search using human action abstractions

K Subramanian, J Scholz, CL Isbell… - Proceedings of the …, 2016 - kausubbu.github.io

Abstract Monte Carlo Tree Search (MCTS) is a family of methods for planning in large
domains. It focuses on finding a good action for a particular state, making its complexity …

Save Cite Cited by 9 Related articles View as HTML

Create alert

Cite

Advanced search

Saved to My library

Learning options through human interaction

Neural programmer-interpreters

Pac-Man Conquers Academia: Two Decades of Research Using a Classic Arcade Game

[PDF][PDF] Reinforcement learning from simultaneous human and MDP reward.

Tsc-dl: Unsupervised trajectory segmentation of multi-modal surgical demonstrations with deep learning

A conceptual framework for externally-influenced agents: An assisted reinforcement learning review

Learning from human-generated reward

Discovering multimodal behavior in Ms. Pac-Man through evolution of modular neural networks

Learning via human feedback in continuous state and action spaces

Performance evaluation of machine learning algorithms in Apache spark for intrusion detection

[PDF][PDF] Efficient exploration in monte carlo tree search using human action abstractions