Modular deep reinforcement learning from reward and punishment for robot navigation J Wang, S Elfwing, E Uchibe Neural Networks 135, 115-126, 2021 | 57 | 2021 |
EM-based policy hyper parameter exploration: application to standing and balancing of a two-wheeled smartphone robot J Wang, E Uchibe, K Doya Artificial Life and Robotics 21, 125-131, 2016 | 14 | 2016 |
Adaptive baseline enhances em-based policy search: validation in a view-based positioning task of a smartphone balancer J Wang, E Uchibe, K Doya Frontiers in Neurorobotics 11, 1, 2017 | 13 | 2017 |
Deep Reinforcement Learning by Parallelizing Reward and Punishment using the MaxPain Architecture J Wang, S Elfwing, E Uchibe | 9 | 2018 |
Control of two-wheel balancing and standing-up behaviors by an android phone robot J Wang, E Uchibe, K Doya Annual Conference on Robotics Society of Japan–RSJ, 2014 | 6 | 2014 |
Standing-up and balancing behaviors of android phone robot J Wang, E Uchibe, K Doya Technical committee on Nonlinear Problems, IEICE, Hong Kong, China, 2013 | 3 | 2013 |
ロボット制御のための決定論的方策探査法 内部英治, 王潔心 日本神経回路学会誌 24 (4), 195-203, 2017 | 2 | 2017 |
Reward-punishment reinforcement learning with maximum entropy J Wang, E Uchibe 2024 International Joint Conference on Neural Networks (IJCNN), 1-7, 2024 | 1 | 2024 |
EM-based policy search for learning foraging and mating behaviors E Uchibe, J Wang In proceedings of the 30th robotics and mechatronics conference, 2018 | | 2018 |
アンドロイド携帯搭載型ロボットの起き上がりとバランス行動: バネを取り付けた二輪倒立ロボットの制御 内部英治, 銅谷賢治 電子情報通信学会技術研究報告= IEICE technical report: 信学技報 113 (341), 49-54, 2013 | | 2013 |