Tetsuro Morimura

อ้างโดย

	ทั้งหมด	ตั้งแต่ปี 2020
การอ้างอิง	1272	722
ดัชนี h	15	12
ดัชนี i10	25	15

200

100

150

20072008200920102011201220132014201520162017201820192020202120222023202420256 5 5 14 4 20 43 49 36 42 52 69 105 112 151 119 183 137 20

ผู้เขียนร่วม

Takayuki OsogamiIBMยืนยันอีเมลแล้วที่ jp.ibm.com
Takayuki KatsukiIBM Research - Tokyoยืนยันอีเมลแล้วที่ jp.ibm.com
Tsuyoshi IdeIBM T. J. Watson Research Centerยืนยันอีเมลแล้วที่ us.ibm.com
Masashi SugiyamaDirector, RIKEN Center for Advanced Intelligence Project / Professor, The University of Tokyoยืนยันอีเมลแล้วที่ k.u-tokyo.ac.jp
Hisashi KashimaProfessor, Kyoto Universityยืนยันอีเมลแล้วที่ i.kyoto-u.ac.jp
Kenji DoyaOkinawa Institute of Science and Technologyยืนยันอีเมลแล้วที่ oist.jp
Rudy RaymondJPMorgan Chaseยืนยันอีเมลแล้วที่ jpmchase.com
Eiji UchibeDept. of Brain Robot Interface, ATR Computational Neuroscience Labs.ยืนยันอีเมลแล้วที่ atr.jp
Toshiyuki TanakaGraduate School of Informatics, Kyoto Universityยืนยันอีเมลแล้วที่ i.kyoto-u.ac.jp
Tomoyuki SHIRAIKyushu Universityยืนยันอีเมลแล้วที่ imi.kyushu-u.ac.jp
Satoshi HaraProfessor, The University of Electro-Communicationยืนยันอีเมลแล้วที่ uec.ac.jp
Jan PetersProfessor for Intelligent Autonomous Systems/TU Darmstadt, Dept. Head/German AI Research Center DFKIยืนยันอีเมลแล้วที่ ias.tu-darmstadt.de
Takamitsu MatsubaraNara Institute of Science and Technologyยืนยันอีเมลแล้วที่ is.naist.jp
Rikiya Takahashi

ติดตาม

Tetsuro Morimura

CyberAgent, Inc.

ยืนยันอีเมลแล้วที่ cyberagent.co.jp

Machine learning reinforcement learning


ชื่อ เรียงตามการอ้างอิง เรียงตามปี เรียงตามชื่อ	อ้างโดย อ้างโดย	ปี
Nonparametric return distribution approximation for reinforcement learning T Morimura, M Sugiyama, H Kashima, H Hachiya, T Tanaka Proceedings of the 27th International Conference on Machine Learning (ICML …, 2010	295	2010
Parametric return density estimation for reinforcement learning T Morimura, M Sugiyama, H Kashima, H Hachiya, T Tanaka arXiv preprint arXiv:1203.3497, 2012	146	2012
Map matching with hidden Markov model on sampled road network R Raymond, T Morimura, T Osogami, N Hirosue Proceedings of the 21st International Conference on Pattern Recognition …, 2012	81	2012
これからの強化学習牧野，貴樹，澁谷，長史，白川，浅田 (No Title), 2016	53	2016
Ibm mega traffic simulator T Osogami, T Imamichi, H Mizuta, T Morimura, R Raymond, T Suzumura, ... IBM Res., Tokyo, Japan, IBM Res. Rep. RT0896, 2012	46	2012
City-wide traffic flow estimation from a limited number of low-quality cameras T Idé, T Katsuki, T Morimura, R Morris IEEE Transactions on Intelligent Transportation Systems 18 (4), 950-959, 2016	44	2016
Utilizing the natural gradient in temporal difference reinforcement learning with eligibility traces T Morimura, E Uchibe, K Doya International Symposium on Information Geometry and Its Applications, 256-263, 2005	44	2005
Solving inverse problem of Markov chain with partial observations T Morimura, T Osogami, T Idé Advances in neural information processing systems 26, 2013	41	2013
Derivatives of logarithmic stationary distributions for policy gradient reinforcement learning T Morimura, E Uchibe, J Yoshimoto, J Peters, K Doya Neural computation 22 (2), 342-376, 2010	37	2010
Assistance generation T Katsuki, T Morimura US Patent 10,878,337, 2020	29	2020
Updating policy parameters under Markov decision process system environment T Morimura, T Osogami, T Shirai US Patent 8,818,925, 2014	25	2014
A generalized natural actor-critic algorithm T Morimura, E Uchibe, J Yoshimoto, K Doya Advances in neural information processing systems 22, 2009	22	2009
強化学習森村哲郎講談社, 2019	21	2019
A new natural policy gradient by stationary distribution metric T Morimura, E Uchibe, J Yoshimoto, K Doya Machine Learning and Knowledge Discovery in Databases: European Conference …, 2008	21	2008
Cooperative neural network reinforcement learning S Dasgupta, T Morimura, T Osogami US Patent App. 15/647,543, 2019	18	2019
Adaptive step-size policy gradients with average reward metric T Matsubara, T Morimura, J Morimoto Proceedings of 2nd Asian Conference on Machine Learning, 285-298, 2010	15	2010
Determining optimal action in consideration of risk T Morimura, T Osogami US Patent 8,639,556, 2014	14	2014
A consistent method for graph based anomaly localization S Hara, T Morimura, T Takahashi, H Yanagisawa, T Suzuki Artificial intelligence and statistics, 333-341, 2015	13	2015
Statistical origin-destination generation with multiple sources T Morimura, S Kato Proceedings of the 21st International Conference on Pattern Recognition …, 2012	13	2012
Filtered direct preference optimization T Morimura, M Sakamoto, Y Jinnai, K Abe, K Ariu arXiv preprint arXiv:2404.13846, 2024	12	2024

ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง

บทความ 1–20

การอ้างอิงต่อปี

การอ้างอิงซ้ำกัน

การอ้างอิงที่รวมเข้าด้วยกัน

เพิ่มผู้เขียนร่วมผู้เขียนร่วม

ติดตาม

อ้างโดย

ผู้เขียนร่วม