‪Filip Wolski‬ - ‪Academic Search‬

รับโปรไฟล์ของฉันเอง

อ้างโดย

	ทั้งหมด	ตั้งแต่ปี 2020
การอ้างอิง	30054	27665
ดัชนี h	11	11
ดัชนี i10	11	11

0

9000

4500

2250

6750

20172018201920202021202220232024202578 636 1555 2714 3877 4900 6608 8169 1397

ติดตาม

Filip Wolski

Filip Wolski

Member of Technical Staff, OpenAI

ยืนยันอีเมลแล้วที่ openai.com

artificial intelligence reinforcement learning algorithms


ชื่อ เรียงตามการอ้างอิง เรียงตามปี เรียงตามชื่อ	อ้างโดย อ้างโดย	ปี
Proximal policy optimization algorithms J Schulman, F Wolski, P Dhariwal, A Radford, O Klimov arXiv preprint arXiv:1707.06347, 2017	24084	2017
Hindsight experience replay M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ... Advances in neural information processing systems 30, 2017	3140	2017
Dota 2 with large scale deep reinforcement learning C Berner, G Brockman, B Chan, V Cheung, P Dębiak, C Dennison, ... arXiv preprint arXiv:1912.06680, 2019	2118	2019
Evolved policy gradients R Houthooft, Y Chen, P Isola, B Stadie, F Wolski, OAI Jonathan Ho, ... Advances in Neural Information Processing Systems 31, 2018	293	2018
Dota 2 with large scale deep reinforcement learning G Brockman, B Chan, V Cheung, P Debiak, C Dennison, D Farhi, ... arXiv preprint arXiv:1912.06680 2, 2019	123	2019
Advances in neural information processing systems M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ... Proceedings of the 30th International Conference on Neural Information …, 2017	69	2017
Dota 2 with large scale deep reinforcement learning. arXiv 2019 C Berner, G Brockman, B Chan, V Cheung, P Debiak, C Dennison, ... arXiv preprint arXiv:1912.06680, 1912	64	1912
Klimov J Schulman, F Wolski, P Dhariwal, A Radford Proximal policy optimization algorithms, 1-12, 2017	54	2017
Proximal Policy Optimization Algorithms, August 2017 J Schulman, F Wolski, P Dhariwal, A Radford, O Klimov arXiv preprint arXiv:1707.06347, 0	50
Proximal policy optimization J Schulman, O Klimov, F Wolski, P Dhariwal, A Radford arXiv preprint arXiv:1707.06347, 1-12, 2017	41	2017
Long-term planning and situational awareness in openai five J Raiman, S Zhang, F Wolski arXiv preprint arXiv:1912.06721, 2019	18	2019
Evolved Policy Gradients: Supplementary Materials R Houthooft, RY Chen, P Isola, BC Stadie, F Wolski, J Ho, P Abbeel

ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง

บทความ 1–12