ติดตาม
Filip Wolski
Filip Wolski
Member of Technical Staff, OpenAI
ยืนยันอีเมลแล้วที่ openai.com
ชื่อ
อ้างโดย
อ้างโดย
ปี
Proximal policy optimization algorithms
J Schulman, F Wolski, P Dhariwal, A Radford, O Klimov
arXiv preprint arXiv:1707.06347, 2017
240842017
Hindsight experience replay
M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ...
Advances in neural information processing systems 30, 2017
31402017
Dota 2 with large scale deep reinforcement learning
C Berner, G Brockman, B Chan, V Cheung, P Dębiak, C Dennison, ...
arXiv preprint arXiv:1912.06680, 2019
21182019
Evolved policy gradients
R Houthooft, Y Chen, P Isola, B Stadie, F Wolski, OAI Jonathan Ho, ...
Advances in Neural Information Processing Systems 31, 2018
2932018
Dota 2 with large scale deep reinforcement learning
G Brockman, B Chan, V Cheung, P Debiak, C Dennison, D Farhi, ...
arXiv preprint arXiv:1912.06680 2, 2019
1232019
Advances in neural information processing systems
M Andrychowicz, F Wolski, A Ray, J Schneider, R Fong, P Welinder, ...
Proceedings of the 30th International Conference on Neural Information …, 2017
692017
Dota 2 with large scale deep reinforcement learning. arXiv 2019
C Berner, G Brockman, B Chan, V Cheung, P Debiak, C Dennison, ...
arXiv preprint arXiv:1912.06680, 1912
641912
Klimov
J Schulman, F Wolski, P Dhariwal, A Radford
Proximal policy optimization algorithms, 1-12, 2017
542017
Proximal Policy Optimization Algorithms, August 2017
J Schulman, F Wolski, P Dhariwal, A Radford, O Klimov
arXiv preprint arXiv:1707.06347, 0
50
Proximal policy optimization
J Schulman, O Klimov, F Wolski, P Dhariwal, A Radford
arXiv preprint arXiv:1707.06347, 1-12, 2017
412017
Long-term planning and situational awareness in openai five
J Raiman, S Zhang, F Wolski
arXiv preprint arXiv:1912.06721, 2019
182019
Evolved Policy Gradients: Supplementary Materials
R Houthooft, RY Chen, P Isola, BC Stadie, F Wolski, J Ho, P Abbeel
ระบบไม่สามารถดำเนินการได้ในขณะนี้ โปรดลองใหม่อีกครั้งในภายหลัง
บทความ 1–12