I am a Research Scientist at OpenAI working at the intersection of machine learning and robotics. My research focuses on applying deep reinforcement learning, generative models, and synthetic data to problems in robotic perception and control.
Additionally, I co-organize a machine learning training program for engineers to learn about production-ready deep learning called Full Stack Deep Learning.
Josh Tobin, Lukas Biewald, Rocky Duan, Marcin Andrychowicz, Ankur Handa, Vikash Kumar, Bob McGrew, Alex Ray, Jonas Schneider, Peter Welinder, Wojciech Zaremba, Pieter Abbeel. In the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Madrid, Spain, October 2018. [pdf, video]
OpenAI: Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba. arXiv:1808.00177, August 2018. [pdf, blog post, video 1, video 2]
Matthias Plappert, Marcin Andrychowicz, Alex Ray, Bob McGrew, Bowen Baker, Glenn Powell, Jonas Schneider, Josh Tobin, Maciek Chociej, Peter Welinder, Vikash Kumar, Wojciech Zaremba. arXiv:1802.09464, February 2018. [pdf, blog post]
Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, Pieter Abbeel, Wojciech Zaremba. In Neural Information Processing Systems (NIPS), Long Beach, CA, December 2017. [pdf, video]
Josh Tobin, Rachel Fong, Alex Ray, Jonas Schneider, Wojciech Zaremba, Pieter Abbeel. In the proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Vancouver, Canada, October 2017. [pdf, video 1, video 2, video 3, Blog post 1, Blog post 2, Blog post 3]
Paul Christiano, Zain Shah, Igor Mordatch, Jonas Schneider, Trevor Blackwell, Josh Tobin, Pieter Abbeel, Wojciech Zaremba. arXiv:1610.03518, October 2016. [pdf]
How is domain randomization being used now, and where should we go from here? Slides from a talk at the Sim2Real Workshop at RSS, June 23, 2019.
A decision tree for building bug-free models and improving their performance. Originally from a talk at Full Stack Deep Learning, March 2019.