Saturday 10:30 AM–11:00 AM in C11

Proximal Policy Optimization: The new kid in the RL Jungle

Shubham Gupta

Audience level:


My talk introduces Proximal Policy Optimization (PPO), a recently introduced class of reinforcement learning algorithms. These algorithms were released by OpenAI and have been reported to perform comparably to or better than state-of-the-art approaches while being simpler to implement and tune. Interested in RL, or even in training a beast of an Atari player? This is the talk.


The Reinforcement Learning problem

Basic time-tested strategies: a shallow dive

Deep Q-learning

Enter Proximal Policy Optimization Techniques

Comparison of these Algorithms

Minimum takeaway

This will be a short conclusion with simplified takeaway points, so that audience members at all levels gain something from the talk.

