PyData Warsaw 2017 - Presentation: Learning to solve Rubik's cube

Learning to solve Rubik's cube

Audience level:

Novice

Description

AI is making huge progress in the recent years. Everything started with deep learning techniques for image recognition and later, we've seen game-changing advances in learning to take actions (autonomous cars, Alpha Go engine, robots). Turns out basic problems like solving Rubik's cube can be solved in a few dozens of lines of code and described on one slide. Don't believe? Let me convince you.

Abstract

Problem introduction

Show a Rubik's cube, describe why it isn't easy to solve it.
Show my Python model of the cube and some traditional (non-AI) methods for solving it.

The concept of reinforcement learning (RL)

Multi-armed bandit problem.
Introduce agent, environment, action, and reward.
Introduce Q-function.

Basic algorithms for RL and their motivation

REINFORCE algorithm.
Reward-Weighted regression.
Deep learning algorithms.

Implementing RL for Rubik's cube

Introduce libraries:

Theano/Tensorflow
Keras
Openai's RLLab and Gym.

Present 3 iterations of cube's solving and compare their learning rate and results.

The code will be available on my Github for everyone to try.

Thursday 12:35–13:05 in Track 3