TODO:

This project is not finished yet.

Implement saving and restoring trained neural networks [done]
Implement logging [done]
Actually train the neural network using cloud computing

Solving the Rubik's Cube Without Human Knowledge

In this article, Stephen McAleer, Forest Agostinelli, Alexander Shmakov, and Pierre Baldi describe how they approached the problem of solving a Rubik's Cube by computer without supervision. The general idea was to apply reinforcement learning methods, using neural networks both to approximate the value of a given state and to act as a decision function that chooses which move to make. The main challenge was that making random moves would not result in a solved cube even after a long time. That is why the authors trained these networks using what they named "Autodidactic Iteration": starting from simple positions (a cube only a few moves away from solved) and moving on to more complicated cases once the network has already been trained.
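The "simple positions first" idea can be illustrated with a minimal sketch that only generates scramble sequences of increasing length. It is not the code from this repository or from the paper: the names `MOVES`, `inverse`, `random_scramble`, and `curriculum` are hypothetical, no cube state or neural network is modelled, and the scramble length is only an upper bound on the true distance from the solved cube.

```python
import random

# The 12 quarter-turn face moves in standard notation
# (a trailing apostrophe marks a counter-clockwise turn).
MOVES = [face + suffix for face in "UDLRFB" for suffix in ("", "'")]


def inverse(move):
    """Return the move that undoes `move`."""
    return move[0] if move.endswith("'") else move + "'"


def random_scramble(depth, rng):
    """Pick `depth` random moves, never immediately undoing the previous one."""
    scramble = []
    for _ in range(depth):
        candidates = [
            m for m in MOVES
            if not scramble or m != inverse(scramble[-1])
        ]
        scramble.append(rng.choice(candidates))
    return scramble


def curriculum(max_depth, samples_per_depth, seed=0):
    """Yield (depth, scramble) pairs, shortest scrambles first."""
    rng = random.Random(seed)
    for depth in range(1, max_depth + 1):
        for _ in range(samples_per_depth):
            yield depth, random_scramble(depth, rng)


if __name__ == "__main__":
    for depth, scramble in curriculum(max_depth=3, samples_per_depth=2):
        print(depth, " ".join(scramble))
```

Applying each scramble to a solved cube would give training positions that start close to the solution and get gradually harder, which is the ordering the paragraph above describes.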
