Skip to content

Remember the sad Marvin from "Hitchhiker's guide to the galaxy"? In this project we train him to walk from the scratch using only pure python with numpy!

Notifications You must be signed in to change notification settings

livankrekh/WalkingMarvin

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Walking Marvin (bipedal walker problem)

Remember the sad Marvin from "Hitchhiker's guide to the galaxy"? In this project we train him to walk from the scratch using only pure python with numpy!

gif_sad_marvin

Installation guide

To install Marvin environment and dependencies, please, run following shell script:

> ./install.sh

It installs python3.7 with all dependencies inside the virtualenv named venv.

To run Marvin, please, use the following commands:

> ./marvin.py <-r>

or

> source venv/bin/activate
> python3 marvin.py <-r>

Flag -r is used for running trained Marvin (by default pretrained model is used)

Methodology

As main model used fully-connected network as part of reinforce algorithm with policy gradient methodology.
Model predicts only mean of the actions distribution without standart deviation. Loss function consist only from predicted mean.

Results

Basically, Marvin starts from the folowing state:

gif_marvin1

The pretrained Marvin can walk in the next way:

gif_marvin1

And in the trained state he can moves with the next speed:

gif_marvin1

Bibliography

About

Remember the sad Marvin from "Hitchhiker's guide to the galaxy"? In this project we train him to walk from the scratch using only pure python with numpy!

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published