RC2020

ML Reproducibility Challenge 2020 is a community challenge for machine learning enthusiasts, students, and researchers in which participants select a paper from one of the year's prestigious ML conferences (NeurIPS, ICML, ICLR, ACL, EMNLP, CVPR, or ECCV). By attempting to replicate the paper, participants provide additional evidence either for or against the claims and results of the work. Alongside the goal of evaluating the validity and legitimacy of recent research, this project primarily serves to assess the reproducibility and replicability of machine learning research.

Here is a collection of my work-in-progress code for the challenge. I am working on Discovering Reinforcement Learning Algorithms by DeepMind.

I do not expect this code to be well organized until near the end of the project. For now, I am simply committing the source code to this repo each time I update it.

Progress Log

Thursday, November 5, 2020

Implemented all Grid World environments, including Tabular Grid World and Random Grid World, and the five maps for each type of grid world.

Sunday, November 8, 2020

Implemented all Delayed Chain MDP environments, including four standard maps and one unique mode.
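For readers unfamiliar with this kind of environment, here is a minimal sketch of a delayed-reward chain. It is not the code in the DelayedChainMDP folder. It assumes the usual setup where only the first action decides the reward and the reward arrives only at the end of the chain. The class name `DelayedChainSketch` and the parameters `length` and `good_action` are hypothetical, chosen for illustration.

```python
class DelayedChainSketch:
    """Hypothetical delayed-reward chain: only the first action matters.

    Assumption: two actions (0 and 1); the first action taken in an
    episode fixes the sign of the reward paid at the final step.
    """

    def __init__(self, length=5, good_action=0):
        self.length = length            # number of steps per episode
        self.good_action = good_action  # first action that earns +1
        self.reset()

    def reset(self):
        self.t = 0                # current position in the chain
        self.first_action = None  # remembered until the episode ends
        return self.t

    def step(self, action):
        if self.t == 0:
            self.first_action = action
        self.t += 1
        done = self.t >= self.length
        reward = 0.0
        if done:
            # The reward is delayed until the last step of the chain.
            reward = 1.0 if self.first_action == self.good_action else -1.0
        return self.t, reward, done
```

Every intermediate step returns a reward of zero, so an agent has to assign credit for the final reward back to its very first decision.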

Saturday, November 14, 2020

Rewrote the Grid World environments, doubling the speed of simulation and improving the rendering graphics.

Tuesday, November 17, 2020

Wrote the Agent abstract class and all of its concrete subclasses: TabularAgent (for the Tabular Grid World environment), FunctionalAgent (for environments demanding function approximation, i.e., Random Grid World and Delayed Chain MDP + State Distraction), and BinaryAgent (for the standard Delayed Chain MDP environments without state distraction).
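The shape of this hierarchy can be sketched roughly as follows. This is an illustration only, not the contents of agents.py: the method names `logits`, `policy`, and `act`, and the constructor arguments, are assumptions. Only a tabular subclass is shown; FunctionalAgent and BinaryAgent would override `logits` in the same way.

```python
import math
import random
from abc import ABC, abstractmethod


class Agent(ABC):
    """Hypothetical base class: subclasses only define action preferences."""

    def __init__(self, num_actions, seed=None):
        self.num_actions = num_actions
        self.rng = random.Random(seed)

    @abstractmethod
    def logits(self, state):
        """Return one unnormalized preference per action."""

    def policy(self, state):
        # Numerically stable softmax over the action preferences.
        z = self.logits(state)
        m = max(z)
        exps = [math.exp(v - m) for v in z]
        total = sum(exps)
        return [e / total for e in exps]

    def act(self, state):
        # Sample an action index from the softmax policy.
        weights = self.policy(state)
        return self.rng.choices(range(self.num_actions), weights=weights)[0]


class TabularAgent(Agent):
    """Stores a separate row of preferences for every discrete state."""

    def __init__(self, num_states, num_actions, seed=None):
        super().__init__(num_actions, seed)
        self.table = [[0.0] * num_actions for _ in range(num_states)]

    def logits(self, state):
        return self.table[state]
```

Keeping the softmax and sampling in the base class means each concrete agent differs only in how it produces preferences, whether from a lookup table or from a function approximator.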

Wednesday, November 18, 2020

Wrote the LPG Model class and the Embedding layer it uses to encode the categorical prediction vector $y$.
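As a rough illustration of what such an embedding does, the sketch below maps a categorical vector $y$ to a smaller vector with a single linear layer. It is not the layer in lpg.py: the class name `LinearEmbedding`, the dimensions, and the choice of one linear layer with no activation are all assumptions made for the example.

```python
import random


class LinearEmbedding:
    """Hypothetical embedding: a linear map from len(y) inputs to output_dim."""

    def __init__(self, input_dim, output_dim, seed=0):
        rng = random.Random(seed)
        # One weight row per output unit, initialised with small random values.
        self.weights = [
            [rng.gauss(0.0, 0.1) for _ in range(input_dim)]
            for _ in range(output_dim)
        ]
        self.bias = [0.0] * output_dim

    def __call__(self, y):
        if len(y) != len(self.weights[0]):
            raise ValueError("y has the wrong number of categories")
        # Each output is a weighted sum of the category probabilities.
        return [
            sum(w * v for w, v in zip(row, y)) + b
            for row, b in zip(self.weights, self.bias)
        ]
```

For example, `LinearEmbedding(4, 2)([0.25, 0.25, 0.25, 0.25])` returns a list of two numbers summarising the four-category distribution.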

Sunday, November 22, 2020

Began testing a first, simple implementation. This is still a work in progress, but I have started writing the overall code.

Saturday, November 28, 2020

Began writing the final code implementation.

