reinforcement

This was more of an experiment to learn how to cuda and parallelize things. Thus, there are many many things that can be improved on. Feel free to use it and contribute if you find it useful.

learning reinforcement tilecoding

Updated Jul 10, 2017
Cuda

0xDaksh / pycsgo

Star

Exploring Reinforcement Learning using Python and CS:GO

python learning tensorflow csgo cnn reinforcement plays pycsgo

Updated Jul 12, 2017

hippover / keras-rl

Star

Deep Reinforcement Learning for Keras.

learning reinforcement

Updated Aug 1, 2017
Jupyter Notebook

VPanjeta / GameBot

Star

A game bot using OpenAI gym and Reinforcement Learinng

learning deep-neural-networks reinforcement-learning deep-learning openai-gym deep openai-universe openai convolutional-neural-networks reinforcement

Updated Sep 13, 2017
Python

dmmiller612 / Deep-Reinforcement-Learning-Keras

Star

Basic deep reinforcement learning algorithms implemented with Keras

reinforcement-learning simple keras deep-reinforcement-learning reinforcement

Updated Oct 6, 2017
Python

alextanhongpin / q-learning

Sponsor

Star

Basic Q-Learning :D

q-learning reinforcement

Updated Nov 17, 2017
Jupyter Notebook

prabhatnagarajan / birl

Star

A Python Implementation of Bayesian Inverse Reinforcement Learning (BIRL)

reinforcement-learning mdp rl bayesian reinforcement inverse-reinforcement-learning

Updated Dec 5, 2017
Python

mabirck / CS294-DeepRL

Star

My content of CS294 Deep Reinforcement Learning course, conduced by Sergey Levine from UC Berkeley.

deep-neural-networks reinforcement-learning deep-learning deep-reinforcement-learning pytorch neural-networks policy-gradient reinforcement pytorch-tutorials cs294 on-policy off-policy

Updated Jan 15, 2018
Python

stabgan / Thompson-Sampling

Star

I applied The Thompson Sampling model in both python and R

learning machine sampling reinforcement thompson

Updated Jan 20, 2018
Python

CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Star

We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting

learning policy multi-agent gradient reinforcement bandit contextual

Updated Mar 9, 2018
Jupyter Notebook

stabgan / Upper-Confidence-Bounds

Star

I implemented the reinforcement learning based model Upper Confidence Bound in both Python and R

learning machine learning-by-doing reinforcement upper bound confidence

Updated Mar 9, 2018
Python

BardOfCodes / DRL_in_CV

Star

A course on Deep Reinforcement Learning in Computer Vision. Visit Website:

computer-vision course-materials deep-reinforcement-learning q-learning policy-gradient reinforcement temporal-differencing-learning

Updated Mar 21, 2018
HTML

persiyanov / just-paper-notes

Star

Notes on DL/RL papers I read

reinforcement-learning deep-learning deeplearning reinforcement

Updated Apr 6, 2018

jarenal / tictactoe-reinforcement-learning

Star

This is a test project for to try Reinforcement Learning (Q-Learning) and machine learning on PHP

learning machine-learning q-learning artificial-intelligence tictactoe reinforcement

Updated May 2, 2018
PHP

sohamghosh121 / PacmanGym

Star

Open AI Gym version of Berkeley AI Pacman with images as states

learning berkeley pacman openai gym rl reinforcement reinforcementlearning

Updated May 4, 2018
Python

Improve this page

Add a description, image, and links to the reinforcement topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the reinforcement topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reinforcement

Here are 101 public repositories matching this topic...

CarsonScott / Dual-Process-Reinforcement

calclavia / rl

vibhorv / Unsupervised_Learning_Game

Alfredvc / paac

ivan-v-kush / vizdoom_rl

JadenTravnik / parallelTileCoding

0xDaksh / pycsgo

hippover / keras-rl

VPanjeta / GameBot

dmmiller612 / Deep-Reinforcement-Learning-Keras

alextanhongpin / q-learning

prabhatnagarajan / birl

mabirck / CS294-DeepRL

stabgan / Thompson-Sampling

CSKrishna / Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

stabgan / Upper-Confidence-Bounds

BardOfCodes / DRL_in_CV

persiyanov / just-paper-notes

jarenal / tictactoe-reinforcement-learning

sohamghosh121 / PacmanGym

Improve this page

Add this topic to your repo