Build software better, together

iamjagdeesh / Artificial-Intelligence-Pac-Man

CSE 571 Artificial Intelligence

reinforcement-learning deep-reinforcement-learning q-learning artificial-intelligence neural-networks epsilon-greedy breadth-first-search alpha-beta-pruning depth-first-search minimax-algorithm policy-iteration value-iteration function-approximation expectimax particle-filter-tracking uniform-cost-search greedy-search a-star-search

Updated Jan 3, 2018
Python

kulinshah98 / Multi-Armed-Bandit-Algorithms

Star

Python implementation of UCB, EXP3 and Epsilon greedy algorithms

epsilon-greedy multi-armed-bandits upper-confidence-bounds bandit-algorithms stochastic-bandit-algorithms adversarial-bandit-algorithms exp3-algorithm

Updated Oct 4, 2018
Python

starkblaze01 / Artificial-Intelligence-Codes

Sponsor

Star

Collection of Artificial Intelligence Algorithms implemented on various problems

reinforcement-learning genetic-algorithm epsilon-greedy gaussian-mixture-models confidence-intervals hidden-markov-model hopfield-network decision-tree-classifier hierarchical-clustering artificial-intelligence-algorithms k-sat travelling-salesman-problem k-means-clustering menace jealous-husband adaptive-smoothing

Updated Oct 23, 2020
Jupyter Notebook

akshaykhadse / reinforcement-learning

Star

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

reinforcement-learning linear-programming thompson-sampling epsilon-greedy ucb policy-evaluation mdps multi-armed-bandits policy-iteration randomised-algorithms reinforcement-learning-excercises kl-divergence markovian-epidemic-processes reinforcement-learning-analysis multiarm-bandit ucb1 howards-pi batch-switching randomized-policy-iteration

Updated May 21, 2018
Python

Heewon-Hailey / multi-armed-bandits-for-recommendation-systems

Star

implement basic and contextual MAB algorithms for recommendation system

python numpy scikit-learn epsilon-greedy recommendation-system matplotlib upper-confidence-bounds contextual-bandits multiarmed-bandits

Updated Jan 18, 2022
Jupyter Notebook

ShreeshaN / ReinforcementLearningTutorials

Star

This repo contains implementations of algorithms such a Q-learning, SARSA, TD, Policy gradient

q-learning pytorch dqn epsilon-greedy breakout sarsa policy-iteration value-iteration monte-carlo-methods deep-q-learning model-based-rl model-free-rl td-methods model-free-control

Updated Dec 8, 2019
Python

haidarns / ml-based-lb-ryu

Star

Machine Learning based Load Balancing with RYU OpenFlow Controller

machine-learning load-balancer round-robin ryu epsilon-greedy sdn-controller flask-api iperf3 ip-hash d-itg

Updated Oct 16, 2018
Python

viswanath57 / Bandit-Algorithms

Star

algorithms epsilon-greedy multiarm-bandit softmax-algorithm ucb1

Updated Apr 5, 2021
Jupyter Notebook

Hyeon9mak / HCP_2020

Star

🎮 포켓몬 길찾기 게임 (광운대학교 컴퓨터정보공학부 고급C프로그래밍 팀프로젝트)

q-learning epsilon-greedy q-learning-algorithm frozen-lake-game

Updated Dec 6, 2020
C

antoine-hochart / bandit_algo_evaluation

Star

Offline evaluation of multi-armed bandit algorithms

thompson-sampling epsilon-greedy policy-evaluation multi-armed-bandit upper-confidence-bound

Updated Dec 1, 2020
Python

KaleabTessera / Multi-Armed-Bandit

Star

Implementation of greedy, E-greedy and Upper Confidence Bound (UCB) algorithm on the Multi-Armed-Bandit problem.

reinforcement-learning greedy epsilon-greedy upper-confidence-bounds multi-armed-bandit

Updated Dec 8, 2022
Python

ChaitanyaC22 / Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver

Star

The goal of this project is to build an RL-based algorithm that can help cab drivers maximize their profits by improving their decision-making process on the field. Taking long-term profit as the goal, a method is proposed based on reinforcement learning to optimize taxi driving strategies for profit maximization. This optimization problem is fo…

Updated Jul 9, 2021
Jupyter Notebook

georgedeath / eshotgun

Star

ϵ-shotgun: ϵ-greedy Batch Bayesian Optimisation

optimization epsilon-greedy bayesian-optimization acquisition-functions

Updated Jan 2, 2021
C++

sumanvid97 / FlappyBird-AI

Star

RL algorithms for pygame version of Flappy Bird

reinforcement-learning q-learning epsilon-greedy deep-q-network

Updated May 23, 2018
Python

cyberquill / Riyaaz

Star

A content-based music recommendation system, that suggests playlists made from the locally stored songs, and updates its suggestions based on the user feedback using non-stationary Bayesian reinforcement learning. Created using React and the Electron.js framework.

electron react data-science reinforcement-learning clustering jupyter-notebook music-recommendation artificial-intelligence epsilon-greedy librosa

Updated Oct 5, 2023
Jupyter Notebook

elhamza9 / career-village-recommender-system

Star

3rd Place winning solution on Kaggle Data Science for Good Competition

nlp exploratory-data-analysis plotly data-visualization epsilon-greedy recommendation-engine recommender-system

Updated Apr 27, 2019
Jupyter Notebook

roaked / snake-evolutionary-reinforcement-learning

Star

parameter optimization of a reinforcement learning deep Q network with memory replay buffer using genetic algorithm in the snake game. base code for snake env from codecamp

deep-neural-networks reinforcement-learning optimization genetic-algorithm deep-reinforcement-learning neuroevolution snake-game epsilon-greedy evolutionary-algorithm evolutionary-strategy stochastic-optimization fitness-function snake-ai memory-replay optimistic-exploration

Updated Mar 1, 2024
Python

MoinDalvs / Assignment_East-West_Airlines

Star

Problem Statement Perform clustering (Hierarchical,K means clustering and DBSCAN) for the airlines data to obtain optimum number of clusters

data-science epsilon-greedy clustering-algorithm kmeans-clustering hierarchical-clustering dbscan-clustering

Updated May 2, 2022
Jupyter Notebook

RPG-coder / atari-transfer-learning

Star

Improved Bot Learning process on Atari games by using Transfer Learning. An Extension of Playing Atari with Reinforcement Learning. Part of CS677 NJIT Final Project.

opencv reinforcement-learning local tensorflow keras cnn dqn gym colab epsilon-greedy transfer-learning atari keras-tensorflow tensorflow2 deepmind-atari

Updated Dec 22, 2021
Jupyter Notebook

Rmko4 / RL-Tabular-Rubikscube

Star

Reinforcement Learning with tabular methods: TD-learning (Q-learning and SARSA) and MENACE-like approach applied to a Rubik's cube with a move set restricted to 180-degree turns.

reinforcement-learning q-learning epsilon-greedy sarsa simulated-annealing td-learning softmax menace-matchboxes

Updated Aug 1, 2021
C

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

epsilon-greedy

Here are 77 public repositories matching this topic...

iamjagdeesh / Artificial-Intelligence-Pac-Man

kulinshah98 / Multi-Armed-Bandit-Algorithms

starkblaze01 / Artificial-Intelligence-Codes

akshaykhadse / reinforcement-learning

Heewon-Hailey / multi-armed-bandits-for-recommendation-systems

ShreeshaN / ReinforcementLearningTutorials

haidarns / ml-based-lb-ryu

viswanath57 / Bandit-Algorithms

Hyeon9mak / HCP_2020

antoine-hochart / bandit_algo_evaluation

KaleabTessera / Multi-Armed-Bandit

ChaitanyaC22 / Deep-RL-Project---Maximize-total-profits-earned-by-cab-driver

georgedeath / eshotgun

sumanvid97 / FlappyBird-AI

cyberquill / Riyaaz

elhamza9 / career-village-recommender-system

roaked / snake-evolutionary-reinforcement-learning

MoinDalvs / Assignment_East-West_Airlines

RPG-coder / atari-transfer-learning

Rmko4 / RL-Tabular-Rubikscube

Improve this page

Add this topic to your repo