# exploration-exploitation

Here are 41 public repositories matching this topic...

zwkcoding / explore_map_standalone

Maintains an environmental exploration map and updates it using Bayesian probability, **for autonomous vehicles**

map exploration-exploitation autonomous-expl

Updated Apr 24, 2018
C++

TianhongDai / self-imitation-learning-pytorch

This is the PyTorch implementation of the ICML 2018 paper "Self-Imitation Learning".

reinforcement-learning-algorithms atari-games a2c exploration-exploitation

Updated Nov 4, 2018
Python

tyoon10 / Exploration-and-Exploitation

business strategy exploration-exploitation

Updated Jan 2, 2019
Jupyter Notebook

hashem20 / Active-Passive-Gap-in-Exploration

Active versus Passive exploration

decision-making psychology active-learning exploration-exploitation

Updated Feb 3, 2019
MATLAB

hridayns / Research-Project-on-Reinforcement-learning

Research Thesis - Reinforcement Learning

reinforcement-learning openai-gym dqn ddqn exploration-exploitation

Updated May 22, 2019
Python

gokceuludogan / interactive-music-recommendation

Personalized and Interactive Music Recommendation with Bandit approach

music-recommendation bandit-algorithms exploration-exploitation bayes-ucb

Updated Sep 15, 2019
Jupyter Notebook

Ralami1859 / Action-Elimination-for-Multi-Armed-Bandits

Action elimination for multi-armed bandits

multi-armed-bandit exploration-exploitation action-elimination

Updated Nov 15, 2019
MATLAB

mbhenaff / neural-e3

deep-learning deep-reinforcement-learning model-based-rl exploration-exploitation

Updated Dec 17, 2019
Python

wzhe06 / Reco-papers

Classic papers and resources on recommendation

machine-learning reinforcement-learning deep-learning recommender-system recommendation exploration-exploitation

Updated Jun 13, 2020
Python

rom1mouret / exploration

over-parameterization = exploration ?

global-optimization gradient-descent hypernetworks exploration-exploitation over-parameterization

Updated Aug 23, 2020
Python

pranav0904 / Reinforcement-Learning

OpenAI Gym environment implementation

reinforcement-learning openai gym exploration-exploitation

Updated Nov 14, 2020
Jupyter Notebook

Amshra267 / Thompson-Greedy-Comparison-for-MultiArmed-Bandits

A comparison of two methods for dealing with the exploration-exploitation dilemma in multi-armed bandits

thompson-sampling epsilon-greedy exploration-exploitation optimistic-bayesian-sampling

Updated Jul 2, 2021
Python

kkm24132 / ReinforcementLearning

Focuses on Reinforcement Learning related concepts, use cases, and learning approaches

reinforcement-learning q-learning policy-gradient sarsa multi-armed-bandits montecarlo linear-function-approximation exploration-exploitation temporal-difference-algorithms

Updated Jul 5, 2021
Jupyter Notebook

JiahongXu123 / OSPO-algorithm

OSPO is a novel metaheuristic algorithm which has the potential to solve different kinds of problems with promising performance.

global-optimization adaptive optimization-algorithms metaheuristics exploration-exploitation

Updated Aug 12, 2021

ruqoyyasadiq / deep_RL-multi-arm-bandit-exploration

An implementation of the reinforcement learning multi-armed bandit experiment using different exploration techniques.

reinforcement-learning reinforcement-learning-algorithms bandit-algorithms exploration-exploitation exploration-strategy

Updated Oct 4, 2021
Python

nsandholtz / hotspot_paper

A companion repository for 'Inverse Bayesian Optimization: Learning Human Acquisition Functions in an Exploration vs Exploitation Search Task'

optimization bayesian-inference inverse-problems bayesian-optimization directional-statistics exploration-exploitation

Updated Jan 20, 2022
R

kochlisGit / Reinforcement-Learning-Algorithms

This project compares different reinforcement learning algorithms, including Monte Carlo, Q-learning, lambda Q-learning, epsilon-greedy variations, etc.

python reinforcement-learning monte-carlo openai-gym q-learning policy rl-agents epsilon-greedy dynamic-programming markov-chains approximation-algorithms ucb1 q-lambda exploration-exploitation thomson-sampling frozen-lake multi-bandit-army

Updated Feb 15, 2022
Python

guptav96 / bandit-algorithms

A short implementation of bandit algorithms - ETC, UCB, MOSS and KL-UCB

reinforcement-learning bandit-algorithms exploration-exploitation

Updated Feb 27, 2022
Python

mohitpandey92 / k_arm_bandit

A simple exercise in reinforcement learning

machine-learning reinforcement-learning exploration-exploitation

Updated Mar 24, 2022
Jupyter Notebook

siavashadpey / MultiArmedBandits

reinforcement-learning active-learning bandit-algorithms exploration-exploitation

Updated Mar 27, 2022
Python
