exploration-exploitation

This project uses reinforcement learning to teach an agent to drive on its own, learning from its observations to maximize its reward (180+ lines).

reinforcement-learning q-learning epsilon-greedy loss-functions deep-q-learning exploration-exploitation

Updated Nov 25, 2022
Jupyter Notebook

Ralami1859 / Action-Elimination-for-Multi-Armed-Bandits

Action elimination for multi-armed bandits

multi-armed-bandit exploration-exploitation action-elimination

Updated Nov 15, 2019
MATLAB

Sagarnandeshwar / Bandit_Algorithms

Reinforcement Learning (COMP 579) Project

reinforcement-learning thompson-sampling epsilon-greedy ucb bernoulli-distribution bandit-algorithms exploration-exploitation

Updated Aug 4, 2023
Jupyter Notebook

spoluan / reinforcement_learning

This repository contains a variety of projects related to reinforcement learning, showcasing different approaches to implementing it in various scenarios.

agent environment deep-reinforcement-learning q-learning policy policy-gradient rewards markov-decision-processes bellman-equation discount-factor exploration-exploitation reinforcment-learning model-based-learning

Updated May 16, 2023
Jupyter Notebook

tyoon10 / Exploration-and-Exploitation

business strategy exploration-exploitation

Updated Jan 2, 2019
Jupyter Notebook

Here are 41 public repositories matching this topic...

mohitpandey92 / k_arm_bandit

hashem20 / Active-Passive-Gap-in-Exploration

JiahongXu123 / OSPO-algorithm

zwkcoding / explore_map_standalone

siavashadpey / MultiArmedBandits

pranav0904 / Reinforcement-Learning

Anjali001 / Reinforcement-Learning

nsandholtz / hotspot_paper

alxndrTL / RL-essais-cliniques

avorozhtsov / shipit

baturaysaglam / Q-Error-Exploration

rom1mouret / exploration

Giovannibriglia / AgentGroup_CausalRL

kalexandriabond / competing-representations-shape-evidence-accumulation

ruqoyyasadiq / deep_RL-multi-arm-bandit-exploration

SXV357 / Inspirit-AI-Deep-Dive-Designing-DL-Systems-FinalProject-RL-for-Autonomous-Vehicles

Ralami1859 / Action-Elimination-for-Multi-Armed-Bandits

Sagarnandeshwar / Bandit_Algorithms

spoluan / reinforcement_learning

tyoon10 / Exploration-and-Exploitation
