Implementation of upper-confidence reinforcement learning algorithm with nearest neighbor function approximator in the game cartpole. See https://arxiv.org/abs/1905.01576v1 for detail.
Implementation of upper-confidence reinforcement learning algorithm with nearest neighbor function approximator in the game cartpole. See https://arxiv.org/abs/1905.01576v1 for detail.