Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
-
Updated
Nov 10, 2021 - Python
Code for the numerical experiments in Zhang, Sheng, Zhe Zhang, and Siva Theja Maguluri. "Finite Sample Analysis of Average-Reward TD Learning and Q-Learning."
Add a description, image, and links to the average-reward topic page so that developers can more easily learn about it.
To associate your repository with the average-reward topic, visit your repo's landing page and select "manage topics."