reinforcement-learning-via-spectral-methods

Model-based reinforcement learning algorithms make decisions by building and utilizing a model of the environment. However, none of the existing algorithms attempts to infer the dynamics of any state-action pair from known state-action pairs before meeting it for sufficient times. We propose a new model-based method called Greedy Inference Model (GIM) that infers the unknown dynamics from known dynamics based on the internal spectral properties of the environment. In other words, GIM can “learn by analogy”. We further introduce a new exploration strategy which ensures that the agent rapidly and evenly visits unknown state-action pairs. GIM is much more computationally efficient than state-of-the-art model-based algorithms, as the number of dynamic programming operations is independent of the environment size. Lower sample complexity could also be achieved under mild conditions compared against methods without inferring. Experimental results demon- strate the effectiveness and efficiency of GIM in a variety of real-world tasks.

Paper Link: https://arxiv.org/abs/1912.10329

Our implementation was modified from simple_rl.

See examples/simple_example.py for how to use GIM. Simply uncomment line 151~154 for multiple tasks that are shown in the paper. The sample experiment results are shown under the folder examples/sample_results.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
examples		examples
tensor_rl		tensor_rl
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

examples

examples

tensor_rl

tensor_rl

.DS_Store

.DS_Store

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Repository files navigation

reinforcement-learning-via-spectral-methods

About

Releases

Packages

Contributors 2

Languages

License

umd-huang-lab/reinforcement-learning-via-spectral-methods

Folders and files

Latest commit

History

Repository files navigation

reinforcement-learning-via-spectral-methods

About

Resources

License

Stars

Watchers

Forks

Languages