Abstract

Deep Hierarchical Reinforcement Learning Algorithms in Partially Observable Markov Decision Processes

Abstract

In recent years, reinforcement learning has achieved many remarkable successes due to the growing adoption of deep learning techniques and the rapid growth in computing power. Nevertheless, it is well-known that flat reinforcement learning algorithms are often not able to learn well and data-efficient in tasks having hierarchical structures, e.g. consisting of multiple subtasks. Hierarchical reinforcement learning is a principled approach that is able to tackle these challenging tasks. On the other hand, many real-world tasks usually have only partial observability in which state measurements are often imperfect and partially observable. The problems of RL in such settings can be formulated as a partially observable Markov decision process (POMDP). In this paper, we study {\color{red}hierarchical} RL in POMDP in which the tasks have only partial observability and possess {\color{red}hierarchical} properties. We propose a hierarchical deep reinforcement learning approach for learning in hierarchical POMDP. The deep hierarchical RL algorithm is proposed to apply to both MDP and POMDP learning. We evaluate the proposed algorithm on various challenging hierarchical POMDP.

Link

https://ieeexplore.ieee.org/document/8421749

Require

Python 2.7
Tensorflow 1.4
OpenCV 3.4.0

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
hdrqn		hdrqn
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

hdrqn

hdrqn

README.md

README.md

Repository files navigation

Deep Hierarchical Reinforcement Learning Algorithms in Partially Observable Markov Decision Processes

Abstract

Link

Require

About

Releases

Packages

Languages

lephamtuyen/hdrqn

Folders and files

Latest commit

History

hdrqn

hdrqn

README.md

README.md

Repository files navigation

Deep Hierarchical Reinforcement Learning Algorithms in Partially Observable Markov Decision Processes

Abstract

Link

Require

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages