Readings

Towards Solving Text-based Games by Producing Adaptive Action Spaces

https://arxiv.org/pdf/1812.00855.pdf

Producing an adaptive action space is framed as a supervised learning task: the game context is the input and the set of admissible commands is the label. A command is admissible if executing it can change the game's state.
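To make the notion of admissibility concrete, here is a minimal sketch, not the paper's code. The toy game, its `toy_step` transition function, and the candidate command list are all hypothetical. A command counts as admissible when applying it yields a state different from the current one, and the resulting list is the kind of label a supervised model would be trained to predict from the context.

```python
from typing import Callable, FrozenSet, List, Tuple

# Hypothetical toy state: (current room, items carried).
State = Tuple[str, FrozenSet[str]]


def toy_step(state: State, command: str) -> State:
    """Toy transition function (an assumption, not a real game engine)."""
    room, inventory = state
    if command == "go east" and room == "kitchen":
        return ("garden", inventory)
    if command == "go west" and room == "garden":
        return ("kitchen", inventory)
    if command == "take key" and room == "kitchen" and "key" not in inventory:
        return (room, inventory | {"key"})
    # Unrecognized or ineffective commands leave the state unchanged.
    return state


def admissible_commands(
    state: State,
    candidates: List[str],
    step: Callable[[State, str], State] = toy_step,
) -> List[str]:
    """Keep only the candidates whose execution changes the state."""
    return [c for c in candidates if step(state, c) != state]


candidates = ["go east", "go west", "take key", "eat key"]
start: State = ("kitchen", frozenset())

# One (input, label) training pair: the context and its admissible commands.
labels = admissible_commands(start, candidates)
print(labels)  # ['go east', 'take key']
```

In a real game the context would be the textual observation rather than a structured state, which is why the label has to be learned instead of computed.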

Approaches:

a pointer-softmax model that uses beam search to generate multiple commands (worst performer);
a hierarchical recurrent model with pointer-softmax generating multiple commands at once;
a pointer-softmax model generating multiple commands at once.

Note: Beam search builds its search tree breadth-first. At each level of the tree, it generates all successors of the states at the current level and sorts them in increasing order of heuristic cost. It then keeps only a fixed number of the best states (the beam width) and expands only those at the next level.
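The beam search note can be sketched as follows. This is a generic illustration over a toy integer search problem, not the paper's decoder; the function name `beam_search`, the successor rule, and the heuristic are assumptions chosen for the example.

```python
from typing import Callable, List, Optional


def beam_search(
    start: int,
    successors: Callable[[int], List[int]],
    cost: Callable[[int], float],
    is_goal: Callable[[int], bool],
    beam_width: int,
    max_depth: int,
) -> Optional[List[int]]:
    """Level-by-level search that keeps only the `beam_width` best paths."""
    if is_goal(start):
        return [start]
    frontier = [[start]]  # each entry is a path from the start state
    for _ in range(max_depth):
        # Generate all successors of the states at the current level.
        candidates = [path + [nxt] for path in frontier for nxt in successors(path[-1])]
        if not candidates:
            return None
        # Sort in increasing order of heuristic cost, then prune to the beam.
        candidates.sort(key=lambda path: cost(path[-1]))
        frontier = candidates[:beam_width]
        for path in frontier:
            if is_goal(path[-1]):
                return path
    return None


# Toy problem (hypothetical): reach 10 from 3 using "+1" or "*2".
path = beam_search(
    start=3,
    successors=lambda n: [n + 1, n * 2],
    cost=lambda n: abs(10 - n),
    is_goal=lambda n: n == 10,
    beam_width=2,
    max_depth=6,
)
print(path)  # [3, 4, 8, 9, 10]
```

With `beam_width=1` the same call returns `None`: the search greedily commits to 6, then 12, and never recovers. Pruning makes beam search cheap but incomplete, which may be relevant when it is used to generate several commands.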

Related: Seq2Seq. LSTM-DQN. GloVe. Bidirectional RNN.

The models are trained on the full inventory and look descriptions. In a more realistic challenge, these would need to be replaced by features learned from the environment.

Bottom line: How important is it to be able to generate unseen commands?

Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning

https://arxiv.org/pdf/1812.01628.pdf
