TLDR; The authors train a DQN-style agent on text-based games. The main difference from a standard DQN is that their Q-value function embeds the state (textual context) and action (text-based choice) separately and then takes the dot product between the two embeddings. The authors call this a Deep Reinforcement Relevance Network (DRRN). Basically, it's just a different Q-function implementation. Empirically, the authors show that their network can learn to solve the "Saving John" and "Machine of Death" text games.
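
A minimal sketch of the DRRN-style Q-function in PyTorch, illustrating the separate state/action embedding towers and the dot-product interaction. The layer sizes, bag-of-words input featurization, and all names here are illustrative assumptions, not the paper's exact architecture.

```python
import torch
import torch.nn as nn

class DRRN(nn.Module):
    """Q(s, a) = dot product of separately embedded state and action texts."""

    def __init__(self, state_dim, action_dim, hidden_dim=100, embed_dim=100):
        super().__init__()
        # Separate MLP towers for the state text and the action texts
        # (dimensions here are placeholders, not the paper's settings).
        self.state_net = nn.Sequential(
            nn.Linear(state_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, embed_dim),
        )
        self.action_net = nn.Sequential(
            nn.Linear(action_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, embed_dim),
        )

    def forward(self, state, actions):
        # state: (state_dim,) features for the current textual context.
        # actions: (num_actions, action_dim) features, one row per choice.
        s = self.state_net(state)     # (embed_dim,)
        a = self.action_net(actions)  # (num_actions, embed_dim)
        return a @ s                  # (num_actions,) Q-values

# Usage: greedily pick the text choice with the highest Q-value.
q = DRRN(state_dim=500, action_dim=500)
state = torch.rand(500)          # e.g. bag-of-words of the story text
actions = torch.rand(4, 500)     # four candidate text choices
best_action = q(state, actions).argmax().item()
```

Because the state and action towers are separate, the number of available actions can vary per step, which is exactly the situation in choice-based text games.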