34.
RL has largely been a consumer of a deep learning toolkit that was developed for supervised learning. In our rece…
RL has largely been a consumer of a deep learning toolkit that was developed for supervised learning. In our recent work we explore RL specific hierarchical state representations that allow agents to overcome issues with low quality demonst