@j_foerst on Backlist

34.

RL has largely been a consumer of a deep learning toolkit that was developed for supervised learning. In our rece…

RL has largely been a consumer of a deep learning toolkit that was developed for supervised learning. In our recent work we explore RL specific hierarchical state representations that allow agents to overcome issues with low quality demonst

by @j_foerst (Jakob Foerster) · backlist 2026-05-25 · rubric 95.0