@marcel_hussing on Backlist

59.

This was a very fun project. In Behavior-Consistent Deep RL, we provide a method that aligns the behavior of inde…

This was a very fun project. In Behavior-Consistent Deep RL, we provide a method that aligns the behavior of independently trained policies. It turns out, this works even in high dimensional spaces. Here are 6 seeds of Humanoids (all ca sam

by @marcel_hussing (Marcel Hussing) · backlist 2026-05-26 · rubric 91.0