59.
This was a very fun project. In Behavior-Consistent Deep RL, we provide a method that aligns the behavior of inde…
This was a very fun project. In Behavior-Consistent Deep RL, we provide a method that aligns the behavior of independently trained policies. It turns out, this works even in high dimensional spaces. Here are 6 seeds of Humanoids (all ca sam