15.
Robotics RL is often imitation learning in disguise
Reward shaping, curricula, initialization, and environment design can smuggle human demonstrations into reinforcement-learning systems indirectly
1 appearance on the backlist front page in the last 30 days.