5.
Self-Play Plus a Pinch of Human Data Makes Driving More Human-Like
Thirty minutes of human demonstrations regularize self-play enough to produce much more human-like driving policies
1 appearance on the backlist front page in the last 30 days.
Thirty minutes of human demonstrations regularize self-play enough to produce much more human-like driving policies