6.
RL post-training pushes vision-language-action models past 95% reliability (t.co)
EXPO-FT reports perfect success on eight tested robot tasks, using only about 19 minutes of reinforcement-learning data on average.