Two insights from LeapAlign: (x.com)
Two insights from LeapAlign: 1. Gradient descent, rather than GRPO, is native to diffusion post-training. 2. Early generation steps should be trained, such that image layout can be better optimized. Thanks @hillbig for posting this work.