@LiangZheng_06 on Backlist

45.

Two insights from LeapAlign: (x.com)

Two insights from LeapAlign: 1. Gradient descent, rather than GRPO, is native to diffusion post-training. 2. Early generation steps should be trained so that image layout can be better optimized. Thanks @hillbig for posting this work.

by @LiangZheng_06 (Liang Zheng) · backlist 2026-06-12 · rubric 88.0

15.

DiffusionBench: ImageNet gains no longer predict text-to-image gains

Across 21 recent diffusion methods, improvements on ImageNet did not predict text-to-image improvements under identical DiffusionBench settings.

by @LiangZheng_06 (Liang Zheng) · backlist 2026-06-11 · rubric 92.0

80.

Diffusion is differentiable. LLMs aren't. (t.co)

Diffusion is differentiable. LLMs aren't. So why is the diffusion community copying RL methods (GRPO etc.) from LLMs? The native post-training for diffusion is gradient descent such as ReFL and LeapAlign. Paper: http://arxiv.org/abs/260

by @LiangZheng_06 (Liang Zheng) · backlist 2026-06-10 · rubric 74.0

57.

Diffusion is differentiable. LLMs aren't. (t.co)

Diffusion is differentiable. LLMs aren't. So why is the diffusion community copying RL methods (GRPO etc.) from LLMs? The native post-training for diffusion is gradient descent such as ReFL and LeapAlign. Paper: http://arxiv.org/abs/260

by @LiangZheng_06 (Liang Zheng) · backlist 2026-06-09 · rubric 74.0
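
Editor's sketch: the posts above argue that a diffusion sampler is differentiable, so a reward can be pushed back through the generation steps with plain gradient descent instead of an RL estimator. The toy below illustrates only that idea. It is not ReFL or LeapAlign; the one-parameter "sampler", the quadratic reward, and every name in it (`rollout`, `reward_and_grad`, `finetune`) are assumptions made for illustration.

```python
# Toy only: a 1-D deterministic "sampler" with a single learnable parameter.
# All names and the reward are hypothetical, not taken from any paper.

def rollout(theta, x0, num_steps):
    """Run x_{t+1} = (1 - theta) * x_t and track dx/dtheta through every step."""
    x, dx = x0, 0.0
    for _ in range(num_steps):
        # Chain rule per step: d/dtheta[(1 - theta) * x] = -x + (1 - theta) * dx
        x, dx = (1.0 - theta) * x, (1.0 - theta) * dx - x
    return x, dx


def reward_and_grad(theta, x0, target, num_steps):
    """Reward is -(x_T - target)^2; its gradient flows through all steps."""
    x, dx = rollout(theta, x0, num_steps)
    reward = -(x - target) ** 2
    grad = -2.0 * (x - target) * dx
    return reward, grad


def finetune(theta, x0, target, num_steps=4, lr=0.05, iters=200):
    """Plain gradient ascent on the reward, no sampling-based estimator."""
    for _ in range(iters):
        _, grad = reward_and_grad(theta, x0, target, num_steps)
        theta += lr * grad
    return theta


if __name__ == "__main__":
    theta = finetune(theta=0.0, x0=1.0, target=0.5)
    final_x, _ = rollout(theta, 1.0, 4)
    print(f"theta={theta:.4f}, final sample={final_x:.4f}")
```

Because the derivative is accumulated from the first step onward, the earliest steps contribute to the update as much as the last ones, which is the property the first post highlights. A real model would replace the hand-written chain rule with an autodiff framework.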