@timlautk on Backlist

1 appearance on the backlist front page in the last 30 days.

38.

(x.com)

1/4 New paper with @weijie444 ! We introduce a symmetry-compatible principle for LLM optimizer design and, as a byproduct, get an end-to-end layerwise optimizer stack where every major matrix-valued parameter (embeddings, LM heads, SwiGLU

by (Tim Lau) · backlist 2026-05-19 · rubric 93.0