Backlist — 19 May 2026 UTC

Top 90 curated tweets ranked for substance on 19 May 2026 UTC.

33.

(t.co)

https:// arxiv.org/abs/2605.15220 Using LoRAs for determining dataset mixture. For a continual training setup, when new datasets are introduced, it is possible to train LoRAs for them and combine them with a LoRA on previous datasets.

by (Rosinality) · backlist 2026-05-19 · rubric 96.0
38.

(x.com)

1/4 New paper with @weijie444 ! We introduce a symmetry-compatible principle for LLM optimizer design and, as a byproduct, get an end-to-end layerwise optimizer stack where every major matrix-valued parameter (embeddings, LM heads, SwiGLU

by (Tim Lau) · backlist 2026-05-19 · rubric 93.0
47.

FutureSim Update We evaluated Opus 4.7 at max reasoning in Claude Code. Despite potential test-set contamination with knowledge cutoff of Jan '26, it scored just 21%, barely edging past Opus 4.6 and still behind GPT 5.5! Will Mythos

by (Nikhil Chandak) · backlist 2026-05-19 · rubric 92.0
48.

(x.com)

All Firewall mitigations are now fully free on @vercel . Not just DDoS and system-level mitigations, but also any rule you configure. Vercel now absorbs the computational and network costs of any size of attack or traffic mitigation for y

by (Guillermo Rauch) · backlist 2026-05-19 · rubric 92.0
61.

Excited to share our new paper: Continuous Diffusion Scales Competitively with Discrete Diffusion for Language We introduce RePlaid , a continuous diffusion language model (DLM) with Discrete likelihood bound Scaling laws competitive with

by (Zhihan Yang) · backlist 2026-05-19 · rubric 91.0
71.

(x.com)

oMLX 0.3.9rc1 released. Highlights: - Low-memory Macs stay stable instead of getting killed by the OS - DFlash bumped to v0.1.7 (thanks to @bstnxbt 's dflash-mlx). Qwen thinking/GDN fix, Etc. - Chunked prefill. A long prompt no longer bloc

by (Jun Kim) · backlist 2026-05-19 · rubric 90.0
76.

big day of building today We’re now doing RL training on the runtime of our new agent framework The implementation is a loop: run the native agent runtime through real and ambitious economic tasks, trace every step, score behavior agains

by (Axobotl) · backlist 2026-05-19 · rubric 89.0
79.

the report is out!!!!! i want to share the spookiest transcript i read while working on this where an OpenAI model, unprompted, tried to break out of METR infrastructure ;-;

by (Vincent) · backlist 2026-05-19 · rubric 88.0
83.

(x.com)

You have to read this one. We just published a recap into how @wafer_ai pushed @AMD inference performance to a level that’s getting the entire ecosystem’s attention and the results are kind of wild. What makes this story interesting i

by (TensorWave) · backlist 2026-05-19 · rubric 88.0
89.

A (my) Pythia Search Engine find: https://12000. org Algebra, Mathematics, Control Systems, Signal Image Processing, Differential Equations, Simulations and more It goes deep with examples, solutions and it's very interestingly structur

by · backlist 2026-05-19 · rubric 88.0