Backlist — 26 May 2026 UTC

Top 90 curated tweets ranked for substance on 26 May 2026 UTC.

31.

(x.com)

@BowenWangNLP et al. dropped 32,122 verifiable rlvr tasks for training cua agents which is about 87x of osworld tasks. large enough to experiment some cua rl scaling

by (Guohao Li ) · backlist 2026-05-26 · rubric 96.0
51.

(t.co)

Introducing Preprint what if browser use could be just text? a research experiment which exposes web pages as text files to LLMs - Which they can edit to make actions, type, tap, etc. https:// github.com/supermemoryai/ preprint …

by · backlist 2026-05-26 · rubric 92.0
55.

(x.com)

Are we nearing a compute crunch? In our latest Gradient Update, @luke__emberson and @Jsevillamol estimate how many tokens all the Blackwell chips on Earth could serve, and compare this to total token demand. Direct comparisons are diff

by (Epoch AI) · backlist 2026-05-26 · rubric 91.0
70.

Building a Speculative Decoding Inference speculative decoding (sds) is when a small "draft" model predicts multiple tokens fast, then a big "target" model verifies them all at once. if done right, you get ~2x faster generation without any

by (mohit) · backlist 2026-05-26 · rubric 90.0
81.

(x.com)

Building on @nilinabra 's Soft Muon idea, I found a set of polynomials you can use to compute UΣᵖVᵀ accurately for |p| < 0.9 as efficiently as Newton-Schulz/Polar Express. check it out!

by (varun) · backlist 2026-05-26 · rubric 88.0
82.

(x.com)

i got excited when i saw @Nick_Prince12 post so i asked my agent something similar.... a US economy snapshot report based on @michaeljburry substack vs status of live US market stats & where things stand now. i let my agent use followi

by (Gega Tsurtsumia) · backlist 2026-05-26 · rubric 88.0
88.

(x.com)

"Don't trust. Evaluate." @nearestnabors set out to replace Claude Sonnet with Gemma 4. The evals showed a quantifiably better option. Full walkthrough: capability evals + prompt engineering to ship a local 3B that matches Sonnet, 2x fas

by (arize-phoenix) · backlist 2026-05-26 · rubric 88.0