Backlist — 22 May 2026 UTC

Top 90 curated tweets ranked for substance on 22 May 2026 UTC.

31.

Agents deserve Servers, not Computers. Every agent is getting a computer (whether it's localhost or a sandbox) as part of their harness. Evals like TerminalBench even assume the presence of a computer! It's been a long time coming: a comput

by (Stanislas Polu) · backlist 2026-05-22 · rubric 97.0
33.

Excited to share that @modal is supporting Stanford CS321M: AI Measurement Science with compute for class assignments, student projects, and GPU scoring infrastructure for the Predictive AI Evaluation Challenge.

by (Sanmi Koyejo) · backlist 2026-05-22 · rubric 96.0
37.

Claude Sonnet 4.8 Leaks - Anthropic accidentally shipped a massive 512,000-line internal debugging source map through a Claude Code npm update on March 31, 2026 - The leaked source code references Sonnet 4.8 inside unreleased keyword filte

by (Pankaj Kumar) · backlist 2026-05-22 · rubric 93.0
39.

Extremely proud of the team @cartesia for launching Sonic 3.5, which sets a new state of the art for TTS. I personally led the technical direction of this model; we built it from the ground up from first principles, and it contains multiple non-tr

by (Albert Gu) · backlist 2026-05-22 · rubric 92.0
40.

Wow! 4 MacBooks serving 40+ tok/s on a 230B-param model. Hmmmm, I thought the gatekeepers said this isn't possible.

by · backlist 2026-05-22 · rubric 92.0
44.

MoE (8): Enforcing Sequence-Level Balance https://kexue.fm/archives/11760 This article explores how to achieve sequence-level load balancing without incurring any loss penalty. Starting from the original Quantile Balancing (QB), we gradu

by (jianlin.su) · backlist 2026-05-22 · rubric 92.0
46.

New from NVIDIA! You can edit a model’s compressed memory without scrambling what it already knows! Enter Gated DeltaNet-2. It separates the erase and write operations in linear attention using two independent gates – one for forgetting

by (机器之心 JIQIZHIXIN) · backlist 2026-05-22 · rubric 92.0
49.

New paper! Post-training doesn't build the Assistant; it just turns up the volume on personas that pretraining already laid down, at 0.22% of total tokens! We traced them across OLMo-3 and Apertus; here's what we found

by (Viktor Moskvoretskii) · backlist 2026-05-22 · rubric 91.0
53.

We discover the Asymmetric Roles of Data Gating and Reward Grounding in Self-Play RL: data gating, not reward grounding, is the binding constraint on stability. A strict gate stabiliz

by (Xin Eric Wang (hiring postdoc)) · backlist 2026-05-22 · rubric 90.0
62.

Big docs update for @Cloudflare MCP Server Portals heading into this Memorial Day Weekend -- troubleshooting, service token auth, tool policies, DLP, Terraform, API reference, architecture docs, and more. Nearly all of it came from user fe

by (Kenny Johnson) · backlist 2026-05-22 · rubric 88.0
63.

this AI UGC video was made for under $1... my V3 system has officially killed the UGC industry, and i mean it. this video genuinely cost barely a dollar to make and no, it's not Seedance 2.0 or any model you've seen before for the longe

by (Miko) · backlist 2026-05-22 · rubric 88.0
66.

Excited to share that I will be joining @amazon this summer as an Applied Science Intern! I will be working with the @amazonquick team on improved reliability in multi-agent systems. If you are in Seattle this summer, I would love t

by (Arnav Goel) · backlist 2026-05-22 · rubric 88.0
73.

Earlier this week we confirmed @Lighter_xyz's desert verifier reproduces byte-exact from public source. Today, we explain what that actually means for exchange risk. Every centralized venue runs on one trust assumption: you trust the ve

by (L2BEAT) · backlist 2026-05-22 · rubric 88.0
76.

Happy Friday — one more thing: We’ve open-sourced OpenBridge, a local-first / BYOK version of @bridge_surf and our Computer Use stack. You can now run the full computer use system locally with your own models and API keys — with complet

by (Bridge) · backlist 2026-05-22 · rubric 88.0
77.

https://arxiv.org/abs/2605.22769 Could it be better to pretrain on temporally ordered data? It could bias the model towards recent information. I have wondered when information is updated or changed over time whether the model is able to

by (Rosinality) · backlist 2026-05-22 · rubric 88.0
78.

After the recent upgrade, @manaflowai (cmux) is completely unusable. I have about 12 tabs with 3 actively running ~2 Claude sessions, and it stutters to a complete stop after 10~15 minutes. If I ignore it, it freezes my maxxed out M3. cc @aus

by (TJ) · backlist 2026-05-22 · rubric 88.0
82.

Hurricane process. Zero shader code written, all base layers in Unicorn. Lots of craft went into this one. Live demo: https://unicorn.studio/embed/bG5xs8kLK1bwLKAiEWY9?controls=1 …

by (George Hastings) · backlist 2026-05-22 · rubric 87.0
84.

[[ M 1/4 ]] things achieved > basic setup and docker compose starts all 8 services > transfer endpoint commits cross shard transactions > holds around 1000 + transactions > benchmarked our latency numbers which further we will use in ou

by (Mrinal) · backlist 2026-05-22 · rubric 87.0