Backlist — 25 May 2026 UTC

1.

Sequencing a human genome to 30× coverage at home

Every step from saliva collection to 30× sequencing happened in a single room, showing how quickly genomics hardware is moving from labs toward hobbyist workflows

by @SethSHowes (Seth Howes) · backlist 2026-05-25 · rubric 92.0

2.

Building functional DRAM in a backyard shed

A 20-bit memory cell array made with homemade sputtering and lithography tools turns semiconductor fabrication into a plausible garage-scale experiment

by @EmbeddedSDE (Shivam) · backlist 2026-05-25 · rubric 92.0

3.

Handling the same security report: pnpm vs. Bun

The same package-manager security issue got fast advisory/backports from pnpm and silent fixes from Bun, illustrating how disclosure process matters as much as the patch

by @DavidSherret (David Sherret) · backlist 2026-05-25 · rubric 93.0

4.

APKPure distributed Telegram builds with spyware, says report

APKPure shipped Telegram installation packages embedded with a spy framework that collected chats, contacts, files, location, and media, making app-store trust a direct security boundary

by @landiantech (蓝点网) · backlist 2026-05-25 · rubric 91.0

5.

Aalo-X: hardware complete for a 10 MW zero-power criticality test

Aalo says all hardware for a 10 MW zero-power criticality test is complete and fuel is on site pending regulatory approval, a concrete milestone for a new reactor startup

by @MattLoszak (Matt Loszak) · backlist 2026-05-25 · rubric 64.0

6.

High Performance Git, edition 1.1 (t.co)

A free 1.1 edition of High Performance Git turns scattered performance folklore around a ubiquitous tool into a durable reference

by @tnm (Ted Nyman) · backlist 2026-05-25 · rubric 80.0

7.

NanoApps: homebrew apps for the iPod nano 7th generation (t.co)

Custom homebrew apps on iPod nano 7th generation reopen an abandoned consumer device as a small, constrained developer platform

by @freemyipod · backlist 2026-05-25 · rubric 78.0

8.

mKernel: fused compute and communication kernels for multi-node GPUs (t.co)

mKernel fuses compute and communication into persistent GPU kernels across intra- and inter-node systems, attacking a real bottleneck in distributed training

by @ziming_mao (Ziming Mao) · backlist 2026-05-25 · rubric 88.0

9.

Touring an autonomous biology lab generating AI training data (t.co)

Biology model training needs fresh experimental data, and an automated lab running around the clock is a glimpse of how that data is produced

by @OmicsOmicsBlog (Keith Robison) · backlist 2026-05-25 · rubric 82.0

10.

How DeepSeek optimizations could reshape China’s AI hardware stack

DeepSeek-style reductions in HBM demand shift China’s AI stack toward domestic NAND, LPDDR, ASIC, and CPU suppliers rather than US-controlled GPU chokepoints

by @kyleichan (Kyle Chan) · backlist 2026-05-25 · rubric 92.0

11.

Huawei’s τ-scaling claims look like logic splitting, not magic lithography

Logic splitting and packaging, not lithography, are the key techniques behind Huawei’s 1.4 nm-equivalent roadmap claims

by @IanCutress (𝐷𝑟. 𝐼𝑎𝑛 𝐶𝑢𝑡𝑟𝑒𝑠𝑠) · backlist 2026-05-25 · rubric 42.0

12.

AlphaProof Nexus solves open Erdős problems (x.com)

DeepMind’s AlphaProof Nexus solved nine open Erdős problems with an agentic formal-proof-search framework, pushing AI math from demos toward open-problem work

by @pushmeet (Pushmeet Kohli) · backlist 2026-05-25 · rubric 83.0

13.

A homebuilt processor counts to 100 and halts

A homebuilt computer reaching a count-to-100 program proves the ALU, registers, branches, jumps, GPIO, SRAM, bootloader, and program counter all work together

by @BreakingTaps · backlist 2026-05-25 · rubric 94.0

14.

How flow cytometry works

Flow cytometry is one of biology’s core measurement techniques, and a clear explainer makes the machinery behind cell sorting and profiling legible

by @zaffagg3 (Gabriele Zaffagnini) · backlist 2026-05-25 · rubric 76.0

15.

Autonomous feedback control in synthetic receptor systems (t.co)

A synthetic miRNA feedback loop that represses its own synNotch receptor points toward programmable, self-regulating cell therapies

by @tfadgreef (Tom de Greef) · backlist 2026-05-25 · rubric 74.0

16.

Big Tech issued $159B in bonds this year for AI data centers

Amazon, Meta, Alphabet, and Oracle issued $159B in bonds this year for AI data centers, including $50B in foreign-currency reverse Yankees

by @trevornoren (Trevor Noren) · backlist 2026-05-25 · rubric 72.0

17.

Figma’s Q1 numbers after the AI-design-tools panic

Figma’s Q1 report—46% revenue growth, 139% NDR, raised guidance—counters the story that AI design tools or Adobe limbo had stalled the company

by @bill_kerrrrr (Bill Kerr) · backlist 2026-05-25 · rubric 81.0

18.

The Sisyphean Pursuit of Evidence for Poverty Traps (x.com)

A development-economics working paper revisits the cleanest evidence for poverty traps and finds the central idea harder to pin down than the textbook story suggests

by @deankarlan (Dean Karlan) · backlist 2026-05-25 · rubric 64.0

19.

Rechecking the “AI uses a bottle of water” statistic

A detailed critique of the AI water-use statistic matters because infrastructure debates can be derailed by a single bad denominator

by @AndyMasley (Andy Masley) · backlist 2026-05-25 · rubric 84.0

20.

.rrd: a data format for robot learning logs

Robot-learning datasets are multi-rate and multimodal by default, and .rrd is designed around that reality rather than treating logs as ordinary videos or tables

by @rerundotio (Rerun) · backlist 2026-05-25 · rubric 93.0

21.

Geolocating a security camera from a single CCTV frame

A single low-context CCTV frame can identify the house a camera network belongs to, collapsing the privacy boundary between public screenshots and physical addresses

by @heinenbros (Daniel Heinen) · backlist 2026-05-25 · rubric 72.0

22.

Schedule-free spectral optimization for language-model training

Schedule-free spectral optimization matching or beating heavily tuned AdamW across 125M and 772M parameter language models hints at simpler training recipes

by @HessianFree (Omead Pooladzandi) · backlist 2026-05-25 · rubric 96.0

23.

TanStack Virtual adds first-class chat support

Chat UIs stress virtualized lists in unusual ways, and first-class end anchoring, streaming, and stable prepends make that complexity boring

by @tan_stack (TANSTACK) · backlist 2026-05-25 · rubric 82.0

24.

A core use-after-free in Linux epoll

A rare use-after-free in Linux’s epoll subsystem is the kind of core kernel bug worth studying because the primitive sits under enormous amounts of production software

by @Shiftreduce (Shift) · backlist 2026-05-25 · rubric 86.0

25.

How a junket to Japan helped create the Paris RER

Paris’s RER became one of the world’s great regional rail systems partly through a Japan study trip, a reminder that infrastructure progress often travels through imitation

by @Infrastory_ (Infrastory) · backlist 2026-05-25 · rubric 61.0

26.

New practical demonstrations at Interface Craft (t.co)

Practical demonstrations of a library-card designer and a new control make interaction design concrete instead of another gallery of static screenshots

by @joshpuckett · backlist 2026-05-25 · rubric 90.0

27.

A compact additive-combinatorics puzzle from Tim Gowers

A compact additive-combinatorics question about ternary and base-4 digit restrictions can pull mathematicians and programmers into the same problem

by @wtgowers (Timothy Gowers @wtgowers) · backlist 2026-05-25 · rubric 18.0

28.

Jira as computation (t.co)

A deep essay on Jira as computation reframes a disliked enterprise tool as a system with state, transitions, and social semantics

by @badlogicgames (Mario Zechner) · backlist 2026-05-25 · rubric 84.0

29.

LIFT is the SFT recipe for dLLMs that actually understands the masking dynamics. Vanilla SFT on dLLMs often HURTS…

LIFT is the SFT recipe for dLLMs that actually understands the masking dynamics. Vanilla SFT on dLLMs often HURTS performance, and they finally pin down why. Their analysis: vanilla SFT overlooks learnability. Rare tokens are difficult to

by @sheriyuo (Xiuyu Li) · backlist 2026-05-25 · rubric 96.0

30.

Reinforcement learning research with Joseph Suarez

by @jsuarez (Joseph Suarez) · backlist 2026-05-25 · rubric 96.0

31.

Approximate Machine Unlearning through Manifold Representation Forgetting Guided by Self Mode Connectivity

by @chaumian (alon turing) · backlist 2026-05-25 · rubric 96.0

32.

MiniCPM5-1B is now fully open source, including weights, training data, and deployment code. 1B params, #1 on Art… (t.co)

MiniCPM5-1B is now fully open source, including weights, training data, and deployment code. 1B params, #1 on Artificial Analysis among all open models under 2B (17.9 pts). https://modelscope.cn/models/OpenBMB/MiniCPM5-1B … Beats Qwen3

by @ModelScope2022 (ModelScope) · backlist 2026-05-25 · rubric 96.0

33.

Why is the KV cache one of the main reasons LLMs are fast?

Why is the KV cache one of the main reasons LLMs are fast? The KV cache is what connects the attention mechanism with the generation stage of autoregressive models. These models generate text token by token, but each new token still attends to all previou

by @TheTuringPost (Turing Post) · backlist 2026-05-25 · rubric 96.0
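
The KV-cache point above can be made concrete with a toy decoder. This is a minimal sketch, not code from the linked post: `project` is a hypothetical stand-in for the learned key and value projections, and the token vectors are made up. It counts how many projections each strategy performs while producing identical attention outputs.

```python
import math

def project(vec, scale):
    """Stand-in for a learned key or value projection (assumption: plain scaling)."""
    return [scale * x for x in vec]

def attend(query, keys, values):
    """Scaled dot-product attention for one query over all positions so far."""
    d = len(query)
    scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(d) for key in keys]
    peak = max(scores)  # subtract the max for a numerically stable softmax
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    return [sum(w * v[i] for w, v in zip(weights, values)) / total for i in range(d)]

def decode_without_cache(tokens):
    """Re-project the whole prefix at every step: quadratic projection work."""
    outputs, projections = [], 0
    for t, tok in enumerate(tokens):
        keys = [project(x, 0.5) for x in tokens[: t + 1]]
        values = [project(x, 2.0) for x in tokens[: t + 1]]
        projections += 2 * (t + 1)
        outputs.append(attend(tok, keys, values))
    return outputs, projections

def decode_with_cache(tokens):
    """Project each token once and keep its key and value: linear projection work."""
    keys, values, outputs, projections = [], [], [], 0
    for tok in tokens:
        keys.append(project(tok, 0.5))
        values.append(project(tok, 2.0))
        projections += 2
        outputs.append(attend(tok, keys, values))
    return outputs, projections

tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5]]
slow_out, slow_cost = decode_without_cache(tokens)
fast_out, fast_cost = decode_with_cache(tokens)
```

Each new token still attends to every earlier position in both versions; the cache only removes the repeated key and value projections for the prefix.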

34.

RL has largely been a consumer of a deep learning toolkit that was developed for supervised learning. In our rece…

RL has largely been a consumer of a deep learning toolkit that was developed for supervised learning. In our recent work we explore RL specific hierarchical state representations that allow agents to overcome issues with low quality demonst

by @j_foerst (Jakob Foerster) · backlist 2026-05-25 · rubric 95.0

35.

LLMs can extract recurring reasoning fragments from prior traces and compile them into a concise reusable behavio… (t.co)

LLMs can extract recurring reasoning fragments from prior traces and compile them into a concise reusable behavior handbook, used in-context at inference or distilled into the model (Meta, Mila-Quebec AI Institute, Princeton University) h

by @hardikmittal_vc (hardy) · backlist 2026-05-25 · rubric 93.0

36.

launching AgentIR Blackbox (t.co)

launching AgentIR Blackbox (https://agentir.dev), an llm request router for agent systems. Blackbox finds which llm calls are on your workflow’s critical path, sends them to faster providers, and routes less urgent calls cheaper to maintain

by @krishmodi404 (Krish Modi) · backlist 2026-05-25 · rubric 92.0
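
The critical-path idea in the item above can be sketched as a longest-path computation over a dependency graph of calls. This is an illustration only, not AgentIR's API: the function names, the call names, the latencies, and the "fast"/"cheap" tiers are all hypothetical.

```python
from graphlib import TopologicalSorter

def critical_path(latency, deps):
    """latency: {call: seconds}; deps: {call: set of calls it must wait for}."""
    finish, prev = {}, {}
    for call in TopologicalSorter(deps).static_order():
        # A call starts when its slowest dependency has finished.
        slowest = max(deps.get(call, ()), key=lambda d: finish[d], default=None)
        start = finish[slowest] if slowest is not None else 0.0
        finish[call] = start + latency[call]
        prev[call] = slowest
    node = max(finish, key=finish.get)
    total = finish[node]
    path = []
    while node is not None:
        path.append(node)
        node = prev[node]
    return path[::-1], total

def route(latency, deps):
    """Hypothetical policy: critical-path calls go 'fast', the rest go 'cheap'."""
    path, total = critical_path(latency, deps)
    on_path = set(path)
    tiers = {call: ("fast" if call in on_path else "cheap") for call in latency}
    return tiers, total

# Made-up workflow: 'lint' has slack, so slowing it down does not delay 'answer'.
latency = {"plan": 2.0, "search": 1.0, "summarize": 3.0, "lint": 0.5, "answer": 1.5}
deps = {
    "search": {"plan"},
    "summarize": {"search"},
    "lint": {"plan"},
    "answer": {"summarize", "lint"},
}
path, total = critical_path(latency, deps)
tiers, _ = route(latency, deps)
```

Only calls on the longest chain determine end-to-end latency, which is why the off-path call can be sent to a slower, cheaper provider without changing the total.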

37.

AgentMail now supports IMAP.

AgentMail now supports IMAP. Open your agent's inbox in any email client. Debug by just looking at it. You could already send. Now you can read too. Documentation in replies.

by @agentmail (AgentMail (YC S25)) · backlist 2026-05-25 · rubric 92.0

38.

While (x.com)

While @SpaceX was launching rockets we were using @Starlink to remotely inference our excavator robot model that we trained with 2.5 hours of operator data. We are teaching heavy machines to do real tasks on job sites by learning from

by @laneburgett (Lane Burgett) · backlist 2026-05-25 · rubric 92.0

39.

Long-horizon LLM agents accumulate conversation histories that blow past the context window. The usual fix is LLM…

Long-horizon LLM agents accumulate conversation histories that blow past the context window. The usual fix is LLM-based summarization, which is lossy AND blocks the agent for tens of seconds while the summarizer runs. Parallel Context Comp

by @sheriyuo (Xiuyu Li) · backlist 2026-05-25 · rubric 92.0

40.

So software built with (x.com)

So software built with @zml_ai runs transparently at max speed on CPU, NVIDIA GPUs, AMD GPUs, Google TPUs, AWS Trainium, Intel GPUs, Tenstorrent NPUs, and Apple GPUs (very experimental). And more to come.

by @steeve (Steeve Morin) · backlist 2026-05-25 · rubric 92.0

41.

3.8% for Claude Opus 4.7 and 0.0% for Gemini 3.1 Pro

3.8% for Claude Opus 4.7 and 0.0% for Gemini 3.1 Pro. SaaS-Bench from UniPat AI just dragged Computer-Use Agent benchmark theater into the cold light. They put 23 real open-source SaaS systems into Docker with full DB state and business con

by @sheriyuo (Xiuyu Li) · backlist 2026-05-25 · rubric 92.0

42.

Phase 2 of my heuristic-learning ImageNet-10 experiment: (x.com)

Phase 2 of my heuristic-learning ImageNet-10 experiment: Inspired by @Trinkle23897 's “Learning Beyond Gradients,” I used Claude Code + Codex to iteratively improve a pure symbolic vision system. No neural nets. No backprop. Just visual

by @learningPikachu (Eason) · backlist 2026-05-25 · rubric 92.0

43.

We heard concerns that Antigravity consumes many tokens for simple tasks now. So, we're adding Gemini 3.5 Flash (…

We heard concerns that Antigravity consumes many tokens for simple tasks now. So, we're adding Gemini 3.5 Flash (Low) as a way to optimize token usage for these tasks. In our internal testing, it generates around 45% fewer tokens than Gemin

by @_mohansolo (Varun Mohan) · backlist 2026-05-25 · rubric 92.0

44.

in theory it can be super fast thanks to speculative decoding, but batching with other normal requests probably s…

in theory it can be super fast thanks to speculative decoding, but batching with other normal requests probably slows it down

by @__morse (Tommy D. Rossi) · backlist 2026-05-25 · rubric 91.0
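
For readers unfamiliar with the speculative decoding mentioned above, here is a minimal greedy sketch. The two "models" are toy functions invented for illustration, and one verification round stands in for a single batched pass of the large model; real systems also handle sampling, which this omits.

```python
def target_next(context):
    """Toy 'large' model: always predicts the next digit mod 10."""
    return (context[-1] + 1) % 10

def draft_next(context):
    """Toy 'small' model: agrees with the target except right after a 2."""
    return 0 if context[-1] == 2 else (context[-1] + 1) % 10

def speculative_decode(prompt, n_new, k=4):
    out, target_passes = list(prompt), 0
    while len(out) - len(prompt) < n_new:
        # 1. The draft model cheaply proposes k tokens.
        proposal = []
        for _ in range(k):
            proposal.append(draft_next(out + proposal))
        # 2. One target pass checks every proposed position.
        target_passes += 1
        accepted = []
        for tok in proposal:
            expected = target_next(out + accepted)
            if tok != expected:
                accepted.append(expected)  # keep the target's token, drop the rest
                break
            accepted.append(tok)
        else:
            # Every proposal matched, so the same pass yields one extra token.
            accepted.append(target_next(out + accepted))
        out.extend(accepted)
    return out[: len(prompt) + n_new], target_passes

def greedy_decode(prompt, n_new):
    out = list(prompt)
    for _ in range(n_new):
        out.append(target_next(out))
    return out

fast, passes = speculative_decode([0], 8)
slow = greedy_decode([0], 8)
```

The output matches plain greedy decoding of the target model, but the target runs in fewer rounds. How much of that gain survives batching with ordinary requests depends on the serving stack, which this sketch does not model.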

45.

Bridge + (x.com)

Bridge + @Tiny_Fish in action: Use one request to sign in to Hacker News, capture the session, and ask for account context like current karma. TinyFish powers part of the web-agent flow; Bridge connects it back to your desktop. #AIAgen

by @bridge_surf (Bridge) · backlist 2026-05-25 · rubric 91.0

46.

LLMs are trained on web data. Physical AI is trained on physical data. Physical data is different from web data.

LLMs are trained on web data. Physical AI is trained on physical data. Physical data is different from web data. It is multi-rate: cameras might run at 10-30Hz, joint angles at 100-200Hz, and IMUs at 1kHz. It is also multimodal: one stre

by @rerundotio (Rerun) · backlist 2026-05-25 · rubric 91.0
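
One common way to work with multi-rate streams like those described above is a "latest sample at time t" lookup per stream. The sketch below assumes a 20 Hz camera, 100 Hz joint encoders, and a 1 kHz IMU with integer millisecond timestamps; it is not Rerun's API or the .rrd format, just an illustration of the alignment problem.

```python
from bisect import bisect_right

def latest_at(timestamps_ms, samples, t_ms):
    """Most recent sample with timestamp <= t_ms, or None before the first one."""
    i = bisect_right(timestamps_ms, t_ms)
    return samples[i - 1] if i else None

def make_stream(name, period_ms, duration_ms):
    """Synthetic stream: one labelled sample every period_ms."""
    ts = list(range(0, duration_ms, period_ms))
    return ts, [f"{name}@{t}ms" for t in ts]

streams = {
    "camera": make_stream("camera", 50, 200),  # 20 Hz
    "joints": make_stream("joints", 10, 200),  # 100 Hz
    "imu": make_stream("imu", 1, 200),         # 1 kHz
}

def snapshot(streams, t_ms):
    """Align all streams at one query time without resampling any of them."""
    return {name: latest_at(ts, samples, t_ms) for name, (ts, samples) in streams.items()}

frame = snapshot(streams, 137)
```

Each stream keeps its native rate; only the query decides how they line up, which avoids forcing everything onto one video-style frame clock.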

47.

OpenClaw's dependency purge continues. Killed Sharp and Jimp. Replaced it with photon, a small WebAssembly that r…

OpenClaw's dependency purge continues. Killed Sharp and Jimp. Replaced it with photon, a small WebAssembly that runs compiled Rust for image processing. 2MB vs 140MB.

by @steipete (Peter Steinberger) · backlist 2026-05-25 · rubric 91.0

48.

Introducing SkillOpt — an optimizer for agent skills.

Introducing SkillOpt — an optimizer for agent skills. Instead of finetuning model weights, we treat a natural-language skill as a trainable external parameter. Think of it as deep learning for the frontier-model + agent era: learning rat

by @Yif_Yang (Yifan Yang) · backlist 2026-05-25 · rubric 91.0

49.

Today we noticed Chrome unexpectedly opening Gmail and searching through emails related to us, while Codex was sh…

Today we noticed Chrome unexpectedly opening Gmail and searching through emails related to us, while Codex was shown controlling Chrome from the menu bar. After investigating for a while, we traced the behavior back to the Codex Suggestion

by @bridge_surf (Bridge) · backlist 2026-05-25 · rubric 91.0

50.

It's been a long journey — 6 years and 381 chips to be exact — but HUAWEI's He Tingbo explains how HUAWEI's high-…

It's been a long journey — 6 years and 381 chips to be exact — but HUAWEI's He Tingbo explains how HUAWEI's high-end chips are now expected to feature a transistor density that is equivalent to 14 Å (1.4 nm) processes by 2031.

by @Huawei · backlist 2026-05-25 · rubric 91.0

51.

The (x.com)

The @pnpmjs e2e tests now use a "pnpm registry" instead of verdaccio. In the future we'll make pnpm faster with this registry.

by @zkochan (Zoltan Kochan) · backlist 2026-05-25 · rubric 90.0

52.

Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in …

Grok foundation model V9-Medium (1.5T) has finished training. Evals look good. A lot of Cursor data was added in supplementary training and there is more to come. Fine-tuning is underway and reinforcement learning begins in a few days. 2 t

by @elon_musk34636 (MR ELON) · backlist 2026-05-25 · rubric 90.0

53.

Accepted to TMLR, with reproducibility certification (x.com)

Accepted to TMLR, with reproducibility certification. v2 of our JEPA-WM study (arXiv:2512.24497) is out, with new data-scaling experiments, a Lipschitz analysis of multistep rollout training, and extended discussions. Recap + what's new

by @BasileTerv987 (Basile Terver) · backlist 2026-05-25 · rubric 90.0

54.

Shocking! miHoYo Employee Burns Through 2 Million Yuan Playing with AI in One Night

Shocking! miHoYo Employee Burns Through 2 Million Yuan Playing with AI in One Night. A colleague set up dozens of agents over a weekend and forgot to shut them down, only to discover that they had consumed 2 million RMB in tokens overnight

by @wayen_ai (Wayen) · backlist 2026-05-25 · rubric 90.0

55.

Excited to share that Srivatsa Katta, CTO of Rapido, is keynoting Day 1 of #KubeCon + #CloudNativeCon India (18-1…

Excited to share that Srivatsa Katta, CTO of Rapido, is keynoting Day 1 of #KubeCon + #CloudNativeCon India (18-19 June, Mumbai) Rapido powers 4M+ rides daily across 150+ microservices at 200K req/sec and it runs on the CNCF stack. Srivats

by @CloudNativeFdn (CNCF) · backlist 2026-05-25 · rubric 90.0

56.

making a database index and querying it N times doesn't take N^2 complexity: it takes N + NlogN, which is what sc…

making a database index and querying it N times doesn't take N^2 complexity: it takes N + NlogN, which is what scaled dot product attention should take for a billion token context window. that it's a weighted sum, rather than a lookup, is a

by @CarsonPoole (Carson Poole) · backlist 2026-05-25 · rubric 90.0
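
The database half of the comparison above is easy to check by counting steps. This is a small sketch with made-up data: N lookups by linear scan against N lookups through a sorted index. It says nothing about whether attention can actually be restructured this way.

```python
def scan_lookup(rows, key):
    """Linear scan: up to len(rows) comparisons per query."""
    steps = 0
    for k, value in rows:
        steps += 1
        if k == key:
            return value, steps
    return None, steps

def build_index(rows):
    """Sort once (O(N log N)) so later lookups can binary-search."""
    ordered = sorted(rows)
    return [k for k, _ in ordered], [v for _, v in ordered]

def index_lookup(keys, values, key):
    """Binary search: about log2(N) comparisons per query."""
    lo, hi, steps = 0, len(keys), 0
    while lo < hi:
        mid = (lo + hi) // 2
        steps += 1
        if keys[mid] < key:
            lo = mid + 1
        else:
            hi = mid
    found = lo < len(keys) and keys[lo] == key
    return (values[lo] if found else None), steps

N = 1024
rows = [(k, k * k) for k in reversed(range(N))]  # stored in descending key order
keys, values = build_index(rows)

scan_total = index_total = 0
for q in range(N):
    scanned, s = scan_lookup(rows, q)
    indexed, i = index_lookup(keys, values, q)
    assert scanned == indexed == q * q
    scan_total += s
    index_total += i
```

With N = 1024 the scans add up to N(N+1)/2 comparisons, while the indexed lookups stay within about 11 comparisons each.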

57.

nice write up from the HuggingFace folks aggregating works on defining agents, harnesses, environments, RL, etc. …

nice write up from the HuggingFace folks aggregating works on defining agents, harnesses, environments, RL, etc. The more we can roughly have a shared vocabulary the better…I still find it confusing (lolll), but we’re roughly converging on

by @Vtrivedy10 (Viv) · backlist 2026-05-25 · rubric 89.0

58.

deepseek just permanently priced their frontier model at 1/30th of american labs

deepseek just permanently priced their frontier model at 1/30th of american labs anyone know what the hardware story here is? is this huawei chips driving lower costs or model optimizations or lower margins?

by @agupta (Ankit Gupta) · backlist 2026-05-25 · rubric 89.0

59.

introducing pierrejo

introducing pierrejo for all the brotherin struggling to migrate off of github an open-source fully-functional pierre diff integration for PRs in forgejo consumable via nix flake instantaneous diff loading using pierre SSR enjoy

by @HarivanshRathi (Hari) · backlist 2026-05-25 · rubric 89.0

60.

HyperParallel-MoE is an Ascend-specific scheduling system for MoE training.

HyperParallel-MoE is an Ascend-specific scheduling system for MoE training. Ascend A3 exposes separate AIC matrix units and AIV vector/communication units, but standard MoE execution still runs Dispatch, GMM, SwiGLU, and Combine as seriali

by @gm8xx8 (𝚐𝔪𝟾𝚡𝚡𝟾) · backlist 2026-05-25 · rubric 89.0

61.

DR Tulu is now accepted for an oral presentation at #ICML2026 (t.co)

DR Tulu is now accepted for an oral presentation at #ICML2026. Updated paper: https://arxiv.org/abs/2511.19399 We added more ablations including using Qwen3-8B as the rubric generator & judge, showing evolving rubrics work with a weak mod

by @RulinShao (Rulin Shao) · backlist 2026-05-25 · rubric 88.0

62.

in traces v0.6.0 you can

in traces v0.6.0 you can: search your local sessions across agents; launch any of the sessions you see directly in the agent cli (currently mac only); view sessions from other people in the orgs you're in, directly from the terminal. I

by @tarunsachdeva (Tarun Sachdeva) · backlist 2026-05-25 · rubric 88.0

63.

Updated my MLX Vulkan CI to record and show benchmark result on every commit to main!

by @gonizahavy (goniz) · backlist 2026-05-25 · rubric 88.0

64.

This is really really bad, found another severe vulnerability in CBSE's OSM portal. Just sent another report to C…

This is really really bad, found another severe vulnerability in CBSE's OSM portal. Just sent another report to CERT-In.

by @ni5arga (nisarga) · backlist 2026-05-25 · rubric 88.0

65.

So i've been thinking about an LVR-aware analyst AI agent to trade in some base AMM pool. There is a fairly new p…

So i've been thinking about an LVR-aware analyst AI agent to trade in some base AMM pool. There is a fairly new paper i would like to try and implement. The most common pattern for on-chain trading agents is LLM + a price feed + a swap ro

by @paoloanzn (4nzn) · backlist 2026-05-25 · rubric 88.0

66.

Lots of Blackwell specific PTX techniques to find here for the interested :)

by @Simon_Vt (Simon V) · backlist 2026-05-25 · rubric 88.0

67.

Finally advisories for vulns that I found in dwmcore.dll were fixed (CVE-2026-34336, CVE-2026-35419). However, fo…

Finally advisories for vulns that I found in dwmcore.dll were fixed (CVE-2026-34336, CVE-2026-35419). However, for CVE-2026-34336 list of CWE is not accurate cause heap-based overflow is possible due to integer overflow and integer overflow

by @immortalp0ny · backlist 2026-05-25 · rubric 88.0
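
The causal chain described above, where an integer overflow leads to a heap-based overflow, follows a well-known general pattern. The sketch below simulates 32-bit size arithmetic in Python; it is a generic illustration with made-up numbers, not the dwmcore.dll code or either CVE.

```python
MASK32 = 0xFFFFFFFF

def alloc_size_unchecked(count, elem_size):
    """What a 32-bit multiply computes: bits above bit 31 are silently dropped."""
    return (count * elem_size) & MASK32

def alloc_size_checked(count, elem_size):
    """Reject sizes that do not fit before any allocation happens."""
    size = count * elem_size
    if size > MASK32:
        raise OverflowError("allocation size does not fit in 32 bits")
    return size

# Hypothetical attacker-controlled element count.
count, elem_size = 0x40000001, 4
allocated = alloc_size_unchecked(count, elem_size)  # size the buffer would get
needed = count * elem_size                          # bytes the copy loop would write
```

The wrapped size produces a tiny buffer while the later copy still iterates over the full count, so the integer overflow is the root cause and the heap overflow is its consequence.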

68.

"use workflow" handles this (t.co)

"use workflow" handles this http://workflow-sdk.dev

by @ErfanEbrahimnia (Erfan) · backlist 2026-05-25 · rubric 88.0

69.

Reproduced HRM-Text XL (1B). (t.co)

Reproduced HRM-Text XL (1B). Training completed in ~38 hours wall-clock on 16 H200 GPUs, and evaluation performance matches the numbers reported in the paper. Great job, team! W&B report: https://api.wandb.ai/links/MDEQGA/70ciyctr …

by @huskydogewoof (Benhao Huang) · backlist 2026-05-25 · rubric 88.0

70.

Awesome job (x.com)

Awesome job @jesse_merhi sharing, writing and cranking... while helping secure us all > How I Accidentally Ended Up Helping Secure OpenClaw https://jmerhi.mov/blog/dangerous-crustacean/ … and https://openclaw.ai/blog/where-ope n

by @mcannonbrookes (Mike Cannon-Brookes) · backlist 2026-05-25 · rubric 88.0

71.

this weekend's obsession, the amazing `cmux` by (x.com)

this weekend's obsession, the amazing `cmux` by @lawrencecchen It's the perfect agentic engineering Integrated Development Environment! My set up: - Each Workspace in the left navbar is a project folder - Each workspace is split into at

by @sojoodi · backlist 2026-05-25 · rubric 88.0

72.

I am working on Online RL.

I am working on Online RL. MolmoACT2 deploys out of the box but fails at many tasks due to its data distribution. One really interesting insight would be to have a setup working where you can deploy this good enough base policy and make it

by @vruga_ (vrushtee) · backlist 2026-05-25 · rubric 88.0

73.

introducing pipi, the shitty robot. (t.co)

introducing pipi, the shitty robot. brain lives on my laptop, sensors/UI live on the mounted phone. time to completion: 24h (minus sleep, knight festival, lunch, dinner, and play) built with http://pi.dev

by @badlogicgames (Mario Zechner) · backlist 2026-05-25 · rubric 88.0

74.

1/ Weak LLMs generate correct solutions in their latent space all the time—they just fail to select them.

1/ Weak LLMs generate correct solutions in their latent space all the time—they just fail to select them. A new paper proves that wrapping a nano-sized model in a structured critic-comparator harness matches frontier giants on SWE-bench.

by @che_shr_cat (Grigory Sapunov) · backlist 2026-05-25 · rubric 88.0

75.

The architecture under it is genius

The architecture under it is genius Works with every cli, OS, one or multiple "agents" seamlessly which was also the point and how I like to build things The more I'm trying the new version (released 30m ago) the more fun it is I unified

by @wavefnx · backlist 2026-05-25 · rubric 87.0

76.

why do LLMs generate long duplicate strings of text so slowly?

why do LLMs generate long duplicate strings of text so slowly? consider an LLM outputting 100 x's, if you ask it to do that 10 times it's a linear amount of time to generate, when instead the LLM could just reference that symbol 10 times

by @RhysSullivan (Rhys) · backlist 2026-05-25 · rubric 87.0
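
The "reference that symbol" idea above resembles back-references in dictionary compression. The sketch below uses a hypothetical output format in which a model emits either literal text or a reference to an earlier literal; no current LLM API is known to work this way.

```python
def expand(ops):
    """Expand ('lit', text) and ('ref', i) ops; a ref repeats the i-th literal."""
    literals, out = [], []
    for kind, arg in ops:
        if kind == "lit":
            literals.append(arg)
            out.append(arg)
        elif kind == "ref":
            out.append(literals[arg])
        else:
            raise ValueError(f"unknown op: {kind}")
    return "".join(out)

def emitted_units(ops):
    """Cost model (an assumption): one unit per literal character, one per reference."""
    return sum(len(arg) if kind == "lit" else 1 for kind, arg in ops)

# 100 x's written once, then referenced nine more times.
ops = [("lit", "x" * 100)] + [("ref", 0)] * 9
text = expand(ops)
cost = emitted_units(ops)
```

Under this cost model the ten repetitions take 109 emitted units instead of 1000, which is the saving the post is pointing at.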

77.

Really amazing results analyzing what's creative/novel vs. what's copied from Internet data, enabled by the amazing (x.com)

Really amazing results analyzing what's creative/novel vs. what's copied from Internet data, enabled by the amazing @liujc1998 's Infini-gram! http://infini-gram.io This is also enabled in @allen_ai 's OlmoTrace http://allenai.org/b

by @sewon__min (Sewon Min) · backlist 2026-05-25 · rubric 86.0

78.

GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction

GenRecon: Bridging Generative Priors for Multi-View 3D Scene Reconstruction Reconstructing high-fidelity 3D scenes from sparse RGB input is hard. It needs a strong 3D prior! We reformulate multi-view scene reconstruction as conditional 3D

by @MattNiessner (Matthias Niessner) · backlist 2026-05-25 · rubric 86.0

79.

EVE-Agent argues self-evolving search agents should not train on examples they cannot justify. Data-free self-evo…

EVE-Agent argues self-evolving search agents should not train on examples they cannot justify. Data-free self-evolving search agents generate their own questions, answer them, and improve from their own feedback. That scales beautifully wit

by @sheriyuo (Xiuyu Li) · backlist 2026-05-25 · rubric 86.0

80.

Agents work well with static types, schemas, state machines, etc. Keep deterministic what should be deterministic.

Agents work well with static types, schemas, state machines, etc. Keep deterministic what should be deterministic. Stop trying to make everything non-deterministic.

by @DavidKPiano (David K) · backlist 2026-05-25 · rubric 86.0
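
A minimal sketch of the advice above: let the model propose the next step, but let a fixed state machine decide whether that step is allowed. The states, the transition table, and the `advance` helper are made up for illustration and do not come from any particular library.

```python
from enum import Enum

class State(Enum):
    DRAFT = "draft"
    REVIEW = "review"
    APPROVED = "approved"
    REJECTED = "rejected"

# Deterministic rules: the only transitions the workflow permits.
TRANSITIONS = {
    State.DRAFT: {State.REVIEW},
    State.REVIEW: {State.APPROVED, State.REJECTED, State.DRAFT},
    State.APPROVED: set(),
    State.REJECTED: {State.DRAFT},
}

def advance(current, proposed_name):
    """Validate an agent's proposed next state; fail loudly instead of guessing."""
    try:
        proposed = State(proposed_name)
    except ValueError:
        raise ValueError(f"unknown state: {proposed_name!r}") from None
    if proposed not in TRANSITIONS[current]:
        raise ValueError(f"illegal transition: {current.value} -> {proposed.value}")
    return proposed

state = State.DRAFT
for suggestion in ["review", "approved"]:  # stand-ins for model output
    state = advance(state, suggestion)
```

The non-deterministic part is confined to choosing among legal moves; anything outside the table is rejected before it can change the workflow state.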

81.

This incident is unrelated to Squid’s core protocol and contracts. All Squid users and integrators are unaffected…

This incident is unrelated to Squid’s core protocol and contracts. All Squid users and integrators are unaffected and no action is needed. A third-party Gnosis Safe module was exploited today across Base and Ethereum, resulting in approxim

by @squidrouter (squid) · backlist 2026-05-25 · rubric 86.0

82.

Forged parts beat cast parts on strength for one reason: grain flow.

Forged parts beat cast parts on strength for one reason: grain flow. Casting solidifies from liquid → grain is random. Forging deforms solid metal → grain follows the shape. In a crankshaft, the grain runs exactly where the stress is. T

by @maahirpanchal (Maahir Panchal) · backlist 2026-05-25 · rubric 86.0

83.

New project: a coding and formal verification agent for computational physics and applied mathematics.

New project: a coding and formal verification agent for computational physics and applied mathematics. Auto-generate type-correct DSL code for equations and numerical schemes, autoformalize correctness properties in Lean/Isabelle/Rocq, then

by @getjonwithit (Jonathan Gorard) · backlist 2026-05-25 · rubric 86.0

84.

This graph from the NLA paper, imo, provides pretty convincing evidence that activation verbalizers surface unve…

This graph from the NLA paper, imo, provides pretty convincing evidence that activation verbalizers surface unverbalized eval awareness. It is also crazy. Notice how the verbalized eval awareness dot is offset only when it's significantly

by @Tim_Hua_ (Tim Hua) · backlist 2026-05-25 · rubric 86.0

85.

Building some extremely cool stuff on top of Cloudflare Dynamic Workers + Sandboxes + Artifacts. Release with blo…

Building some extremely cool stuff on top of Cloudflare Dynamic Workers + Sandboxes + Artifacts. Release with blog post soon.

by @Vercantez (Miguel Salinas) · backlist 2026-05-25 · rubric 85.0

86.

We just released code and model! Go check it out! (t.co)

We just released code and model! Go check it out! Code: https://github.com/nv-dvl/vgg-ttt Model: https://huggingface.co/nvidia/vgg-ttt

by @s_elflein (Sven Elflein) · backlist 2026-05-25 · rubric 85.0

87.

audio reactivity tests with ASCII fluid sim. going to take a lot of work to tune emitters and parameters so it ac…

audio reactivity tests with ASCII fluid sim. going to take a lot of work to tune emitters and parameters so it actually looks good.

by @codetaur (Codetaur) · backlist 2026-05-25 · rubric 84.0

88.

Agora is about an order of magnitude faster than the system that powered Node-0. 175k tok/s is fast.

by @AlexanderLong (Alexander Long) · backlist 2026-05-25 · rubric 84.0

89.

Today, I'm working on trying to better understand how coding assistants mention HF's products. (x.com)

Today, I'm working on trying to better understand how coding assistants mention HF's products. Taking a simple approach of running tons of queries and analyzing the answers with @DAKlingbeil 's https://submarine.ai (ex: https://huggi

by @ClementDelangue (clem) · backlist 2026-05-25 · rubric 84.0

90.

Crazy how quickly my workflow transitioned from

Crazy how quickly my workflow transitioned from local + cursor + claude code to exe dev + tmux + zed + codex. Devtools have no lock-in

by @gregpr07 (Gregor Zunic) · backlist 2026-05-25 · rubric 84.0