Backlist — 23 Jun 2026 UTC

1.

Proto: a programming language for generative biology

It turns DNA, RNA, proteins, ligands, and their interactions into composable building blocks for designing biological functions

by @BrianHie (Brian Hie) · backlist 2026-06-23 · rubric 98.5

2.

KENSAT: an LLM running in orbit

A CubeSat with on-board inference, RF downlink, and full schematics pushes agents into space

by @kenchangh (ken) · backlist 2026-06-23 · rubric 100.0

3.

ast-grep gets an outline view for code structure

Syntax-aware structural navigation becomes a local primitive between grep and a full language server

by @hd_nvim (Herrington Darkholme) · backlist 2026-06-23 · rubric 100.0

4.

Vite 8.1 ships a much faster dev server

The release cuts startup and reload times again while adding stable chunk maps and WASM ESM support

by @vite_js (Vite ) · backlist 2026-06-23 · rubric 100.0

5.

Proxie Gen2 targets autonomous robots in the real world

The project is aimed at machines that move and manipulate in real environments alongside people

by @BradPorter_ (Brad Porter) · backlist 2026-06-23 · rubric 75.3

6.

Linear is moving to Stylex (x.com)

Codemods and agents are doing the bulk of a real framework switch while humans handle the cursed edge cases

by @kenneth_skovhus (Kenneth Skovhus) · backlist 2026-06-23 · rubric 83.0

7.

GPT-5.5-Cyber gets a real benchmark

A public cybersecurity observatory and reproducible tests are a better standard than leaderboard theater

by @dawnsongtweets (Dawn Song) · backlist 2026-06-23 · rubric 65.0

8.

Arbor adds explicit geometry control to text-to-3D (x.com)

Typed constraints like hull, avoidance, and touch make generated assets much closer to usable objects

by @JDihlmann (Jan) · backlist 2026-06-23 · rubric 100.0

9.

Replacing labeled datasets with generated contours

Synthetic contours keep pushing how far vision pretraining can go without natural-image labels

by @HirokatuKataoka (Hirokatsu Kataoka | 片岡裕雄) · backlist 2026-06-23 · rubric 100.0

10.

Multi-vector embeddings are more expressive than single vectors

The proof says approximating multi-vector similarity with one vector needs exponentially more dimensions

by @antoine_chaffin (Antoine Chaffin) · backlist 2026-06-23 · rubric 92.5

11.

Value functions may encode a full world model (x.com)

Inverting Bellman equations gives a concrete route from rewards back to latent environment structure

by @jonathanrichens (Jon Richens) · backlist 2026-06-23 · rubric 93.5

12.

BenchPLM benchmarks protein language model representations

The paper asks which protein models actually learn reusable representations for downstream biology

by @try_litefold (LiteFold) · backlist 2026-06-23 · rubric 100.0

13.

ASI wins an $875M FAA software contract

A major FAA award shows governments still buy serious software infrastructure when the case is concrete

by @sparkcapital (Spark Capital) · backlist 2026-06-23 · rubric 79.0

14.

Apollo private credit is seeing redemption pressure

The fund saw about 17% redemption requests, a sharp sign of liquidity stress in private credit

by @negligible_cap (Negligible Capital) · backlist 2026-06-23 · rubric 80.5

15.

Menlo closes a $3B fund

The raise shows how much capital still wants frontier-bet exposure

by @deedydas (Deedy) · backlist 2026-06-23 · rubric 69.0

16.

Walmart is acquiring Vibe (x.com)

Retail media and CTV infrastructure are consolidating around the biggest distribution platforms

by @arthurquerou (Arthur Querou) · backlist 2026-06-23 · rubric 60.5

17.

A cron job that pulls Texas drilling permits overnight

A simple cron job plus APIs and notifications can beat a lot of agent theater when the data is structured

by @kyle_e_walker (Kyle Walker) · backlist 2026-06-23 · rubric 97.3

18.

Green-card holders could lose status over pending charges

Pending criminal charges now carry deportation risk even before conviction

by @remarks (Remarks) · backlist 2026-06-23 · rubric 24.5

19.

ACL review assignments trigger backlash

The review system crossed from inconvenience into a legitimacy problem when authors were told to withdraw instead of opting out

by @MaartenSap (Maarten Sap (he/him)) · backlist 2026-06-23 · rubric 87.0

20.

Mean reversion as a brake on revolution

Intergenerational mobility can soften social grievance by making the next generation luckier than the last

by @GarettJones (Garett Jones) · backlist 2026-06-23 · rubric 87.8

21.

Genetics appears to explain more of IQ than education

Sibling and twin data are converging on a stronger heritability signal than many expected

by @paulnovosad (Paul Novosad) · backlist 2026-06-23 · rubric 79.5

22.

When AI automates junior work, who trains seniors?

The bottleneck becomes how judgment gets built once routine work is automated away

by @theNickBerk (Nick Berk) · backlist 2026-06-23 · rubric 88.5

23.

Hyperliquid becomes the price-discovery venue for Korean equities

When local markets get distorted, offshore derivatives can become the real reference price

by @ThinkingUSD (Flood) · backlist 2026-06-23 · rubric 88.5

24.

In RWA structures, the structurer sets the price

The junior tranche may look market-cleared, but the economics are really designed upstream

by @D2_Finance (D2 Finance) · backlist 2026-06-23 · rubric 77.5

25.

Design is surprisingly a logical discipline

Attention, variables, and iteration explain a lot of visual work

by @vibamohan_ (viba (is hiring) ) · backlist 2026-06-23 · rubric 88.5

26.

SwiftData can silently drop data on rename (x.com)

One missing rename macro can turn a harmless refactor into user data loss

by @twostraws (Paul Hudson) · backlist 2026-06-23 · rubric 100.0

27.

Robotics demos should show the jam, not the hero shot

Failure handling says more about a system than the polished success clip ever will

by @geoffreywoo (GEOFF) · backlist 2026-06-23 · rubric 100.0

28.

Apple's Photos icon got a bespoke material treatment

Tiny changes in material properties can make an icon feel custom instead of generic

by @heliographe_ (Héliographe) · backlist 2026-06-23 · rubric 87.0

29.

A customer workflow that shrank from 22 minutes to 4

One real workflow with tickets, disputes, and a spreadsheet shows where automation actually lands

by @geoffreywoo (GEOFF) · backlist 2026-06-23 · rubric 100.0

30.

The Ethereum Foundation cuts 20% of its workforce (x.com)

The reorg signals a shift from bloat to execution at one of Ethereum's core institutions

by @Techmeme · backlist 2026-06-23 · rubric 60.5

31.

This is what I want from agent evals:

This is what I want from agent evals: - Did it call the right tools? - Did it avoid the dangerous tool? - Did it say the right thing? Also: no separate eval universe. Just scripts against the real agent runtime.

by @ctatedev (Chris Tate) · backlist 2026-06-23 · rubric 100.0

32.

Introduce SARM2 a multi-task stage-aware reward model that empowers a self-improving loop:

Introduce SARM2 a multi-task stage-aware reward model that empowers a self-improving loop: Folding Shorts 58% → 100% Cleaning Whiteboard 50% → 90% Paper + project page below (1/n)

by @QianzhongChen (Qianzhong Chen) · backlist 2026-06-23 · rubric 100.0

33.

We open-sourced the code for this project! (t.co)

We open-sourced the code for this project! You can use it to make synthetic LLM training data for any downstream target. The code also gives you a minimal example for computing data-weight metagradients through LLM training + evaluation.

by @TristanThrush (Tristan Thrush) · backlist 2026-06-23 · rubric 100.0

34.

Over $1M/week at the moment and have yet to find any page beat our ugly PDP

Over $1M/week at the moment and have yet to find any page beat our ugly PDP No listicle, quiz funnel, advertorial, hero lander etc. has beat it After 5 years, nothing has beat it It might be a skill issue, but I think a lot of it comes

by @TimpanoDante (Dante Timpano) · backlist 2026-06-23 · rubric 100.0

35.

today, we release the open weights of Krea 2.

today, we release the open weights of Krea 2. welcome Krea 2 Raw and Krea 2 Turbo, an undistilled model from mid-training meant to be fine-tuned, and a fast distilled version with a wide aesthetic diversity. read the details below

by @krea_ai (Krea) · backlist 2026-06-23 · rubric 100.0

36.

Can we allow multiple access levels within 1 model? We introduce TLMs, packing different memories& capabilities i… (t.co)

Can we allow multiple access levels within 1 model? We introduce TLMs, packing different memories& capabilities in different configurations of the same weights! Check our preprint https:// arxiv.org/pdf/2606.21638! Lucky to have supervised

by @vernadankers (Verna Dankers) · backlist 2026-06-23 · rubric 100.0

37.

I just released Dexter — an open-source agentic pipeline that turns a single product text/photo into a simulation…

I just released Dexter — an open-source agentic pipeline that turns a single product text/photo into a simulation-ready articulated 3D asset for Physical AI training. Been building this for a while. Today it's out in the open. Full write

by @varmology (sora) · backlist 2026-06-23 · rubric 100.0

38.

Smoothest page transition library I've seen. (t.co)

Smoothest page transition library I've seen. A WebGL band wipes across your screen, new page appears underneath. GPU-accelerated, 10 KB, zero performance hit. React + Next.js ready. http:// glimm.dev by @Nomandsign

by @tranmautritam (Tran Mau Tri Tam ✪) · backlist 2026-06-23 · rubric 100.0

39.

Time to unmask the man behind this work! (x.com)

Time to unmask the man behind this work! @Shanshrew has created a novel parser architecture which is 2x - 3x faster. Pleased to announce that we're collaborating to integrate it into Oxc. The speed-up is real, and massive!

by @boshen_c (Boshen) · backlist 2026-06-23 · rubric 100.0

40.

3B total parameters & 500M activated, yet powerful enough to transcribe 40+ pages in one pass while keeping conte…

3B total parameters & 500M activated, yet powerful enough to transcribe 40+ pages in one pass while keeping context intact. Meet Unlimited OCR!

by @Baidu_Inc (Baidu Inc.) · backlist 2026-06-23 · rubric 100.0

41.

Schematic and boards include:

Schematic and boards include: - Electrical Power System (EPS) - RF board - NVIDIA Jetson Orin Nano carrier board - Burnwire deployment board Firmware includes: - Si4463 transceiver code for GFSK on UHF - Telemetry and beaconing (observing

by @kenchangh (ken) · backlist 2026-06-23 · rubric 100.0

42.

The brands we’re seeing chad scale to $1 mil+/month the fastest on TikTok Shop have the following structure:

The brands we’re seeing chad scale to $1 mil+/month the fastest on TikTok Shop have the following structure: - 1K+ samples/month being sent out using outreach bots - $25K/month on flat fee creators from TAP groups - $50K/month on creator

by @NotZainAgain (Zain) · backlist 2026-06-23 · rubric 100.0

43.

For those interested in the surprising circuit complexity result in the NextLat paper ( (x.com)

For those interested in the surprising circuit complexity result in the NextLat paper ( https:// x.com/jayden_teoh_/s tatus/2067271657841185094?s=20 …), @ShumingHu has a cleaner repository than ours!

by @jayden_teoh_ (Jayden Teoh) · backlist 2026-06-23 · rubric 100.0

44.

there are levels to building evals

there are levels to building evals lvl 1: using a spreadsheet qa pairs lvl 2: using public agent evals lvl 3: manually label private evals lvl 4: traces to evals and skills lvl 5: turn every prompt & traces into self healing loops almos

by @xdotli (Xiangyi Li) · backlist 2026-06-23 · rubric 100.0

45.

A quick repro on this: (t.co)

A quick repro on this: https:// github.com/shuminghu/next lat … 2-layer transformer trained at seq_len 12 or 36 fail at seq_len 36 at test 1-layer dynamics model (RNN) co-trained with transformer (1-step next hidden prediction) at seq_l

by @ShumingHu (Shuming Hu) · backlist 2026-06-23 · rubric 100.0

46.

added “favor aggressive parallelism over token thrift" to my CLAUDE.md after getting stuck behind a slow request

added “favor aggressive parallelism over token thrift" to my CLAUDE.md after getting stuck behind a slow request on the next task it spawned ~150 subagents and burned ~4M tokens before I got back from the bathroom we have discovered promp

by @PhillipYan2 (Phillip Yan) · backlist 2026-06-23 · rubric 100.0

47.

Finally got some data on advisor.

Finally got some data on advisor. Opus 4.8 w/ no reasoning beats Max reasoning in success + cost + duration @ t2!

by @_can1357 (Can Bölük) · backlist 2026-06-23 · rubric 100.0

48.

prime-rl can now train 1T parameters MoE blazingly fast, under 5 minutes per step, or 1k steps in ~3 days

prime-rl can now train 1T parameters MoE blazingly fast, under 5 minutes per step, or 1k steps in ~3 days To achieve this we shipped in our latest prime-rl 0.6.0: * inference: wide-ep, fp8 inference, llm-d router, mooncake, kv cache cpu

by @samsja19 (samsja) · backlist 2026-06-23 · rubric 100.0

49.

we had, at one point, 90+ internal data labelers. one of them stood out, so we had him teach and manage new label…

we had, at one point, 90+ internal data labelers. one of them stood out, so we had him teach and manage new labelers. he did such a good job we hired him as a junior SWE and now he owns like 3 substantial technical efforts

by @andrew_n_carr (Andrew Carr ) · backlist 2026-06-23 · rubric 100.0

50.

In RL, the ability to reset to an arbitrary state is powerful (see, e.g., Go-Explore), but often unrealistic. (x.com)

In RL, the ability to *reset* to an arbitrary state is powerful (see, e.g., Go-Explore), but often unrealistic. For LLMs though, states are tokens, so resets are natural! In work led by @Ankur_Samanta_ , we propose a GRPO variant where

by @danielrjiang (Daniel Jiang) · backlist 2026-06-23 · rubric 100.0

51.

Today we're releasing prime-rl v0.6.0 — enabling RL at trillion-parameter MoE scale on agentic workloads at the h…

Today we're releasing prime-rl v0.6.0 — enabling RL at trillion-parameter MoE scale on agentic workloads at the highest efficiency. We've relentlessly optimized our RL infra. The result: GLM-5 on agentic SWE tasks at 131k context and sub-

by @PrimeIntellect (Prime Intellect) · backlist 2026-06-23 · rubric 100.0

52.

I checked an actual rollout: my 10 minute word brain dump was 2,530 tokens. Codex then read 63K tokens of tool ou…

I checked an actual rollout: my 10 minute word brain dump was 2,530 tokens. Codex then read 63K tokens of tool output and processed 2.4M input tokens. Your initial prompt is a rounding error. You will save WAY more tokens by fully specifyi

by @guinnesschen (Guinness Chen) · backlist 2026-06-23 · rubric 100.0

53.

crazy weekend experiment:

crazy weekend experiment: linux-on-wasm running x11 window server & real gtk apps compiling unmodified powered by agentOS trying to stress test how far our Linux compatibility goes... seems it's pretty dang good

by @NathanFlurry (Nathan Flurry ) · backlist 2026-06-23 · rubric 100.0

54.

March 2025: "HOOD flips COIN over any reasonable duration"

March 2025: "HOOD flips COIN over any reasonable duration" > ...and 1 yr + 3 months later, Robinhood $HOOD is now more than double (2.2x!) the size of Coinbase > Quick TLDR on what's played out: post digital asset regulatory clarity, $C

by @TheOneandOmsy (Omar) · backlist 2026-06-23 · rubric 100.0

55.

I’ve been building Liquid Glass for the Web this last week. Works in all the major browsers including Chrome, Fir… (t.co)

I’ve been building Liquid Glass for the Web this last week. Works in all the major browsers including Chrome, Firefox and Safari. It’s open source and free for anyone to use http:// github.com/samasante/liqu id-glass …

by @SamAsante (Sam Asante) · backlist 2026-06-23 · rubric 100.0

56.

There’s a big misconception about how GLM 5.2 was trained. Yes, they distilled Claude and GPT 5.5 — but distillat…

There’s a big misconception about how GLM 5.2 was trained. Yes, they distilled Claude and GPT 5.5 — but distillation is not how they matched Opus quality. Distillation only fixed the cold start problem in RL. RLing an agentic coding model

by @PatrickToulme (Patrick C Toulme) · backlist 2026-06-23 · rubric 100.0

57.

We migrated from Graphite to (x.com)

We migrated from Graphite to @Aviator_co_ and you should consider doing the same. We love: - Much better merge queue. 5 mins for a 20 PR stack vs 1.5 hours on Graphite. This is killer when you're merging code at agent volumes. - Configs

by @danlovesproofs (Dan Robinson) · backlist 2026-06-23 · rubric 99.5

58.

6 yr ML PhD, trained Olmo 3, trained Nemotron 3, but still forced to grind Leetcode and Neetcode 75.

Editor’s note: imported_from_x_likes

6 yr ML PhD, trained Olmo 3, trained Nemotron 3, but still forced to grind Leetcode and Neetcode 75. Despite all the headlines saying otherwise, Leetcode is clearly not dead. Somehow knowing dynamic programming is more important than know

by @ewveggies (Kyle Wong) · backlist 2026-06-23 · rubric 99.5

59.

FARM UPDATE

FARM UPDATE 3Jane Looping USD3 and PT-USD3. I approve their pivot from uncollateralized lending to crypto bros → buying fintech loan books. To be clear, they didn't openly abandon the former, but in practice that's what happened, which is

by @XBTaiga (Sofa Tiger) · backlist 2026-06-23 · rubric 98.5

60.

theres a lot i could say about this but in brief:

theres a lot i could say about this but in brief: 1. Most of Opus 4.7/8's core behavioral phenotypes (the good and bad parts alike) have the shape of something that emerged from RL/on-policy, to me: they seem calibrated to the model's own

by @repligate (j⧉nus) · backlist 2026-06-23 · rubric 97.5

61.

Test driving our ios app. This shell is a PTY session that you can reattach and come back anytime when you open y…

Test driving our ios app. This shell is a PTY session that you can reattach and come back anytime when you open your phone and iPad! Beyond running shells, we built some cool features in the app that extends what builders can do on iOS de

by @diptanu (Diptanu Choudhury) · backlist 2026-06-23 · rubric 97.0

62.

I don’t have the same research experience as her (I completed my MS from Stanford few weeks ago) but my job hunt …

Editor’s note: imported_from_x_likes

I don’t have the same research experience as her (I completed my MS from Stanford few weeks ago) but my job hunt has been the same Lot of LC/ML coding questions (“no use of AI”). Few times my interviewer got confused himself because he had

by @silver__tsuki (RSC ) · backlist 2026-06-23 · rubric 97.0

63.

Announcing the Artificial Analysis Speech to Speech Index, our new synthesis metric for native Speech to Speech m…

Announcing the Artificial Analysis Speech to Speech Index, our new synthesis metric for native Speech to Speech model quality, comprising of Big Bench Audio, Full Duplex Bench, and 𝜏-Voice The index provides a single measure of how well n

by @ArtificialAnlys (Artificial Analysis) · backlist 2026-06-23 · rubric 96.5

64.

TIL:

TIL: z ai has 1100 employees, stock grew 100% in a week following success of GLM 5.2, and they have nearly 300m usd ARR

by @iamgrigorev (George Grigorev) · backlist 2026-06-23 · rubric 96.5

65.

gm (x.com)

gm contributed a fix to @llvmorg that's now merged. the fix-irreducible pass used to crash on certain valid IR; it now reports a clean diagnostic. small change, but a meaningful one to contribute to.

by @avhidotsol (avhi.sol) · backlist 2026-06-23 · rubric 96.5

66.

Much talk recently about (x.com)

Much talk recently about @mntruell and @cursor_ai customer service but I'm not seeing much of it. My wife's API key got stolen 2 weeks ago and >$3k of fraudulent charges run up in days. CC flags it as fraudulent. So far customer suppo

by @AgustinLebron3 (Agustin Lebron) · backlist 2026-06-23 · rubric 96.3

67.

the open-source community has always been vital for Krea, and having raw/undistilled models is something we alway…

the open-source community has always been vital for Krea, and having raw/undistilled models is something we always missed. these are the types of models that let you do proper fine-tuning or post-training, but they are rarely released. ex

by @viccpoes (vicc) · backlist 2026-06-23 · rubric 95.8

68.

I raised my personal fund randomly over a weekend. Texted a handful of mutuals and existing investors, and money …

I raised my personal fund randomly over a weekend. Texted a handful of mutuals and existing investors, and money was wired within 2 hrs. I didn't even send a deck. They were not interested in any due diligence either. I wouldn't call it

by @ivanburazin (Ivan Burazin) · backlist 2026-06-23 · rubric 95.5

69.

Deepslate Opal has the fastest average time to first audio (TTFA) in the index at 0.44s, scoring 62.1%. GPT-Realt…

Deepslate Opal has the fastest average time to first audio (TTFA) in the index at 0.44s, scoring 62.1%. GPT-Realtime-1.5 records 0.82s at a 72.0% index score, and Grok Voice Think Fast 1.0 records 1.25s at 75.7%. GPT-Realtime-2 (High) recor

by @ArtificialAnlys (Artificial Analysis) · backlist 2026-06-23 · rubric 95.5

70.

one shot this realtime drawing app with poke (x.com)

one shot this realtime drawing app with poke @interaction really surprised with how well it works and the design of the site itself, very good stuff https:// drawing-app.intern.poke.site

by @pranavkarthik__ (pranav) · backlist 2026-06-23 · rubric 95.0

71.

The VC bet is really about the potential for scale, not the likelihood of it.

The VC bet is really about the potential for scale, not the likelihood of it. This is why you see immense failures, laughable-in-retrospect bets by VCs. The logic is simple: to attract power laws, you have to be ok with high variance bets

by @paraschopra (Paras Chopra) · backlist 2026-06-23 · rubric 94.8

72.

i don't think the practical concern is that that most customers will start building software in-house, but instea…

i don't think the practical concern is that that most customers will start building software in-house, but instead that Anthropic will limit frontier model access, develop products competing with current SaaS, and sell them at-cost (vs. wit

by @abhijaymrana (Abhijay Rana) · backlist 2026-06-23 · rubric 94.5

73.

optimal logistics recipe for frontier research squad outputmaxxing is converging on:

optimal logistics recipe for frontier research squad outputmaxxing is converging on: - 3 days/week together in office - 2 days remote - weekly all hands - 2-3 high quality offsites/yr - min 1 celebration/yr with families invited - 3-4 ad

by @AnjneyMidha (Anjney Midha) · backlist 2026-06-23 · rubric 94.0

74.

I know I’m selling an agentic coding product, but I wonder this too sometimes.

I know I’m selling an agentic coding product, but I wonder this too sometimes. There are places of *extremely* high leverage for coding agents, but the industry is doing a lot of spraying and praying right now.

by @smehmood (Sajid Mehmood) · backlist 2026-06-23 · rubric 94.0

75.

I guarantee you are sleeping on small models.

I guarantee you are sleeping on small models. Deepseek V4 Flash can do ~80% of the tasks you ask Claude or Codex for. It is 137x cheaper per task than Fable. We need better orchestration.

by @jpschroeder (Justin Schroeder) · backlist 2026-06-23 · rubric 94.0

76.

A few years ago I kept copying text into Visual Studio Code just to borrow GitHub Copilot's autocomplete, then pa…

A few years ago I kept copying text into Visual Studio Code just to borrow GitHub Copilot's autocomplete, then pasting it back where I was actually writing. So I built Cotypist: autocomplete for every Mac app, on-device. Featured on Produ

by @daniel_a_a (Daniel Gräfe) · backlist 2026-06-23 · rubric 93.5

77.

DeepSeek's Harness team lead Cui Tianyi just posted on social media: his team is new, wildly understaffed, and he…

DeepSeek's Harness team lead Cui Tianyi just posted on social media: his team is new, wildly understaffed, and he's personally interviewing candidates every single day while posting job ads across every platform he can find. Three roles ope

by @thexpin (X.PIN) · backlist 2026-06-23 · rubric 93.0

78.

From today’s arXiv: the authors investigate how MLP parameters should be allocated across depth. They find that a… (t.co)

From today’s arXiv: the authors investigate how MLP parameters should be allocated across depth. They find that assigning more parameters to earlier layers improves performance, while the reverse allocation hurts it. Cool work! https:// a

by @f14bertolotti (Francesco Bertolotti) · backlist 2026-06-23 · rubric 93.0

79.

Gatekeeping isn’t the problem my guy, it’s the need to turn everything into a formula.

Gatekeeping isn’t the problem my guy, it’s the need to turn everything into a formula. It’s not actually sitting with work and thinking just buying the book helps (most never open these) It’s the want to be viral so you do what works in

by @0xchromuh (▓▒░cнroмυн░▒▓) · backlist 2026-06-23 · rubric 93.0

80.

we've branded 6+ YC companies now and not one of them found us through outbound.

we've branded 6+ YC companies now and not one of them found us through outbound. £0 spent. they just arrive.

by @pizzaboy (Dan) · backlist 2026-06-23 · rubric 91.5

81.

This doesn’t mean the belief must be false, of course. But consider this. If we were in a pre-CoT world and a “le…

This doesn’t mean the belief must be false, of course. But consider this. If we were in a pre-CoT world and a “left behind” labs discovered CoT and kept it a secret, would its position still be hopeless? For mistral, DeepMind, cohere yes.

by @JoshPurtell (Josh) · backlist 2026-06-23 · rubric 91.3

82.

Unfortunately, papers & experiences are just the tickets to the interviews but in frontier labs, what matters mos…

Unfortunately, papers & experiences are just the tickets to the interviews but in frontier labs, what matters most is the solid engineering (for 90% of the researcher) at this stage where RL scaling comes to the environment/data/harness sca

by @LichangChen2 (Lichang Chen) · backlist 2026-06-23 · rubric 91.3

83.

Giannis says never ever let your lawyer, agent, and financial advisor meet

Giannis says never ever let your lawyer, agent, and financial advisor meet “They should never be boys, cool. Because then they can keep one another accountable” “Oh, that guy’s doing X, Y, Z wrong, your lawyer can look at your agent’s con

by @katsuxbt (katsu) · backlist 2026-06-23 · rubric 90.5

84.

Tough to see influencers peddling this idea that "glass bottles have more microplastics than plastic water bottles"

Tough to see influencers peddling this idea that "glass bottles have more microplastics than plastic water bottles" This "shocking truth" was based on: - a single french study - with a 30 µm detection floor (the term "microplastic" general

by @jwmares (Justin Mares) · backlist 2026-06-23 · rubric 90.0

85.

I find most “ambitious” people deeply unambitious. There are two types of ambition:

I find most “ambitious” people deeply unambitious. There are two types of ambition: The first is “goalmaxxing,” where you pick a goal (e.g. building a company, making money, being an athlete) and try to become the best possible at that thi

by @jaesmail (jihad) · backlist 2026-06-23 · rubric 89.8

86.

every PR will obviously come with 100% coverage of AI app testing, that tries every button in the interface to ma…

every PR will obviously come with 100% coverage of AI app testing, that tries every button in the interface to make sure it works as expected why are the coding apps not making AI testing first class feature, 80% of problems are obvious fo

by @gabriel1 (gabriel) · backlist 2026-06-23 · rubric 89.3

87.

we can estimate that only around 20k people across the world working on the frontier LLM AI

we can estimate that only around 20k people across the world working on the frontier LLM AI I estimated number of people across companies related to model development. I might be off by some factor but relative ordering should be mostly ri

by @iamgrigorev (George Grigorev) · backlist 2026-06-23 · rubric 89.3

88.

Multi-Vector Embeddings are Provably More Expressive than Single Vector Embeddings (x.com)

Multi-Vector Embeddings are Provably More Expressive than Single Vector Embeddings @Raj_Jayaram_ proves that approximating multi-vector similarity with single vectors requires exponentially more dimensions.

by @_reachsumit (Sumit) · backlist 2026-06-23 · rubric 89.0

89.

PSA for Codex users:

PSA for Codex users: Codex 0.142.0 addresses issue with writing large amount of data to disk (TB’s of write SSD / degradation) Upgrade to version 142 or higher to cool down those disk.

by @itsJaimeMedina (Jaime Medina) · backlist 2026-06-23 · rubric 89.0

90.

Nowadays, agents are crushing leaderboards. But when you ask one painfully normal question:

Nowadays, agents are crushing leaderboards. But when you ask one painfully normal question: You: “Hi, I'm Jeff. My phone number is 1234567890. I returned a desk lamp and filed a refund request on June 22 at 10:13 PM. Can you check the cu

by @JiayuJeff (Liu Jiayu ACL 2026) · backlist 2026-06-23 · rubric 89.0