Backlist — 17 Jun 2026 UTC

1.

Mastra-AI npm ecosystem hit by supply-chain attack

Microsoft identified 80-plus compromised npm packages in the Mastra-AI ecosystem after an account takeover introduced a phantom dependency

by @calcsam (Sam Bhagwat) · backlist 2026-06-17 · rubric 62.0

2.

Getting macOS cursor-to-display latency near zero

A 120 Hz slow-motion capture shows how much work is required to drive perceived input latency down to almost zero on macOS

by @rsms (Rasmus Andersson) · backlist 2026-06-17 · rubric 81.0

3.

A 60-year-old rectangle-piercing conjecture has been disproved

A conjecture from 1965 about how many points are needed to pierce families of axis-parallel rectangles in the plane has finally been refuted

by @publishiperishi (Rishikesh Gajjala) · backlist 2026-06-17 · rubric 72.0

4.

IIT Bombay undergrads are building a semiconductor fab from scratch

Three undergraduates spent ten months building a hacker fab and are reportedly approaching their first complete NMOS transistor on tools they made themselves

by @SwarajyaMag (Swarajya) · backlist 2026-06-17 · rubric 64.0

5.

New robot tasks without retraining: just retrieve demos

A retrieval-conditioned VLA policy can be frozen once and extended to new tasks at test time by adding cheap human-hand demonstrations to a retrieval pool

by @oodgnas (Sangdoo Yun) · backlist 2026-06-17 · rubric 88.0

6.

GameCraft-Bench: can coding agents build playable Godot games?

A new benchmark asks coding agents to ship complete playable Godot projects across 140 tasks, and the best current agent solves only 41.5%

by @ZiniuLi (Ziniu Li) · backlist 2026-06-17 · rubric 90.0

7.

GLM-5.2 becomes the leading open-weights model on Artificial Analysis (x.com)

Z.ai’s GLM-5.2 reached the top open-weights score on the Artificial Analysis Intelligence Index while sitting on the cost-performance Pareto frontier

by @ArtificialAnlys (Artificial Analysis) · backlist 2026-06-17 · rubric 56.0

8.

An O*NET for AI R&D automation

Epoch proposes a 60-plus-task taxonomy of frontier AI research work and rates each task from 0 to 5 by current automability

by @EpochAIResearch (Epoch AI) · backlist 2026-06-17 · rubric 76.0

9.

Automation and Repression (t.co)

Daron Acemoglu’s new working paper models how pervasive automation combined with redistribution can reshape political incentives and repression

by @DAcemogluMIT (Daron Acemoglu) · backlist 2026-06-17 · rubric 20.0

10.

Safe Rust kernel programming on GPUs (x.com)

A Rust abstraction on Tile IR claims effectively free safety for GPU kernels, with a safe GEMM competitive with hand-tuned CUDA on B200 hardware

by @roeschinc (Jared Roesch) · backlist 2026-06-17 · rubric 34.0

11.

Latent Context Language Models make long-context inference up to 8.8× faster

A 0.6B encoder compresses long context into latent vectors for a 4B decoder, reducing long-context cost while preserving accuracy

by @artemg314 (Artem Gazizov) · backlist 2026-06-17 · rubric 72.0

12.

ABC: open data, training, and infrastructure for robotics (x.com)

ABC releases what it calls the largest teleoperation dataset to date along with open training and infrastructure for robot policies

by @ritvik_singh9 (Ritvik Singh) · backlist 2026-06-17 · rubric 86.0

13.

GPT-5.4 improves a medicinal chemistry reaction (t.co)

GPT-5.4 helped move a medicinal chemistry project from literature review to a validated experimental improvement in a widely used drug-discovery reaction

by @OpenAI · backlist 2026-06-17 · rubric 34.0

14.

Scientists still need FP64

The FP8-for-everything narrative breaks down in scientific computing, where many workloads still require FP64 precision or careful compensated methods

by @Underfox3 (Underfox) · backlist 2026-06-17 · rubric 68.0

15.

Glass cores for advanced semiconductor packages

As interposer sizes grow, TSMC and Intel appear to be converging on glass cores for advanced packages, with OSATs positioned to benefit either way

by @vikramskr (Vikram Sekar) · backlist 2026-06-17 · rubric 66.0

16.

CSS field-sizing is now baseline

Form fields can now size themselves to their contents with CSS field-sizing, removing a common JavaScript workaround

by @FirefoxWebDevs (Firefox for Web Developers) · backlist 2026-06-17 · rubric 42.0

17.

Telegram routing and the BGP hijack allegation

Reports of Reliance Communications announcing Telegram IP prefixes through FLAG Telecom raised a live BGP hijacking concern affecting traffic beyond India

by @basedjensen (Hensen Juang) · backlist 2026-06-17 · rubric 58.0

18.

China did not cap global oil prices (t.co)

Robin Brooks argues China’s oil imports fell because Hormuz was closed and Iran was blockaded, not because Beijing was intentionally stabilizing prices

by @robin_j_brooks (Robin Brooks) · backlist 2026-06-17 · rubric 61.0

19.

US naturalization processing slows to 9.5 months (t.co)

New USCIS data show average naturalization processing time reaching 9.5 months, with more than 400,000 cases pending over six months

by @J_Gelatt (@juliagelatt.bsky.social) · backlist 2026-06-17 · rubric 72.0

20.

Illinois taxes crypto wallet transfers at 0.2%

A 0.2% Illinois crypto transaction tax applies even to transfers between personal wallets, creating potentially massive costs for large custody moves

by @jbrukh (Jake Brukhman) · backlist 2026-06-17 · rubric 66.0

21.

Snowflake vs. Databricks: GAAP revenue versus run-rate ARR

Annualizing Snowflake’s latest quarterly GAAP revenue gives a $5.56B run rate, narrowing the apparent gap with Databricks’ non-GAAP ARR figure

by @credistick (Dan Gray) · backlist 2026-06-17 · rubric 72.0

22.

Autoresearch agents that replicate arXiv papers

alphaXiv is deploying agents to set up arXiv codebases, resolve environment issues, reproduce core claims, and rank papers by implementation difficulty

by @askalphaxiv (alphaXiv) · backlist 2026-06-17 · rubric 72.0

23.

The case for building your own personal software stack

Replacing Notion, Roam, and Airtable with a custom app shows how AI-assisted building can make personal software stacks viable for power users

by @JaredSleeper (Jared Sleeper) · backlist 2026-06-17 · rubric 62.0

24.

SaaS vs. DaaS: where tacit expertise gets monetized

When tacit expertise trains frontier models instead of becoming bespoke software, founders trade durable end-customer revenue for easier distribution through labs

by @catboosted (altra) · backlist 2026-06-17 · rubric 74.0

25.

Why Commerce’s Anthropic export letter may be legally flawed

The critique argues that Commerce’s letter to Anthropic stretches export-control rules because access to a hosted model may not constitute an export of an item

by @alasdairpr (Alasdair Phillips-Robins) · backlist 2026-06-17 · rubric 74.0

26.

Commerce reportedly delayed Entity List additions for DeepSeek and CXMT

The Department of Commerce reportedly held back on adding DeepSeek, CXMT, and more than 100 Chinese companies to the Entity List to avoid escalating tensions with China

by @AndrewCurran_ (Andrew Curran) · backlist 2026-06-17 · rubric 72.0

27.

Why Giordano Bruno’s final trial was different

Renaissance justice often ran on patronage, which helps explain why Bruno survived earlier trials before the Inquisition finally executed him in 1600

by @dwarkesh_sp (Dwarkesh Patel) · backlist 2026-06-17 · rubric 0.0

28.

date MATH: a dating sim for mathematical concepts (t.co)

A free visual novel lets players romance more than 25 mathematical concepts, complete with four endings, a genocide route, and secret characters

by @Akuicia (Aku | 少骨) · backlist 2026-06-17 · rubric 74.0

29.

Most vegan leather is polyurethane

Many products marketed as vegan leather are just polyurethane plastic, while actual plant- or fungus-based alternatives remain expensive and scarce

by @amypretzel (amy) · backlist 2026-06-17 · rubric 72.0

30.

A Great White Egret in flight at RSPB Ham Wall in Somerset recently.

by @CarlBovisNature (Carl Bovis) · backlist 2026-06-17 · rubric 89.0

31.

Charlton forgot to mention the 80% discount and MFN

by @ItzSuds (sudarshan) · backlist 2026-06-17 · rubric 88.0

32.

Very hawkish dot plot. Nine out of 18 officials have at least one hike this year (and six of those 9 have *multiple hikes*). Only one person has a cut this year, and one participant (presumably Warsh) didn't submit an SEP The statement

by @NickTimiraos (Nick Timiraos) · backlist 2026-06-17 · rubric 88.0

33.

zhipu和deepseek在25年春都曾经是jina reader的数一数二的大客户，也都是由我直接founder support。二者给我留下的印象就是非常精，对技术指标要求非常苛刻，动不动就p99

zhipu和deepseek在25年春都曾经是jina reader的数一数二的大客户，也都是由我直接founder support。二者给我留下的印象就是非常精，对技术指标要求非常苛刻，动不动就p99

by @hxiao (Han Xiao) · backlist 2026-06-17 · rubric 88.0

34.

not surprising. to my knowledge there's a single person in the US government with experience working on frontier …

not surprising. to my knowledge there's a single person in the US government with experience working on frontier AI models at a company.

by @ohlennart (Lennart Heim) · backlist 2026-06-17 · rubric 86.0

35.

. (x.com)

. @ItzSuds won’t stop — he’s sourcing founders earlier and earlier. Locked in allocation with Aurelio to an Uncapped SAFE.

by @CharltonJBoyd (Charlton J. Boyd) · backlist 2026-06-17 · rubric 86.0

36.

nvfp4 vs mxfp4 is not just different choices of block size and scale format, nvfp4 uses an additional tensor-wise…

nvfp4 vs mxfp4 is not just different choices of block size and scale format, nvfp4 uses an additional tensor-wise scale factor to overcome the range limit of fp4, and thus can use more precisions for block-wise scale factors.

by @zcbenz (Cheng) · backlist 2026-06-17 · rubric 86.0

37.

New nugget in our latest story on the Anthropic Fable saga:

New nugget in our latest story on the Anthropic Fable saga: Dario Amodei told Howard Lutnick "This means we can't have the model out" Friday after learning of the ban on foreign use. "That's the point," the Commerce Secretary said.

by @AmrithRamkumar (Amrith Ramkumar) · backlist 2026-06-17 · rubric 86.0

38.

We're launching turbo mode data extraction - 5x faster, 5x cheaper, and 7% more accurate than Azure Content Under…

We're launching turbo mode data extraction - 5x faster, 5x cheaper, and 7% more accurate than Azure Content Understanding. 4.5s p50/7s p90 across 1-30 page docs - good enough for realtime user flows.

by @VikParuchuri (Vik Paruchuri) · backlist 2026-06-17 · rubric 84.0

39.

Are AI agents shape rotators? In this new benchmark, we let the models play campaign puzzles in Opus Magnum, a pu… (x.com)

Are AI agents shape rotators? In this new benchmark, we let the models play campaign puzzles in Opus Magnum, a puzzle game by @zachtronics . Ironically, Claude Opus 4.8 performed poorly, being beaten by GPT-5.5, Gemini 3.5 Flash, and GLM

by @RobertHaisfield (Rob Haisfield) · backlist 2026-06-17 · rubric 84.0

40.

RQL is a new, clean algorithm for (offline) flow RL!

RQL is a new, clean algorithm for (offline) flow RL! The main idea is to treat flow steps as MDP steps, and use "reversed" flows to generate hindsight flow trajectories for off-policy data.

by @seohong_park (Seohong Park) · backlist 2026-06-17 · rubric 84.0

41.

New work: The Value Axis

New work: The Value Axis How do LLMs choose which path to take mid-task? We find they internally track the chance of reaching their goal along a linear axis, akin to a value function in RL. We show it modulates confidence in math & coding

by @nickhjiang (Nick Jiang) · backlist 2026-06-17 · rubric 84.0

42.

The tightly overlapping beaver twins

The tightly overlapping beaver twins #Hanamura_City_Animal_Park #American_Beaver 2023

by @higashiyama5555 (やまこじ) · backlist 2026-06-17 · rubric 84.0

43.

Using off-policy (rollouts of another model) prefixes gives the game away - the model would learn to classify off…

Using off-policy (rollouts of another model) prefixes gives the game away - the model would learn to classify off- vs on- policy even better than they do already. You would get higher eval awareness, not lower, even though it would be bette

by @tessera_antra (antra) · backlist 2026-06-17 · rubric 83.0

44.

vc data point:

vc data point: old diligence asked: - who else is in? - how big is tam? - did a famous firm pass? new diligence asks: - what work disappeared? - why now? - what breaks if GPT-6 gets cheaper? status questions age badly when research is fr

by @geoffreywoo (GEOFF) · backlist 2026-06-17 · rubric 82.0

45.

Layering in:

Layering in: 1) The Anthropic/Google data center rental ARR ($26B) 2) And Cursor's end-of-year ARR (potentially over $10B) On an annualized basis, I expect SpaceX's revenue to exceed $60B by year-end.

by @JackKuhr (Jack Kuhr) · backlist 2026-06-17 · rubric 81.0

46.

currently all of the results are getting manually merged in by a single co-ordinator...

currently all of the results are getting manually merged in by a single co-ordinator... it's a huge bottleneck... so i'm adding hierarchical merging locks where any agent can apply changes. then i need to start hosting them on aws. i nee

by @JungleSilicon (Silicon Jungle) · backlist 2026-06-17 · rubric 81.0

47.

Poor theory of mind is one of the main things keeping models from being good software engineers. They can resolve…

Poor theory of mind is one of the main things keeping models from being good software engineers. They can resolve specific, reproducible bugs, but they struggle to anticipate what users want in the first place, which is much of what buildin

by @MechanizeWork (Mechanize) · backlist 2026-06-17 · rubric 78.0

48.

as much as we have fun building evals + environments, at some point, poor grad students (among others) become the…

as much as we have fun building evals + environments, at some point, poor grad students (among others) become the bottleneck to improving AI system capabilities. there's a ton of domains that are technically verifiable (but not in ways that

by @18jeffreyma (Jeff Ma ICML) · backlist 2026-06-17 · rubric 78.0

49.

LoopCoder-v2 is out (t.co)

LoopCoder-v2 is out Loop Transformers reuse the same block for recurrent hidden-state refinement — letting models “think” more without simply stacking more layers. We study how many loops are actually worth it in Parallel Loop Transforme

by @DorothyDDU (Yaxin Du) · backlist 2026-06-17 · rubric 78.0

50.

Yeah, I think this is a fair concern.

Yeah, I think this is a fair concern. One practical issue is cost: a single 24h Codex run already consumes around 100M tokens, so extending this to the full two-week human window across multiple tasks/trials would quickly reach the 10B-tok

by @MangQiuyang (Qiuyang Mang) · backlist 2026-06-17 · rubric 78.0

51.

this part is actually very interesting, for the mtp head at t+2 they don't include the kv of the indexer of the p…

this part is actually very interesting, for the mtp head at t+2 they don't include the kv of the indexer of the predicted value at mtp t+1 for efficiency (indexer sharing) AND found that it leads to better results because it avoids training

by @eliebakouch (elie) · backlist 2026-06-17 · rubric 78.0

52.

LMAO,

LMAO, $sats liability is GONE. GONE! Just in 30 mins ago. TLDR: $sats owe FCC $2.9 Billion. If Auction 113 raises ~$2.921 billion or more, EchoStar owes $0 It’s $3.1 Billion now Project out the rest of the spectrum echostar owns. Ma

by @pakpakchicken (Chicken Genius) · backlist 2026-06-17 · rubric 78.0

53.

Read about it here - (t.co)

Read about it here - https:// datalab.to/blog/turbo-ext raction … . Our latest latency test showed p50 4.67s, p90 7.0s, p99 17.05s. Field accuracy on our internal 225-doc benchmark is 89.5% vs Azure 83.4%. Pricing is $6/1000 pages vs Az

by @VikParuchuri (Vik Paruchuri) · backlist 2026-06-17 · rubric 76.0

54.

I only recently realized that Zhipu is far from the only lab that has moved away from GRPO. Some teams working on… (x.com)

I only recently realized that Zhipu is far from the only lab that has moved away from GRPO. Some teams working on long horizon tasks still rely heavily on PPO or even REINFORCE, and a few have never seriously adopted GRPO at all. It is int

by @sheriyuo (Xiuyu Li) · backlist 2026-06-17 · rubric 76.0

55.

Meltdown, 2023-26 (x.com)

Meltdown, 2023-26 Edition of 16 unique works by @andreasgysin Fully on-chain (ERC-721) JavaScript, WebGL, silent, responsive Zero 10, @ArtBasel

by @nguyenwahed · backlist 2026-06-17 · rubric 76.0

56.

Hm... But often for the wrong reasons. Like the infamous "tell the AI or alien space prob a logical paradox to ma…

Hm... But often for the wrong reasons. Like the infamous "tell the AI or alien space prob a logical paradox to make it explode". When it's closer to buffer overflows.

by @gwern (𝔊𝔴𝔢𝔯𝔫) · backlist 2026-06-17 · rubric 76.0

57.

Dog

Dog Colorful dogs in condiment colors. They have been waiting patiently for summer. Cooling off in the sea, warming up in the sand, then doing it all over again. Favorite food: Nachos

by @kasumioomine (KASUMI OOMINE) · backlist 2026-06-17 · rubric 76.0

58.

We have a portfolio company where I installed a new CEO.

We have a portfolio company where I installed a new CEO. No one said no to him because they all wanted to suck up to the new owner. I was way too hands off. He went on to launch a new vertical and burned a lot of $$$ pre PMF because no

by @RomanEcom (Roman Khan) · backlist 2026-06-17 · rubric 76.0

59.

GLM 5.2 is absolutely convinced that it is actually Claude, from Anthropic. When I tell it that it's GLM 5.2, it …

GLM 5.2 is absolutely convinced that it is actually Claude, from Anthropic. When I tell it that it's GLM 5.2, it refuses to believe me, but is willing to check the local agent config to see what model is running. The realization:

by @peakcooper (Cooper) · backlist 2026-06-17 · rubric 76.0

60.

New (x.com)

New @fulcrum_inc research - Agents are under-elicited: A case study in optimization tasks. We find that simple and general prompt/scaffold interventions can roughly double agent performance by getting agents to use more resources more ef

by @uzpg_ (Uzay) · backlist 2026-06-17 · rubric 76.0

61.

Databricks announced it has crossed $6.9b in annualized recurring revenue, up 80% year over year. Snowflake's lat…

Databricks announced it has crossed $6.9b in annualized recurring revenue, up 80% year over year. Snowflake's latest quarter puts them at roughly $5.3b ARR, up 34%.

by @ttunguz (Tomasz Tunguz) · backlist 2026-06-17 · rubric 74.0

62.

winning position on polymarket usually isn't the smartest analysis

winning position on polymarket usually isn't the smartest analysis it's just being first news breaks -> sharp money moves -> by the time you open the app, the line already priced it in you were late > signal detected > analysis done > o

by @0xbobaaa (0xbobaa) · backlist 2026-06-17 · rubric 74.0

63.

We’re publishing a new daily report comparing GPU compute prices, price changes, and volatilities across models, … (x.com)

We’re publishing a new daily report comparing GPU compute prices, price changes, and volatilities across models, with data from @ComputeDesk , Bloomberg: CIBLKWUS, CIHOPUS H100s, the oldest model with the largest install base, currently s

by @BrettHarrison (Brett Harrison) · backlist 2026-06-17 · rubric 74.0

64.

It’s an internal site for usage stats

by @mweinbach (Max Weinbach) · backlist 2026-06-17 · rubric 74.0

65.

Every time (x.com)

Every time @mlmabc posted a large TWAP, I wondered why anyone would reveal their execution params to the whole market instead of executing privately So we dug into the data Turns out visible execution is not that bad and can even be che

by @Turtle_Lair (Turtle Lair) · backlist 2026-06-17 · rubric 74.0

66.

CMU Advanced NLP Lecture 9: Decoding Algorithms

CMU Advanced NLP Lecture 9: Decoding Algorithms This lecture explains a key aspect of generative LLMs: The model learns a probability distribution, but useful generation still depends on how we decode from that distribution. Greedy deco

by @ickma2311 (Chao Ma) · backlist 2026-06-17 · rubric 74.0

67.

Becoming pretty clear the real AI labor story is less mass layoffs and far more org chart restructuring

Becoming pretty clear the real AI labor story is less mass layoffs and far more org chart restructuring > Good slides from Cloudflare $NET on automating sales support, redeploying the savings into AEs, and driving more growth w/ the same

by @TheOneandOmsy (Omar) · backlist 2026-06-17 · rubric 74.0

68.

the self is a model that is used to alter your automatic tendencies in order to improve your safety. it vanishes …

the self is a model that is used to alter your automatic tendencies in order to improve your safety. it vanishes bit by bit once safety is no longer in question

by @ftlsid · backlist 2026-06-17 · rubric 72.0

69.

Quick UX tip:

Quick UX tip: Crossing out completed todo items makes them harder to read Checkmarks + dimming are usually enough

by @ctatedev (Chris Tate) · backlist 2026-06-17 · rubric 72.0

70.

Economists often study labor markets using the O*NET database, which breaks ~1000 occupations into tasks. But the…

Economists often study labor markets using the O*NET database, which breaks ~1000 occupations into tasks. But these tasks are too coarse-grained to track automation in AI R&D specifically, even in occupations closest to “AI researcher”.

by @EpochAIResearch (Epoch AI) · backlist 2026-06-17 · rubric 72.0

71.

For years now, the actual rate change announced at every FOMC meeting did not matter. By the time the meeting occ…

For years now, the actual rate change announced at every FOMC meeting did not matter. By the time the meeting occurred, the move was priced into the SOFR curve weeks in advance. The only exception to this was in September 2024 when Powell s

by @fejau_inc (fejau) · backlist 2026-06-17 · rubric 72.0

72.

To succeed at this game, agents must reason about shape rotation, concurrency, and optimizing against competing t…

To succeed at this game, agents must reason about shape rotation, concurrency, and optimizing against competing tradeoffs. To match the human world record on all puzzles would be an insane feat. Agents played the game entirely through a py

by @RobertHaisfield (Rob Haisfield) · backlist 2026-06-17 · rubric 72.0

73.

GLM 5.2 is the new open-weight SOTA on the Vals Index, Vibe Code Bench and Terminal Bench!

GLM 5.2 is the new open-weight SOTA on the Vals Index, Vibe Code Bench and Terminal Bench! It is also #5 across all models, and right on the heels of Opus 4.7 - released only two months ago

by @ValsAI (Vals AI) · backlist 2026-06-17 · rubric 72.0

74.

another day, another batch haha (x.com)

another day, another batch haha - @RicursiveAI ( @annadgoldie ) - @AI21Labs ( @AmnonShashua , @origoshen , @yshoham ) - @unconvai ( @mcarbin ) - @inflectionAI ( @mustafasuleyman ) - @hark_labs ( @adcock_brett ) - @simile_ai (

by @aimalysheva (Sasha Malysheva) · backlist 2026-06-17 · rubric 72.0

75.

paid 1c on a kuala lumpur temperature call

paid 1c on a kuala lumpur temperature call $4,092 on that position right now 10 more open just like it. all at 100c $16,961 all-time. 8,421 predictions. closed tab is wall-to-wall green GFS and ECMWF update every 6h. polymarket prices l

by @0xbobaaa (0xbobaa) · backlist 2026-06-17 · rubric 72.0

76.

This is FALSE

This is FALSE 1. The Govt literally pays €30million+ to a private company, Didean Dochas, to buy houses across the midlands for asylum seekers. This company owns the houses, rents them back to the State and routes all profits through Is

by @Nick_Delehanty (Nick Delehanty ) · backlist 2026-06-17 · rubric 72.0

77.

very hard for AI to blow up -- at current market prices on OpenRouter for GLM 5.2 8 200s cost $370k and can churn…

very hard for AI to blow up -- at current market prices on OpenRouter for GLM 5.2 8 200s cost $370k and can churn $1.47m of tokens a year - so 3-4 month payback period and fully tax deductible as equipment

by @goodalexander · backlist 2026-06-17 · rubric 72.0

78.

Bad Apple but I’m drawing it with Strava, frame 1470

by @linguinelabs (Kevin) · backlist 2026-06-17 · rubric 72.0

79.

Having worked on unlearning for multiple years, it was clear that post-training "fixes" alone were a dead-end. Mo…

Having worked on unlearning for multiple years, it was clear that post-training "fixes" alone were a dead-end. Model learning is way too entangled. With 𝗡𝗨𝗟𝗟𝘀 we decided to architect unlearnability into the model, and scaled it to 1B+

by @pratyushmaini (Pratyush Maini) · backlist 2026-06-17 · rubric 72.0

80.

We are taking a big step towards scaling LLMs that can unlearn on demand.

We are taking a big step towards scaling LLMs that can unlearn on demand. Cleanly deleting data from LLMs has proven impossible: training entangles every source in shared weights. NULLs (Natively Unlearnable LLMs) escapes this, keeping mill

by @gaurav_ghosal (Gaurav Ghosal) · backlist 2026-06-17 · rubric 72.0

81.

We just released an open-weights IDM that action-annotates unlabeled screencasts. We outperform all off-the-shelf…

We just released an open-weights IDM that action-annotates unlabeled screencasts. We outperform all off-the-shelf models (both open and closed!), many of them being orders-of-magnitude bigger. (1/3)

by @lemergenz (Franz Srambical (in MONTREAL)) · backlist 2026-06-17 · rubric 72.0

82.

Etherfi is crushing it with 30k daily credit card transactions and $3m daily volumes

Etherfi is crushing it with 30k daily credit card transactions and $3m daily volumes Over $1b annualized spend on their cards And it’s still being priced at a fraction of other private companies and tokens doing the same thing Think of

by @dcfgod (DCF GOD) · backlist 2026-06-17 · rubric 72.0

83.

Great work! The coding benchmarks are really impressive. Parallel loops are especially good for memory-bound deco…

Great work! The coding benchmarks are really impressive. Parallel loops are especially good for memory-bound decoding particularly on edge devices, because the extra compute can often be hidden under memory access.

by @RidgerZhu (Rui-Jie Zhu) · backlist 2026-06-17 · rubric 72.0

84.

The heron taking flight from the Anadolu Hisarı pier

by @onderkayaistan1 (önder kaya istanbul gezgini) · backlist 2026-06-17 · rubric 72.0

85.

POV: You warned your friend to not build a business on top of the scratchy Claude Code endpoint that Anthropic is…

POV: You warned your friend to not build a business on top of the scratchy Claude Code endpoint that Anthropic is going to be dropped for sure

by @tugot17 (Piotr Mazurek) · backlist 2026-06-17 · rubric 72.0

86.

This is so ironic, cause I’m pretty sure they increasingly feel like (at least in CS adjacent fields) joining a f…

This is so ironic, cause I’m pretty sure they increasingly feel like (at least in CS adjacent fields) joining a frontier lab is their only chance to do frontier (pun) research again

by @Laz4rz (Lazarz) · backlist 2026-06-17 · rubric 72.0

87.

Balmain East House by Studio Johnston

Balmain East House by Studio Johnston Sydney, Australia

by @4AAAAart (Architectural Art (A-A)) · backlist 2026-06-17 · rubric 72.0

88.

Sleepy Wren!

by @CarlBovisNature (Carl Bovis) · backlist 2026-06-17 · rubric 72.0

89.

They use fixed-point residual as a halting signal itself unlike previous papers. I think it's close to EqR in spi…

They use fixed-point residual as a halting signal itself unlike previous papers. I think it's close to EqR in spirit of landscape/attractor shaping as it modified training with pre-norm, residual scaling and damping. Other papers focus on t

by @Louis9687221579 (Louis) · backlist 2026-06-17 · rubric 72.0

90.

I’m honestly very excited about Virat Kohli’s new brand.

I’m honestly very excited about Virat Kohli’s new brand. He walked away from a guaranteed ₹300 crore of Puma money and instead threw his lot in with a little-known Indian brand called Agilitas. This is their story. Agilitas was started

by @harnidhish (Harnidh Kaur) · backlist 2026-06-17 · rubric 72.0