@no_stp_on_snek on Backlist

23.

TurboQuant+ shrinks KV cache memory 4.75x (x.com)

TurboQuant+ uses 3-bit quantization to shrink KV cache memory 4.75x on both CUDA and Metal while preserving near-fp8 top-5 behavior.

by @no_stp_on_snek (Tom Turney) · backlist 2026-05-24 · rubric 88.0