42.
I actually spent nearly a whole day implementing this thing from scratch. In the end, under the same throughput a…
I actually spent nearly a whole day implementing this thing from scratch. In the end, under the same throughput and VRAM usage, the precision (measured by PPL) still couldn't beat TurboQuant. What a complete waste of time.