@aryagm01 on Backlist

1 appearance on the backlist front page in the last 30 days.

43.

I ported HRM-Text-1B to Apple MLX On an M4 Max: PyTorch MPS BF16: 22 tok/s HRM-mlx BF16: 28 tok/s HRM-mlx 4-bit: 53 tok/s That’s 2.4x faster single-response decode, with hosted MLX BF16 + 4-bit checkpoints

by (Arya Manjaramkar) · backlist 2026-05-20 · rubric 92.0