32.
PP 1841.7 tk/s | TG 101.3 tk/s | Context 735K (t.co)
2x #2080Ti 22GB, NVLinked, run Qwen3.6-27B-AWQ through vLLM (TP=2, MTP K=3, KV=tq4nc) at extraordinary single-request performance! Maximizing the AI value of this $500 legacy setup. https:// github.com/w