31.
Our inference stack, optimized for Blackwells, with a novel attention kernel and many new optimizations has start… (x.com)
Our inference stack, optimized for Blackwells, with a novel attention kernel and many new optimizations has started rolling out! It's already charting on Artificial Analysis, eg: #1 speed and latency for @Kimi_Moonshot Kimi 2.6. #1 on l