48.
I took one key insight from this convo: inference disaggregation between prefill and decode enables GPU lifespans to be extended to 10+ years. This totally shifts the risk and return profile of datacenter capex - especially for neoclouds suc