24.
Cross-node KV-cache reuse for agentic rollouts
Cross-node prefix-cache reuse in vLLM via Mooncake Store makes agent rollouts cheaper by sharing reused context across distributed training nodes
1 appearance on the backlist front page in the last 30 days.
Cross-node prefix-cache reuse in vLLM via Mooncake Store makes agent rollouts cheaper by sharing reused context across distributed training nodes