@h100envy on Backlist

45.

Ying Sheng co-wrote SGLang, the inference engine now serving Grok at xAI on a hundred thousand GPUs.

She also built FlexGen, which made a 175-billion-parameter model run on a single consumer GPU, and helped build Chatbot Arena. Three artifacts the

by @h100envy · backlist 2026-06-20 · rubric 82.0