34.
Can an LLM act as a selective model of a GPU during evolutionary search, by reasoning + forecasting a kernel’s runtime but deferring to a GPU when unsure? We produced 12k kernels + runtimes from evolutionary search, costing 400M reasoning t