14.
Can LLMs predict GPU kernel runtimes?
A 12K-kernel benchmark suggests LLMs can act as selective runtime surrogates during kernel search and defer to real GPUs when uncertain
1 appearance on the backlist front page in the last 30 days.
A 12K-kernel benchmark suggests LLMs can act as selective runtime surrogates during kernel search and defer to real GPUs when uncertain