Can LLMs mimic the GPU kernel engineering workflow?
Determine whether large language models can mimic the real-world GPU kernel engineering workflow used to develop GPU kernels, including effective use of compiler feedback, profiling metrics, hardware-specific specifications and instruction sets, and hardware-efficiency techniques such as tiling and operator fusion.
References
AI engineers use a rich set of information when developing kernels and it is not clear whether LMs can mimic the workflow.
— KernelBench: Can LLMs Write Efficient GPU Kernels?
(2502.10517 - Ouyang et al., 14 Feb 2025) in Section 1 (Introduction)