Show HN: INT21 – Self-Improving PTX Kernel Factory
INT21 has launched a self-improving PTX Kernel Factory that automates the creation of highly optimized CUDA GPU kernels. The system writes, tests, and benchmarks kernel candidates, then iteratively refines them with the goal of matching or exceeding the performance of hand-tuned kernels. Initial results show strong performance on GEMM and RNN operations, and INT21 plans to expand the system's functionality.
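
The write, test, benchmark loop described above can be illustrated with a small sketch. This is not INT21's implementation, which the post does not detail. It is a CPU-only toy in plain Python, where the "kernel candidates" are blocked GEMM variants that differ only in tile size. All names here (`reference_gemm`, `make_tiled_gemm`, `is_correct`, `benchmark`, `search`) are hypothetical and chosen for illustration. A real system would generate PTX and time it on a GPU instead.

```python
import random
import time


def reference_gemm(a, b):
    """Naive matrix multiply used as the correctness oracle."""
    n, k, m = len(a), len(b), len(b[0])
    return [[sum(a[i][p] * b[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]


def make_tiled_gemm(tile):
    """Build one candidate: a blocked GEMM with the given tile size.

    Hypothetical stand-in for a generated kernel variant.
    """
    def gemm(a, b):
        n, k, m = len(a), len(b), len(b[0])
        c = [[0.0] * m for _ in range(n)]
        for i0 in range(0, n, tile):
            for p0 in range(0, k, tile):
                for j0 in range(0, m, tile):
                    for i in range(i0, min(i0 + tile, n)):
                        row_c, row_a = c[i], a[i]
                        for p in range(p0, min(p0 + tile, k)):
                            a_ip, row_b = row_a[p], b[p]
                            for j in range(j0, min(j0 + tile, m)):
                                row_c[j] += a_ip * row_b[j]
        return c
    return gemm


def is_correct(candidate, a, b, expected, tol=1e-9):
    """Test step: compare a candidate's output with the reference result."""
    got = candidate(a, b)
    return all(abs(g - e) <= tol
               for got_row, exp_row in zip(got, expected)
               for g, e in zip(got_row, exp_row))


def benchmark(candidate, a, b, repeats=3):
    """Benchmark step: best wall-clock time over a few runs, in seconds."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        candidate(a, b)
        best = min(best, time.perf_counter() - start)
    return best


def search(tile_sizes, n=24, seed=0):
    """Generate candidates, drop incorrect ones, rank the rest by speed."""
    rng = random.Random(seed)
    a = [[rng.random() for _ in range(n)] for _ in range(n)]
    b = [[rng.random() for _ in range(n)] for _ in range(n)]
    expected = reference_gemm(a, b)
    results = []
    for tile in tile_sizes:
        candidate = make_tiled_gemm(tile)
        if not is_correct(candidate, a, b, expected):
            continue  # a wrong kernel is never benchmarked
        results.append((benchmark(candidate, a, b), tile))
    return sorted(results)  # fastest candidate first


if __name__ == "__main__":
    for seconds, tile in search([2, 4, 8, 16]):
        print(f"tile={tile:2d}  best time={seconds * 1e3:.3f} ms")
```

In this sketch the correctness check gates the benchmark, so a fast but wrong candidate can never win. An iterative refinement step could feed the top-ranked tile sizes back into `search` with nearby values, though how INT21 proposes and refines candidates is not stated in the post.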