KernelGenBench Documentation

A benchmark framework for evaluating LLM-based and agent-based Triton kernel generation across multiple hardware platforms.

Overview

Learn what KernelGenBench is, why it matters, and what it can do for you.

Features

Explore multi-source operators, multi-chip support, anti-hack validation, and evaluation metrics.

LLM Track

Evaluate LLMs on Triton kernel generation using the Pass@K metric.

Agent Track

Evaluate coding agents that iteratively generate, verify, and optimize kernels.

Reference

Datasets, operators, hardware platforms, and technical specifications.

Development

Contribution guides, custom operators, and ways to extend the framework.
