CUDA Stencil Benchmark

High-performance CUDA kernel generation and benchmarking framework

View the Project on GitHub jasonlarkin/cuda-stencil-benchmark

Roofline Method (CPU baseline)

Goal

Assumptions for this stencil

Measure bandwidth ceiling

1) Build triad:

Estimate compute ceiling
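
A theoretical compute ceiling is usually taken as cores × clock × FLOPs per cycle per core. The sketch below is one way to write that down. It is not taken from this project: the function name `peak_gflops`, its parameters, and the example machine (8 cores, 3.0 GHz, 4 double-precision SIMD lanes, 2 FMA units per core) are all assumptions for illustration. Substitute the values for the CPU actually being measured.

```python
def peak_gflops(cores: int, ghz: float, simd_lanes: int,
                fma_units: int, use_fma: bool = True) -> float:
    """Theoretical peak in GFLOP/s for a hypothetical CPU description.

    cores      : physical cores used by the benchmark
    ghz        : sustained clock in GHz (assumed constant under load)
    simd_lanes : elements per vector register (e.g. 4 doubles for 256-bit SIMD)
    fma_units  : vector FMA pipes per core
    use_fma    : count an FMA as 2 FLOPs (multiply + add) when True
    """
    flops_per_cycle = simd_lanes * fma_units * (2 if use_fma else 1)
    return cores * ghz * flops_per_cycle


if __name__ == "__main__":
    # Hypothetical machine, not a measured system.
    ceiling = peak_gflops(cores=8, ghz=3.0, simd_lanes=4, fma_units=2)
    print(f"{ceiling:.1f} GFLOP/s")  # prints "384.0 GFLOP/s"
```

This figure is an upper bound. Sustained clocks under vector load are often lower than the nominal clock, so a measured compute microbenchmark is a useful cross-check.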

Produce roofline from bench CSV

1) Generate CSV via safe sweep:

Interpretation
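
In the standard roofline model, attainable performance at arithmetic intensity AI (FLOP/byte) is min(peak compute, AI × bandwidth). The ridge point, peak / bandwidth, separates memory-bound from compute-bound kernels. The sketch below applies that rule. The function names and every number in the example (384 GFLOP/s peak, 40 GB/s bandwidth, AI of 0.5 FLOP/byte, 15 GFLOP/s measured) are assumptions for illustration, not results from this benchmark.

```python
def attainable_gflops(ai: float, peak_gflops: float, bw_gbs: float) -> float:
    """Roofline bound: the lower of the compute roof and the bandwidth roof."""
    return min(peak_gflops, ai * bw_gbs)


def ridge_point(peak_gflops: float, bw_gbs: float) -> float:
    """Arithmetic intensity (FLOP/byte) at which the two roofs meet."""
    return peak_gflops / bw_gbs


def classify(ai: float, peak_gflops: float, bw_gbs: float) -> str:
    """Label a kernel by which roof limits it at the given intensity."""
    if ai < ridge_point(peak_gflops, bw_gbs):
        return "memory-bound"
    return "compute-bound"


if __name__ == "__main__":
    # Hypothetical ceilings and kernel values, for illustration only.
    peak, bw, ai, measured = 384.0, 40.0, 0.5, 15.0
    roof = attainable_gflops(ai, peak, bw)
    print(classify(ai, peak, bw))                    # prints "memory-bound"
    print(f"{roof:.1f} GFLOP/s roof")                # prints "20.0 GFLOP/s roof"
    print(f"{100 * measured / roof:.0f}% of roof")   # prints "75% of roof"
```

A kernel well below its roof leaves room for optimization at the current intensity. A kernel sitting on the bandwidth roof can only go faster by raising its arithmetic intensity, for example through blocking or data reuse.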

Next (GPU roofline)