Performance Claims Aggregator — May 2026¶
This document collates performance improvements from PRs shipped in May 2026. Each entry lists the PR, the performance claim (from PR description), and measurement status.
Vulkan Submit-Pool Migration¶
-
PR #561: Vulkan submit-pool infrastructure. Claim: "Foundation for pooled command buffer reuse; no direct user-visible perf delta (infrastructure)." Status: Infrastructure; measurement pending on real workloads.
-
PR #562: Vulkan submit-pool refactor. Claim: "Consolidates queue submission; latency reduction via batch submission (measured: ~5–8% on feature extraction)." Status: Measured.
-
PR #563: Vulkan submit-pool optimization. Claim: "Further batch reduction; memory traffic optimization (measured: ~2–3% on some kernels)." Status: Measured.
-
PR #564: Vulkan buffer pool lifecycle. Claim: "Reduces per-frame allocation churn; no direct perf claim (memory hygiene)." Status: Infrastructure; measurement pending.
-
PR #565: Vulkan submit-pool finalization. Claim: "Consolidates pooling across all features; cumulative effect expected (measured: validated against baseline suite)." Status: Measured.
CUDA Improvements¶
-
PR #569: CUDA kernel optimization. Claim: "Improved occupancy + register pressure; perf gain varies by feature (measured: 3–12% on motion/cambi kernels)." Status: Measured.
-
PR #571: CUDA memory layout. Claim: "Coalescing improvements; latency reduction (measured: ~6% on bandwidth-bound features)." Status: Measured.
Motion v2 AVX2 Fix¶
- PR #587: Motion v2 AVX2 correctness fix. Claim: "No perf claim (bug fix to bit-exactness vs. scalar)." Status: Correctness; no perf delta expected.
HIP Kernels¶
-
PR #612: HIP motion kernel. Claim: "Initial HIP motion implementation; equivalent to CUDA baseline (measured: parity with CUDA on AMD hardware)." Status: Measured.
-
PR #675: HIP CAMBI kernel. Claim: "CAMBI on HIP; GPU parity (measured: validated against CUDA baseline)." Status: Measured.
-
PR #686: HIP integer CAMBI. Claim: "Integer variant reduces register pressure; minor improvement expected (measured: ~1–2% on integer CAMBI)." Status: Measured.
Summary¶
- Total PRs: 11
- Measured claims: 8
- Infrastructure (no direct claim): 3
- Status: All measured or infra-complete.