VIF¶

VIF (Visual Information Fidelity) measures how much of the reference image's visual information is preserved in the distorted image, using a statistical model of natural scenes and human visual system sensitivity.

VIF is a luma-only metric by design (Sheikh & Bovik, 2006 — defined on a single luminance channel). Every libvmaf backend — CPU (AVX2, AVX-512, NEON), CUDA, HIP, SYCL, Metal — and upstream Netflix/vmaf reads only the Y plane (data[0]) of the input picture. There are no per-chroma-plane VIF features. See ADR-0597 for the disposition of the 2026-05-18 deep-audit finding that questioned this design.

Variant¶

Extractor name	Algorithm	Options
`integer_vif`	Integer fixed-point multi-scale implementation	`vif_enhn_gain_limit`, `vif_skip_scale0`, `debug`
`float_vif`	Floating-point 4-scale Gaussian-pyramid implementation (feature `float_vif` + `float_vif_scale0..3`)	`vif_enhn_gain_limit`, `vif_kernelscale`, `vif_sigma_nsq`, `vif_skip_scale0`, `debug`

float_vif is implemented on every GPU backend: CUDA (float_vif_cuda), SYCL (float_vif_sycl), HIP (float_vif_hip), and Metal (float_vif_metal, feature/metal/float_vif.metal + _metal.mm) — a 4-scale separable-Gaussian pyramid with per-scale mean/variance/covariance statistics, parity-checked vs the CPU float_vif on the macOS Apple-Silicon CI lane (places=4, ADR-0214).

The CUDA twin (vif_cuda) additionally carries a vestigial enable_chroma option (see Options table below); it is a documented no-op.

`integer_vif` extractor¶

The extractor uses an integer fixed-point implementation of the VIF algorithm across multiple spatial scales. It is the extractor invoked when VMAF model JSON files reference "integer_vif".

integer_vif (feature vif) is implemented on every GPU backend: CUDA (vif_cuda), SYCL (vif_sycl), HIP (vif_hip), and Metal (integer_vif_metal, feature/metal/integer_vif.metal + _metal.mm) — a 4-scale fixed-point Gaussian pyramid with int64 moment accumulators and the integer log2-LUT, mirroring the proven float_vif_metal scaffold. Parity vs the CPU vif is checked on the macOS Apple-Silicon CI lane (places=4, ADR-0214); because Apple GPUs lack fp64 the per-pixel gain is computed in float (the same fp64-free trade-off as the SYCL twin, ADR-0220).

Output features¶

All features are computed on the luma (Y) plane only.

Feature name	Description	Condition
`integer_vif`	Fused VIF score (luma)	`debug=true`
`VMAF_integer_feature_vif_scale0_score`	VIF at scale 0 (finest, luma)	Always (or `0.0` when `vif_skip_scale0=true`)
`VMAF_integer_feature_vif_scale1_score`	VIF at scale 1 (luma)	Always
`VMAF_integer_feature_vif_scale2_score`	VIF at scale 2 (luma)	Always
`VMAF_integer_feature_vif_scale3_score`	VIF at scale 3 (coarsest, luma)	Always
`integer_vif_num`, `integer_vif_den`	Fused numerator/denominator	`debug=true`
`integer_vif_num_scaleN`, `integer_vif_den_scaleN` (N=0..3)	Per-scale num/den	`debug=true`

Options¶

Option	Alias	Type	Default	Range	Effect
`vif_enhn_gain_limit`	`egl`	double	`100.0`	`1.0–100.0`	Cap the per-pixel enhancement-gain ratio so over-sharpened content cannot saturate the score. Set to `1.0` to disable enhancement gain entirely.
`vif_skip_scale0`	`ssclz`	bool	`false`	—	Skip scale-0 (finest pyramid level) calculations; scale-0 outputs are forced to `0.0` and excluded from the fused score. Matches the CPU path for GPU backends.
`debug`	—	bool	`false`	—	Emit additional per-scale numerator/denominator debug metrics.
`enable_chroma` (CUDA twin only)	—	bool	`false`	—	No-op retained for backward compatibility with callers that pass the option on the CLI or in a model JSON. VIF is luma-only by design; setting `enable_chroma=true` emits a one-shot warning during init and otherwise has no effect. See ADR-0597.

How to run¶

# Luma-only VIF (the only mode that exists across every backend)
core/build/tools/vmaf \
    --reference ref.yuv --distorted dist.yuv \
    --width 1920 --height 1080 --pixel_format 420 --bitdepth 8 \
    --no_prediction --feature integer_vif --output /dev/stdout

# Skip scale-0 (GPU-parity mode)
core/build/tools/vmaf \
    --reference ref.yuv --distorted dist.yuv \
    --width 1920 --height 1080 --pixel_format 420 --bitdepth 8 \
    --no_prediction --feature 'integer_vif:vif_skip_scale0=true' --output /dev/stdout

Cross-backend parity¶

The core/test/test_integer_vif_cpu_cuda_parity.c smoke test (suite fast, runs when enable_cuda=true) asserts CPU vs CUDA vif_scaleN_score agreement within 1e-5 on the Netflix src01_hrc00_576x324 reference pair — making the luma-only parity claim a regression gate.