Research-0082 — BVI-DVC corpus feasibility for fr_regressor_v2¶
Question¶
Can adding the BVI-DVC reference corpus (Bristol VI Lab, 2021) to the fr_regressor_v2 training corpus alongside the Netflix Public drop produce a measurable LOSO PLCC lift, given the license, content-overlap, and fold-expansion posture the fork commits to?
Method¶
- Audit the BVI-DVC license terms via the Bristol Visual Information Lab portal and Zenodo metadata; classify against fork redistribution rules.
- Compare BVI-DVC's content distribution against the Netflix Public drop (9 sources) — qualitative content-class overlap (cinematic film_drama, sports, animation, wildlife, urban, texture).
- Project the LOSO partition expansion: today the trainer holds out one of 9 sources per fold; with BVI-DVC's tier-D added, the partition expands to 9 + N source-folds.
Result¶
1. License analysis¶
BVI-DVC is distributed under research-only terms — the corpus may be used for research and may not be redistributed. The fork's existing posture for the Netflix Public drop (ADR-0203) is the same model: local-only handling, derived weights ship, source corpus does not. The redistribution rule for BVI-DVC therefore requires no new infrastructure; it requires the same gitignore + README discipline already in place for .workingdir2/netflix/.
Concrete rules:
.workingdir2/BVI-DVC Part 1.zip— gitignored..workingdir2/bvi-dvc-extracted/— gitignored.runs/full_features_bvi_dvc_*.parquet— gitignored.runs/bvi_dvc_corpus.jsonl— gitignored.~/.cache/vmaf-tiny-ai-bvi-dvc-full/— outside repo, never committed.model/tiny/fr_regressor_v2*.onnx— derived weights, shippable.
2. Content overlap with Netflix Public¶
| Content class | Netflix sources | BVI-DVC tier-D coverage |
|---|---|---|
| film_drama | Seeking, ElFuente1/2, OldTownCross | Sparse |
| sports / high-motion | CrowdRun, Tennis | Strong (e.g. tai-chi, walking, market scenes) |
| animation | BigBuckBunny | None |
| wildlife | BirdsInCage, FoxBird | Some (ferris-wheel, hong-kong-market) |
| urban / architectural | None | Strong (street, rooftop, canal) |
| texture-heavy | None | Strong (BVITexture clips) |
The Netflix drop under-represents urban and texture-heavy content; BVI-DVC fills both gaps. Animation has no BVI-DVC analogue, so BigBuckBunny remains the only anchor for that class.
3. LOSO partition expansion¶
Netflix-only LOSO trains 9 models, one per held-out source. Adding BVI-DVC tier-D (~120 sources) expands the partition to 9 + ~120 source-folds. Each fold trains on a much larger remainder set, which should narrow the per-fold variance band.
Trade-off: tier-D's resolution (480 × 272) is well below typical production resolutions, so weighting BVI-DVC tier-D folds equally with Netflix HD/UHD folds may bias the regressor toward low-res behaviour. Two mitigations are available:
- include tier-B (1920 × 1088) and tier-C (960 × 544) instead of / alongside tier-D — increases extraction cost ~16× per clip;
- weight folds by source-resolution category in the LOSO aggregator — changes only the reporting layer.
The ingestion ADR-0310 ships the infrastructure tier-agnostic; tier selection is a runtime flag, deferred to the multi-seed sweep.
Conclusion¶
License is compatible with the fork's existing local-only corpus posture. Content overlap is favourable: BVI-DVC fills the urban and texture gaps the Netflix drop has and reinforces high-motion. The LOSO partition expansion materially widens the training surface. The infrastructure to make this measurable (JSONL adapter + merge utility
- tests) is small and ships with ADR-0310.
The actual PLCC measurement and the production-weights flip are deferred to a multi-seed sweep that runs outside this PR. The flip gate stays anchored on ADR-0303's ensemble criterion; a corpus expansion that does not lift mean LOSO PLCC by ≥ 1σ above the Netflix-only baseline is not shipped to production weights.
References¶
- Ma, Zhang, Bull. BVI-DVC: A Training Database for Deep Video Compression. IEEE Transactions on Multimedia, 2021.
- ADR-0203 — Netflix Public drop redistribution posture.
- ADR-0235 —
fr_regressor_v2. - ADR-0303 — ensemble-flip ship gate (corpus-expansion ship criterion lives here).
- ADR-0310 — this digest's decision record.