Research-0136: HDR/UGC Dataset License Audit (2026-05-15)¶

Status: Complete Date: 2026-05-15 Branch: research/hdr-ugc-dataset-license-audit-2026-05-15 Related ADR: ADR-0459 (panel-aware workstream) Companion task: Batch 16 of the 2026-05-15 Gap-Fill Plan

Summary¶

License, distribution-stability, and actionability assessment for the 13 HDR/UGC video quality dataset leads enumerated in Audit Slice C.7 (2026-05-15). Findings directly inform which datasets can be ingested into local training pipelines this week versus which require negotiation or infrastructure investment before they become useful.

Results: 6 datasets are ACTIONABLE-NOW, 5 are BLOCKED (access or license restrictions), 1 is ASPIRATIONAL-scale (needs infrastructure), and 1 (CHUG) is already active. HDRSDR-VQA introduced a new panel/display-aware workstream scoped in ADR-0459.

Dataset Summary Table¶

#	Dataset + URL	Scale	License	Distribution	HDR coverage	HFR coverage	Bit depth	Resolution	Subjective method	Actionability	Recommended next step
1	Beyond8Bits	~44k clips, ~1.5M crowd ratings	CC BY-SA 4.0 (metadata); NC restriction on video payload ("non-commercial research" + no redistribution + no commercial deployment)	Direct download via AWS S3; no sign-up wall	HDR-only — PQ/BT.2020, 10-bit HEVC ladder	24/25/30/60 fps	10-bit	360p–1080p + source ref	Single-stimulus ACR, AMT workers, SUREAL-MLE aggregation	ACTIONABLE-NOW	Direct S3 download of metadata + selected clips; cite Chen et al. + comply with NC clause
2	CHUG	856 source / 5,992 distorted videos, 211k ratings	CC BY-SA 4.0	Direct download via AWS S3; no sign-up	UGC HDR (PQ 10-bit HEVC)	120 fps clips confirmed (~189 clips); mixed 24/30/60/120 fps	10-bit	Mixed 360p–1080p portrait + landscape	Single-stimulus ACR, AMT workers	Already active — local pipeline running	HFR normalisation patch; add `--metadata-jsonl` to live invocation
3	YouTube SFV+HDR	~4k source contents, ~2k native HDR (per paper)	YouTube Terms of Service — research use only; no redistribution	JavaScript-rendered landing page; access appears to require YouTube research partner agreement	Mixed HDR + SDR	Unknown (YouTube typical: 24/30/60 fps)	10-bit (HDR)	Up to 4K	Unknown — no public methodology doc found	BLOCKED-on-access	Contact YouTube Research team at media.withyoutube.com; partnership or signed agreement likely required
4	HDRSDR-VQA	558 clips (31 open-sourced), 145 subjects; 22k pairwise JOD scores across 6 displays	Custom academic — "no royalty, no written agreement; use + distribute freely with citation" (Chen et al. 2025)	Google Form sign-up required; no NDA	Mixed HDR10 + SDR matched pairs	50/60 fps clips documented	Not explicitly stated	Not explicitly stated	Paired comparison across 6 HDR TVs; JOD (Just-Objectionable-Difference) scores	ACTIONABLE-NOW	Complete Google Form; download open-sourced 31-video subset; begin panel-aware tuning design (ADR-0459)
5	LIVE HDR Database	310 videos, 31 content sources, 66 subjects (40 for quality), 20k+ opinion scores	Custom academic — "no royalty, no written agreement; use + distribute freely with citation" (Shang et al. ICIP 2022)	Google Form sign-up; password protected	HDR10 only	50/60 fps	Not stated	Not stated	Controlled-lab ACR; two ambient lighting conditions (dark lab + bright living room)	ACTIONABLE-NOW	Complete Google Form to receive password + link; download and ingest
6	AVT-VQDB-UHD-2-HDR	31 source videos (8K UHD-2 HDR)	CC BY-NC 4.0 (TU Ilmenau content)	Google Form sign-up for video sources; objective metrics and metadata on GitHub; checksums provided	HDR full — 8K UHD-2	Not specified	Not specified (8K HDR → likely 10-bit)	8K (UHD-2 = 7680×4320)	Not fully specified (paper in QoMEX 2024)	BLOCKED-on-access	Complete Google Form; note that 8K decode requires significant compute — assess infra fit before download
7	IPI-MobileHDRVQA	60 source videos, AV1 codec variants; 1.2 GB download (27.4 GB this version)	CC BY 4.0	Direct Zenodo download; no sign-up required	UGC mobile HDR	Not specified	Not specified	Mixed mobile resolutions	ACR-HR (ACR with Hidden Reference); MOS + DMOS + 95% CI provided; lab study at Nantes University	ACTIONABLE-NOW	Direct Zenodo download — `curl https://zenodo.org/records/11544387/files/...`
8	SHU-HDRQAD	1,409 distorted HDR images; 147 scenes; 6 distortion types	No license stated in repository	Google Drive download (no formal access control, but no explicit license grant)	HDR images (IQA not VQA)	N/A — image dataset	Not stated	Not stated	MOS values provided; methodology details absent from README	BLOCKED-on-license	Request explicit license from maintainers before ingesting; image-only limits utility for VQA training
9	BrightVQ	300 source / 2,100 transcoded videos; 73,794 crowd ratings	CC BY-NC 4.0	Direct AWS S3 download (browser or CLI); full-dataset link marked "COMING SOON" at time of audit	HDR UGC — 360p to 1080p	Not specified	Not specified	360p–1080p	Single-stimulus ACR, AMT workers; MOS provided	BLOCKED-on-access	Monitor GitHub for full-dataset download link; contact author (shreshthsaini) for early access
10	BVI-AOM / BVI-CC	956 sequences (239 unique 4K sources); 64 frames each; multi-resolution (480p–2176p)	Research-only; "material shall only be used for developing future video coding standards, for training or performance evaluation of test models in JVET and AOM"	Direct S3 download (124 GB); no NDA but restricted to JVET/AOM standard development	SDR only (BVI-AOM is SDR despite being 10-bit)	24–50 fps	10-bit, 4:2:0	480p–2176p (source 4K)	No subjective; objective only (PSNR-Y, VMAF)	BLOCKED-on-license	Do not ingest for general VQA training — license restricts to JVET/AOM use; use BVI-DVC (already ingested) instead
11	SJTU HDR Video Sequences	16 video sequences	CC BY-NC-ND 4.0	Academic-only; contact admin for Dropbox/file-share access; no public direct download link	HDR — Sony RAW 16-bit, OpenEXR output, S-Gamut/S-Log3, BT.2020, SMPTE ST.2084	Not specified	16-bit raw (half-float 4:4:4 RGB OpenEXR)	UHD (4K)	No subjective ratings	BLOCKED-on-access	Contact SJTU Media Lab admin; note ND clause prohibits format conversion or derivative works
12	HDR-VDC	16 reference + 132 test videos; 30 subjects; JOD scores	CC BY 4.0	DOI repository (Cambridge Apollo): `https://doi.org/10.17863/CAM.107964`; direct download	HDR (PQ, LG G2 OLED displays)	Not specified	Not specified	720p, 1080p, 4K	Pairwise comparison; JOD (Just-Objectionable-Difference); 2 luminance levels × 2 viewing distances	ACTIONABLE-NOW	Direct Cambridge Apollo DOI download; CC BY 4.0 permits training use with attribution
13	AGH/NTIA/Dolby (CDVL)	Unknown — content search required after registration	No specific license quoted on public pages; "freely available for research and development" per site copy	Registration required ("free to join"); no NDA mentioned; members-only access	Possibly HDR (Dolby involvement suggests HDR); confirmation requires access	Unknown	Unknown	Unknown	Unknown	BLOCKED-on-access	Register at cdvl.org (free); search for `AGH-NTIA-Dolby` after login; confirm HDR coverage and license before committing to ingest

Per-Dataset Prose Assessment¶

1. Beyond8Bits¶

Beyond8Bits is the largest HDR-UGC corpus audited, with approximately 44,000 transcoded videos derived from 6,861 crowd-sourced source videos and over 1.5 million AMT quality ratings. The dataset is methodologically rigorous: ratings use SUREAL Maximum Likelihood Estimation aggregation, workers are screened for HDR-display ownership, and a golden-set calibration step filters low-quality annotators. All content is PQ/BT.2020 10-bit HEVC across a five-rung bitrate ladder (0.2–5 Mbps) at 360p, 720p, and 1080p — matching the CHUG transcoding philosophy closely.

The license situation requires care. Metadata is released under CC BY-SA 4.0, but the video payloads carry an additional "non-commercial research" restriction that prohibits redistribution outside the approved S3 mirror and deployment in commercial products without a separate license from UT Austin / YouTube. This NC clause is compatible with internal training but rules out shipping model weights that are demonstrably trained on this corpus without legal review, mirroring the existing BVI-DVC posture (weights ship locally only).

Frame rate coverage — 24/25/30/60 fps — does not include 120 fps, making it less affected by the HFR normalisation gap identified in CHUG's audit. It is the natural scale-up corpus to run after CHUG extraction completes.

2. CHUG (already active)¶

Already actively ingesting locally (chug_pipeline.pid running, 3,597 clips complete at audit time). Key open issues from Audit Slice H: missing --metadata-jsonl flag in the live invocation (content-split columns absent from the finished parquet), and the HFR normalisation gap (189 clips at 120 fps, ~525 at 60 fps, all processed without motion_fps_weight normalisation). Neither issue is blocked on external access; both are internal pipeline fixes tracked as open items. No further access work required.

3. YouTube SFV+HDR¶

The media.withyoutube.com/sfv-hdr landing page is JavaScript-rendered and returns only a title heading via WebFetch, consistent with the page being a research partner portal rather than a public download page. No paper preprint with a verified public arXiv ID was located. Based on secondary references in the HDR VQA literature, the dataset covers approximately 4,000 content items of which roughly 2,000 are native HDR; methodological details are not publicly documented. Access likely requires a YouTube Research partnership or signed data-sharing agreement. This dataset represents the highest potential scale in the audit cohort but is the most opaque regarding access terms. A direct contact with the YouTube Research team is the only viable path this week.

4. HDRSDR-VQA¶

This is the most strategically important dataset in the audit for the vmaf-tune toolchain specifically. The dataset covers 960 total video clips (558 with public open-source content; the remaining 145 subjects' VoD and live sports clips are unavailable due to copyright). The distinguishing feature is the six-display pairwise evaluation design: 145 participants rated HDR10 versus matched SDR versions on six distinct HDR televisions, producing scaled JOD scores. This introduces display-type variability data that no other dataset in the cohort provides, making it the unique resource for the panel-aware vmaf-tune training workstream scoped in ADR-0459. The license is permissive academic (no NDA, no royalty, citation required). Accessing the 31 open-sourced videos via Google Form is the immediate action. The 10 VoD clips and 10 live sports clips remain unavailable due to third-party copyright.

5. LIVE HDR Database¶

The LIVE HDR database provides 310 video clips from 31 source contents, evaluated by 66 participants (40 for the quality scoring task) under two ambient conditions. Its license is permissive academic (identical clause to HDRSDR-VQA: no royalty, no written agreement, cite the paper). The Google Form access gate is a minor friction; no NDA or institutional affiliation requirement is stated. The dataset is complementary to HDRSDR-VQA in that it studies ambient viewing condition effects (dark lab versus bright living room) rather than display panel variability. At 20,000+ opinion scores it is moderately sized and appropriate for validation of panel-aware or ambient-condition-aware quality models.

6. AVT-VQDB-UHD-2-HDR¶

This is the only 8K HDR corpus in the audit. The CC BY-NC 4.0 license permits non-commercial research use. The Google Form access gate implies active curation by TU Ilmenau. The principal limitation for immediate ingestion is infrastructure: 8K HDR decode requires significantly more compute than the 1080p CHUG clips, and the local NVIDIA GPU pipeline may not efficiently process 8K without additional buffering. The QoMEX 2024 paper should be reviewed before committing to download (124 GB S3 equivalent is the BVI-AOM reference point; this dataset is likely similar scale). The subjective methodology is not fully documented on the GitHub page; details are in the QoMEX paper. Not a week-1 priority but represents the only available 8K HDR quality corpus.

7. IPI-MobileHDRVQA¶

The simplest acquisition path of the 13: a CC BY 4.0 Zenodo record with direct download, no registration, 1.2 GB for the primary download (27.4 GB for this version, 197 GB total across all versions). The AV1 codec coverage is a gap relative to CHUG (HEVC-encoded), but AV1 represents an increasing fraction of real-world HDR delivery. The ACR-HR methodology with hidden reference is slightly richer than standard ACR, enabling DMOS computation. At 60 source videos it is small but immediately actionable without infrastructure investment. The mobile-oriented resolution mix (portrait mode content) adds a domain the CHUG pipeline does not cover. Download this week; run standard VMAF feature extraction against the AV1 bitstreams.

8. SHU-HDRQAD¶

This is an image quality (IQA) dataset, not a video quality dataset. At 1,409 HDR images across 147 scenes it has reasonable IQA scale, but the complete absence of an explicit license in the GitHub repository is a blocker: the Google Drive hosting provides no formal access controls but also provides no rights. Using unlicensed data for training model weights that ship in a commercial fork is legally risky. Additionally, the VQA training pipeline requires temporal content; a static image dataset has limited utility for training motion or compression-artifact quality models. The recommended action is to contact the maintainers requesting an explicit license grant — if they respond with CC BY or similar, the dataset could serve as an HDR IQA validation set. Skip for the near term.

9. BrightVQ¶

BrightVQ is architecturally very similar to Beyond8Bits and CHUG (same UT Austin + AMT + AWS S3 pipeline, CC BY-NC 4.0). At 73,794 ratings from 300 source videos it is substantially smaller than Beyond8Bits. The critical finding is that the full-dataset download link was marked "COMING SOON" at the time of audit (2026-05-15). This is not a license blocker — CC BY-NC 4.0 permits research use — but it is a practical access blocker. The dataset is from the same research group (Shreshth Saini, UT Austin) that published Beyond8Bits and CHUG, suggesting the data will become available through the same S3 mechanism. Monitor the GitHub repository; contact the author for early access if the timeline aligns with training needs.

10. BVI-AOM / BVI-CC¶

Despite the attractive technical specifications (10-bit, multi-resolution, 124 GB corpus, VMAF evaluation baseline), BVI-AOM's license explicitly restricts use to "developing future video coding standards" and "training or performance evaluation of test models in JVET and Alliance for Open Media." This restriction is incompatible with general vmaf-tune model training. The BVI-DVC corpus from the same Bristol VI Lab group is already ingested (PR #310, ADR-0310) under a research license; BVI-AOM adds no HDR content (it is confirmed SDR 10-bit) and carries a narrower license. Do not ingest BVI-AOM for the vmaf-tune pipeline.

11. SJTU HDR Video Sequences¶

SJTU provides 16 professional-grade HDR sequences captured with Sony F65/F55 cameras at 16-bit raw, stored as half-float 4:4:4 RGB OpenEXR after SMPTE ST.2084 tone mapping. The technical quality is exceptional — over 14 stops of dynamic range, BT.2020 primaries, S-Gamut/S-Log3. The access model (contact admin, download via Dropbox/file share) is manageable. However, two issues make this a low priority: (1) the CC BY-NC-ND 4.0 license's NoDerivatives clause explicitly prohibits format conversion, meaning the OpenEXR files cannot legally be decoded to YUV for VMAF feature extraction without constituting a "derivative work" under a strict reading; (2) there are no subjective quality ratings — the sequences serve as pristine HDR reference content for codec evaluation, not as a VQA training corpus. Contact SJTU to clarify the derivative-works scope before proceeding.

12. HDR-VDC¶

HDR-VDC (Cambridge gfxdisp lab, 2024) is a clean CC BY 4.0 corpus with 148 videos, 30 subjects, and pairwise JOD scores across AV1 compression and Lanczos upscaling distortions at three resolutions (720p, 1080p, 4K) on a high-quality LG G2 OLED display. The CC BY 4.0 license is the most permissive in the audit cohort — it permits redistribution and derivative works with attribution, making it suitable as a training corpus without the NC restrictions on weight shipping that Beyond8Bits and BrightVQ carry. The DOI-linked Cambridge Apollo repository provides a stable, archival download. The primary limitation is scale (30 subjects, 148 clips versus CHUG's 5,992). It is a strong validation set for the HDR-VDC quality dimension specifically (compression + upscaling artefacts on OLED). Download immediately.

13. AGH/NTIA/Dolby (CDVL)¶

The Consumer Digital Video Library (CDVL.org) requires free registration to access content. The public-facing pages state that content is "freely available for research and development purposes" once registered, but no specific license text is available without logging in. The AGH/NTIA/Dolby sequences are not listed on the public pages; their existence and HDR coverage must be verified post-registration. Dolby's involvement is a positive signal for HDR content but also introduces potential IP complexity. The CDVL is a well-established academic resource (long history of MPEG and HEVC evaluation content) and registration is low-friction. Register this week; search for the AGH/NTIA/Dolby collection; evaluate the license and HDR metadata before committing further.

Top 3 Priority Recommendations¶

Priority 1 — IPI-MobileHDRVQA (Zenodo CC BY 4.0) — Download this week¶

Why first: Zero friction. No registration, no sign-up, no NDA. CC BY 4.0 permits training use and derivative weight shipping without NC restrictions. At 60 source videos and 1.2–27 GB it fits on local storage, and AV1 content complements CHUG's HEVC coverage. Feature extraction can run using the existing CPU pipeline while the CHUG CUDA run continues.

Concrete action (this week): curl -L https://zenodo.org/records/11544387/files/<manifest> then run extract_k150k_features.py (NR mode) or wait for the CHUG chug_extract_features.py FR-pair adaptation to handle AV1 bitstreams. Open a BACKLOG entry: T-IPIHDRVQA-INGEST.

Priority 2 — HDRSDR-VQA (Google Form, permissive academic) — Apply today¶

Why second: Unique data. No other dataset in the cohort provides per-display-type pairwise HDR-vs-SDR quality scores. This data is the direct prerequisite for the panel-aware vmaf-tune workstream (ADR-0459). The Google Form gate is minutes of friction. The 31 open-sourced clips are available immediately after form submission; the 145-participant JOD score files likely accompany them.

Concrete action (this week): Submit Google Form at https://live.ece.utexas.edu/research/Bowen_SDRHDR/sdr-hdr-bowen.html; download the open-source 31-clip subset; extract VMAF features and JOD scores; design the panel-aware MOS-head schema (see ADR-0459 §Context).

Priority 3 — HDR-VDC (Cambridge Apollo, CC BY 4.0) — Download this week¶

Why third: Most permissive license in the cohort (CC BY 4.0, no NC restriction). The AV1 + Lanczos upscaling distortion types are complementary to CHUG's HEVC compression focus. The Cambridge Apollo DOI link (https://doi.org/10.17863/CAM.107964) provides archival stability. At 148 clips it is small but immediately usable as a validation holdout set or supplementary training shard.

Concrete action (this week): wget or curl from the Cambridge Apollo download page; add to BACKLOG as T-HDRVDC-INGEST. Extract VMAF features using existing CPU pipeline.

New Workstream Surfaced — Panel/Display-Aware vmaf-tune¶

Background¶

HDRSDR-VQA introduces a six-display pairwise evaluation design (OLED/QLED/LCD panels of varying peak luminance and color volume) that reveals substantial display-type-dependent perceptual quality differences for the same HDR10 + SDR video pair. This is not captured by any existing vmaf-tune model or recommendation workflow: the current vmaf-tune recommend path is display-agnostic — it assumes a generic high-end HDR display. In the field, HDR content is consumed on a range of panels where the same CRF setting can be perceptually optimal on one display and visibly degraded on another.

Proposed T-number¶

T-VMAFTUNE-PANEL-AWARE — Panel/display-aware vmaf-tune recommendation workstream.

Scope: Train a display-conditioned MOS/JOD head that accepts a display_type or peak_luminance_nits input feature alongside the standard VMAF feature vector. The head produces display-conditioned quality estimates enabling vmaf-tune recommend --display-profile {oled|qled|lcd|generic}. Data source: HDRSDR-VQA six-display JOD scores + HDRVDC OLED measurements as a second signal.

Estimated effort: T4 (medium: data ingestion 1–2 days, model head training 1 day, CLI integration 2 days, docs 1 day). Not blocked on any open PR.

ADR Reference¶

See ADR-0459 (Status: Proposed) for the full decision scaffold, alternatives considered, and consequences.