Demonstrations of the visual effects of XPSNR based optimization during encoding
The following illustrations serve as a demonstration of the visual benefit of using the perceptually optimized quantization parameter adaptation (QPA) in a transform-based still-image codec like HEVC [1]. The basic coding algorithm used for this demonstration is draft 3 of the Versatile Video Coding (VVC) specification [2], as implemented by the VTM3.0 reference software [3] into which our QPA method has been integrated. Since only single images are utilized for this demonstration, the VVC codec was configured to apply only “still-image” Intra-picture prediction.
The presented images were transcoded, with visual transparency, to high-bit-rate JPEG in order to limit the download durations for the viewers. Differences between the coded pictures are mostly visible in low-contrast regions, so viewing in low background-lighting conditions is advised.
This demonstration serves as an accurate depiction of how rate control encodings with visual QPA (i.e., using XPSNR based R-D optimization) differ perceptually from rate control encodings without such visual optimization (i.e., using traditional PSNR based least-squares optimization).
BQTerrace, uncoded input (HD, 1920×1080, lossless size: 4989 KB)
BQTerrace, VTM 3.0.1 without QPA, base QP 32 (HD, 1920×1080, coded size: 96.7 KB)
BQTerrace, VTM 3.0.1 with QPA, base QP 29 (HD, 1920×1080, coded size: 98.5 KB)
BasketballDrive, frame 68, uncoded input (HD, 1920×1080, lossless size: 5123 KB)
BasketballDrive, frame 68, VTM 3.0.1 without QPA, base QP 30 (HD, 1920×1080, coded size: 51.5 KB)
BasketballDrive, frame 68, VTM 3.0.1 with QPA, base QP 30 (HD, 1920×1080, coded size: 49.2 KB)
Kodak Image 15, uncoded input (768×512, lossless size: 755 KB)
Kodak Image 15, VTM 3.0.1 without QPA, base QP 28 (768×512, coded size: 22.2 KB)
Kodak Image 15, VTM 3.0.1 with QPA, base QP 29 (768×512, coded size: 22.4 KB)
ParkScene, uncoded input (HD, 1920×1080, lossless size: 4911 KB)
ParkScene, VTM 3.0.1 without QPA, base QP 29 (HD, 1920×1080, coded size: 95.9 KB)
ParkScene, VTM 3.0.1 with QPA, base QP 30 (HD, 1920×1080, coded size: 92.9 KB)