polito.it
Politecnico di Torino (logo)

The Dynamic Optimizer Framework. Video encoding, assessment and comparison

Davide Chemin

The Dynamic Optimizer Framework. Video encoding, assessment and comparison.

Rel. Enrico Masala. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Del Cinema E Dei Mezzi Di Comunicazione, 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (5MB) | Preview
Abstract:

Delivery costs, low latency, and streaming at scale are key challenges in driving the technical development of the video industry. Improvements in video coding are always pushing the effort for new video coding solutions and quality monitoring. The recently released coding standards, like AV1 and VVC, are proof of this. In line with the constant aim of ensuring better compression ratios for the same visual quality, we analysed a new approach to digital video compression. The so-called Dynamic Optimizer has been proposed and conceived originally by Netflix, it is compatible with any existing and future video codecs, and it is suitable for non-real time encoding of on-demand video contents in adaptive streaming applications. Therefore, its ideal resources are long sequences of shots, typical of the Netflix catalogue. It works by fine-tuning the available encoding parameters for the single shots in order to find the optimal combination that respects a given bitrate or quality target. We developed a software capable of reproducing the proposed optimizer, which will be released with an open-source licence. It consists of three alternative implementations: a brute force approach that encodes and compares all possible combinations and it is very expensive in terms of number of computations and encodings, and the Lagrangian optimization, based on searching the optimal minimum of the convex hull of all combinations. The third and the most efficient proposed solution instead, which constitutes the innovative element of the work, is based on the RD curves approximation. In this thesis, we analyse the performances of the dynamic optimizer thanks to a comprehensive set of objective assessment tests, using the VMAF quality metric and the AVC, HEVC, VP9, and AV1 encoders. The results highlight the advantages and disadvantages of the different encodings in various bitrate and quality ranges and for diverse types of content.

Relators: Enrico Masala
Academic year: 2022/23
Publication type: Electronic
Number of Pages: 113
Subjects:
Corso di laurea: Corso di laurea magistrale in Ingegneria Del Cinema E Dei Mezzi Di Comunicazione
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Aziende collaboratrici: UNSPECIFIED
URI: http://webthesis.biblio.polito.it/id/eprint/26705
Modify record (reserved for operators) Modify record (reserved for operators)