View : 552 Download: 0

Fast Multi-Type Tree Partitioning for Versatile Video Coding Using a Lightweight Neural Network

Title
Fast Multi-Type Tree Partitioning for Versatile Video Coding Using a Lightweight Neural Network
Authors
Park, Sang-hyoKang, Je-Won
Ewha Authors
강제원
SCOPUS Author ID
강제원scopus
Issue Date
2021
Journal Title
IEEE TRANSACTIONS ON MULTIMEDIA
ISSN
1520-9210JCR Link

1941-0077JCR Link
Citation
IEEE TRANSACTIONS ON MULTIMEDIA vol. 23, pp. 4388 - 4399
Keywords
EncodingComplexity theoryImage codingTransformsToolsStandardsQuantization (signal)Block partitioningDeep learningEncoding complexityImage compressionIntra predictionMulti-type treeNeural networkVideo compressionVVC
Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
Indexed
SCIE; SCOPUS WOS
Document Type
Article
Abstract
In this paper, we propose a fast decision scheme using a lightweight neural network (LNN) to avoid redundant block partitioning in versatile video coding (VVC). A more versatile block structure, named the multi-type tree (MTT) structure, which includes binary trees (BTs) and ternary trees (TTs), is adopted by VCC, in addition to the traditional quadtree structure. The MTT improved the coding efficiency compared with previous video coding standards. However, the new tree structures, mainly TT, significantly increased the complexity of the VVC encoder. Although widespread application of VVC has been inhibited, this problem has not yet been investigated thoroughly in the literature. In this study, we first determine the statistical characteristics of coded parameters that exhibit correlation with the TT and develop two useful types of features-explicit VVC features (EVFs) and derived VVC features (DVFs)-to facilitate the intra coding of VVC. These features can be obtained efficiently during the intra prediction before the determination of the best block partitioning during rate-distortion optimization in VVC encoding. Our LNN model decides whether to terminate the nested TT block structures subsequent to a quadtree based on the features. The experimental results confirm that the proposed method substantially decreases the encoding complexity of VVC with a slight coding loss under the All Intra configuration. Our code, models, and dataset are available at https://github.com/foriamweak/MTTPartitioning_LNN.
DOI
10.1109/TMM.2020.3042062
Appears in Collections:
공과대학 > 전자전기공학전공 > Journal papers
Files in This Item:
There are no files associated with this item.
Export
RIS (EndNote)
XLS (Excel)
XML


qrcode

BROWSE