Fast CU partition algorithm based on swin-transformer for depth intra coding in 3D-HEVC,Multimedia Tools and Applications

当前位置： X-MOL 学术 › Multimed. Tools Appl. › 论文详情

Our official English website, www.x-mol.net, welcomes your feedback! (Note: you will need to create a separate account there.)

Fast CU partition algorithm based on swin-transformer for depth intra coding in 3D-HEVC
Multimedia Tools and Applications ( IF 3.6 ) Pub Date : 2024-04-15 , DOI: 10.1007/s11042-024-18926-1
Shucen Liu , Shaoguo Cui , Tiansong Li , Haokun Liu , Qingsong Yang , Hao Yang

The three-dimensional High Efficiency Video Coding (3D-HEVC) standard is an extension of the High Efficiency Video Coding (HEVC) standard which is the latest three-dimensional (3D) video coding standard available. Based on the HEVC standard, 3D-HEVC adds some advanced techniques that are conducive to depth map coding at the expense of a significant increase in coding complexity. In this paper, a Swin-CNN network is proposed, which leverages the advantages of Swin Transformer in extracting global information and convolutional neural network (CNN) in extracting local information. Through Swin-CNN, the coding tree unit (CTU) partition structure in depth intra coding (DIC) can be predicted accurately. In addition, we construct a large-scale depth map dataset to train the Swin-CNN. Finally, we use the proposed algorithm to replace the search process of CTU quadtree partition in DIC. Experimental results show that the proposed algorithm can reduce the coding time by 60.9% to 67.5% without compromising the quality of the synthesised views, effectively reducing the coding complexity of 3D-HEVC.

中文翻译：

基于 swin-transformer 的 3D-HEVC 深度帧内编码快速 CU 划分算法

三维高效视频编码 (3D-HEVC) 标准是高效视频编码 (HEVC) 标准的扩展，后者是最新的三维 (3D) 视频编码标准。 3D-HEVC在HEVC标准的基础上，增加了一些有利于深度图编码的先进技术，但代价是编码复杂度大幅增加。本文提出了一种Swin-CNN网络，它利用了Swin Transformer在提取全局信息方面和卷积神经网络（CNN）在提取局部信息方面的优势。通过Swin-CNN，可以准确预测深度帧内编码（DIC）中的编码树单元（CTU）分区结构。此外，我们构建了一个大规模深度图数据集来训练 Swin-CNN。最后，我们使用所提出的算法来代替DIC中CTU四叉树划分的搜索过程。实验结果表明，该算法在不影响合成视图质量的情况下，可以将编码时间减少60.9%~67.5%，有效降低3D-HEVC的编码复杂度。

更新日期：2024-04-16

点击分享查看原文

点击收藏

阅读更多本刊最新论文本刊介绍/投稿指南

全部期刊列表>>