Fast partition algorithm in depth map intra coding unit based on multi-deep convolution neural network

Omran, Nacir; Maraoui, Amna; Werda, Imen; Hamdi, Belgacem

doi:10.1007/s11554-023-01404-6

Fast partition algorithm in depth map intra coding unit based on multi-deep convolution neural network

Research
Published: 23 January 2024

Volume 21, article number 23, (2024)
Cite this article

Journal of Real-Time Image Processing Aims and scope Submit manuscript

Nacir Omran¹,
Amna Maraoui¹,
Imen Werda² &
…
Belgacem Hamdi¹

155 Accesses
Explore all metrics

Abstract

The three-dimension high-efficiency video coding standard (3D-HEVC) finalized comes with a significant increase in complexity caused by the integration of depth map coding technology. This complexity is primarily triggered by the quad-tree partition of the Intra Coding Units (CU) in the depth map. A new technique utilizing deep learning is proposed, in this paper, to tackle the issue of excessive complexity aiming to predict efficiently the CU partition structure. The proposed method involves building a dataset of CU partition structure information for a depth map, creating a Multi-Deep Convolutional Neural Network (MD-CNN) model using this dataset, and then incorporating the model into the 3D-HEVC test platform. This approach reduces the 3D-HEVC video encoder complexity by 48.29% without affecting robustness, compression efficiency and video quality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fast CU partition algorithm based on swin-transformer for depth intra coding in 3D-HEVC

Article 15 April 2024

A CU Depth Prediction Model Based on Pre-trained Convolutional Neural Network for HEVC Intra Encoding Complexity Reduction

High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures

Article 01 April 2024

References

Tech, G., Chen, Y., Müller, K., Ohm, J.-R., Vetro, A., Wang, Y.-K.: Overview of the multiview and 3d extensions of high efficiency video coding. IEEE Trans. Circuits Syst. Video Technol. 26(1), 35–49 (2016)
Article Google Scholar
Sullivan, G.J., Ohm, J.-R., Han, W.-J., Wiegand, T.: Overview of the high efficiency video coding (hevc) standard. IEEE Trans. Circuits Syst. Video Technol. 22(12), 1649–1668 (2012)
Article Google Scholar
Sanchez, G., Agostini, L., Marcon, C.: Algorithms for efficient and fast 3d-hevc depth map encoding (2020). https://doi.org/10.1007/978-3-030-25927-3
Bakkouri, S., Elyousfi, A.: An adaptive cu size decision algorithm based on gradient boosting machines for 3d-hevc inter-coding. Multimed. Tools Appl., 1–19 (2023)
Zhang, Z., Yu, L., Qian, J., Wang, H.: Learning-based fast depth inter coding for 3d-hevc via xgboost. In: 2022 Data Compression Conference (DCC), pp. 43–52. IEEE (2022)
Song, W., Dai, P., Zhang, Q.: Content-adaptive mode decision for low complexity 3d-hevc. Multimedi. Tools Appl., 1–16 (2023)
Lee, J.Y., Kang, M., Park, S.-h.: Fast depth intra mode decision using machine learning in 3d-hevc. Available at SSRN 4197680
Chiang, J.-C., Peng, K.-K., Wu, C.-C., Deng, C.-Y., Lie, W.-N.: Fast intra mode decision and fast cu size decision for depth video coding in 3d-hevc. Signal Process.: Image Commun. 71, 13–23 (2019)
Google Scholar
Zuo, J., Chen, J., Zeng, H., Cai, C., Ma, K.-K.: Bi-layer texture discriminant fast depth intra coding for 3d-hevc. IEEE Access 7, 34265–34274 (2019)
Article Google Scholar
Li, T., Wang, H., Chen, Y., Yu, L.: Fast depth intra coding based on spatial correlation and rate distortion cost in 3d-hevc. Signal Process.: Image Commun. 80, 115668 (2020)
Google Scholar
Li, T., Yu, L., Wang, S., Wang, H.: Simplified depth intra coding based on texture feature and spatial correlation in 3d-hevc. In: 2018 Data Compression Conference, pp. 421–421. IEEE (2018)
Wang, C., Feng, G., Cai, C., Han, X.: Fast cu size decision algorithm for depth map intra-coding in 3d-hevc. Commun. Technol. 50(4), 655–661 (2017)
Google Scholar
Hamout, H., Elyousfi, A.: Fast 3d-hevc pu size decision algorithm for depth map intra-video coding. J. Real-Time Image Proc. 17(5), 1285–1299 (2020)
Article Google Scholar
Hamout, H., Elyousfi, A.: A computation complexity reduction of the size decision algorithm in 3d-HEVC depth map intracoding. Adv. Multimed. 2022, 3507201 (2022). https://doi.org/10.1155/2022/3507201
Article Google Scholar
Saldanha, M., Sanchez, G., Marcon, C., Agostini, L.: Fast 3d-hevc depth map encoding using machine learning. IEEE Trans. Circuits Syst. Video Technol. 30(3), 850–861 (2019)
Article Google Scholar
Saldanha, M., Sanchez, G., Marcon, C., Agostini, L.: Fast 3d-hevc depth maps intra-frame prediction using data mining. In: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 1738–1742. IEEE (2018)
Liu, C., Jia, K., Liu, P., Sun, Z.: Fast depth intra coding based on layer-classification and cnn for 3d-hevc. In: 2020 Data Compression Conference (DCC), pp. 381–381. IEEE (2020)
Fu, C.-H., Chen, H., Chan, Y.-L., Tsang, S.-H., Hong, H., Zhu, X.: Fast depth intra coding based on decision tree in 3d-hevc. IEEE Access 7, 173138–173147 (2019)
Article Google Scholar
Liu, C., Jia, K., Liu, P.: Fast depth intra coding based on depth edge classification network in 3d-hevc. IEEE Trans. Broadcast. 68(1), 97–109 (2021)
Article MathSciNet Google Scholar
Zhang, R., Jia, K., Yu, Y., Liu, P., Sun, Z.: Fast 3d-hevc inter coding using data mining and machine learning. IET Image Proc. 16(11), 3067–3084 (2022)
Article Google Scholar
Xu, M., Li, T., Wang, Z., Deng, X., Yang, R., Guan, Z.: Reducing complexity of hevc: a deep learning approach. IEEE Trans. Image Process. 27(10), 5044–5059 (2018)
Article ADS MathSciNet Google Scholar
Imen, W., Amna, M., Fatma, B., Ezahra, S.F., Masmoudi, N.: Fast hevc intra-cu decision partition algorithm with modified lenet-5 and alexnet. SIViP 16(7), 1811–1819 (2022)
Article Google Scholar
Li, T., Xu, M., Tang, R., Chen, Y., Xing, Q.: Deepqtmt: a deep learning approach for fast qtmt-based cu partition of intra-mode vvc. IEEE Trans. Image Process. 30, 5377–5390 (2021)
Article ADS PubMed Google Scholar
Amna, M., Imen, W., Fatma Ezahra, S.: Fast multi-type tree partitioning for versatile video coding using machine learning. SIViP 17(1), 67–74 (2023)
Article Google Scholar
Dang-Nguyen, D.-T., Pasquini, C., Conotter, V., Boato, G.: Raise: A raw images dataset for digital image forensics. In: Proceedings of the 6th ACM Multimedia Systems Conference, pp. 219–224 (2015)
ITU/ISO/IEC: HEVC HM reference software. [Online; accessed 28-April-2023] (2017). https://vcgit.hhi.fraunhofer.de/jvet/HM/-/tree/HM-16.18?ref_type=tags
ITU/ISO/IEC: 3D-HEVC HTM reference software. [Online; accessed 28-April-2023] (2017). https://hevc.hhi.fraunhofer.de/svn/svn_3DVCSoftware/branches/HTM-16.3-fixes/
Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., Kudlur, M., Levenberg, J., Monga, R., Moore, S., Murray, D.G., Steiner, B., Tucker, P., Vasudevan, V., Warden, P., Wicke, M., Yu, Y., Zheng, X.: Tensorflow: A system for large-scale machine learning. In: 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), pp. 265–283 (2016)

Download references

Author information

Authors and Affiliations

Electronics and Microelectronics Laboratory, Faculty of Sciences of Monastir, Environment Street, Monastir, 5019, Tunisia
Nacir Omran, Amna Maraoui & Belgacem Hamdi
Electronic and Information Technology Laboratory, University of Sfax, Sfax, Tunisia
Imen Werda

Authors

Nacir Omran
View author publications
You can also search for this author in PubMed Google Scholar
Amna Maraoui
View author publications
You can also search for this author in PubMed Google Scholar
Imen Werda
View author publications
You can also search for this author in PubMed Google Scholar
Belgacem Hamdi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

NO and AM wrote the manuscript and prepared figures. All authors reviewed the manuscript.

Corresponding author

Correspondence to Nacir Omran.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Omran, N., Maraoui, A., Werda, I. et al. Fast partition algorithm in depth map intra coding unit based on multi-deep convolution neural network. J Real-Time Image Proc 21, 23 (2024). https://doi.org/10.1007/s11554-023-01404-6

Download citation

Received: 18 August 2023
Accepted: 19 December 2023
Published: 23 January 2024
DOI: https://doi.org/10.1007/s11554-023-01404-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fast partition algorithm in depth map intra coding unit based on multi-deep convolution neural network

Abstract

Access this article

Similar content being viewed by others

Fast CU partition algorithm based on swin-transformer for depth intra coding in 3D-HEVC

A CU Depth Prediction Model Based on Pre-trained Convolutional Neural Network for HEVC Intra Encoding Complexity Reduction

High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Fast partition algorithm in depth map intra coding unit based on multi-deep convolution neural network

Abstract

Access this article

Similar content being viewed by others

Fast CU partition algorithm based on swin-transformer for depth intra coding in 3D-HEVC

A CU Depth Prediction Model Based on Pre-trained Convolutional Neural Network for HEVC Intra Encoding Complexity Reduction

High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures

References

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation