Abstract
Due to an increased computational complexity of calculating the values of the distributed-order Caputo fractional derivative compared to the classical Caputo derivative there is a need to develop new techniques that accelerate it. In this paper for this purpose we propose to use a fast matrix "multiply and accumulate" operation available in GPU’s that contain the so-called tensor cores. We present and experimentally analyze the properties of GPU-algorithms that are based on the L1 finite-difference approximation of the derivative and incorporate them into the Crank-Nicholson scheme for the distributed-order time-fractional diffusion equation. The computation of derivative’s values on GPU was faster than the multi-threaded implementation on CPU only for a large number of time steps with growing performance gain when number of time steps increase. The usage of the single-precision data type increased the error up to \(2.7\%\) comparing with the usage of the double-precision data type. Half-precision computations in tensor cores increased the error up to \(29.5\%\). While solving a time-fractional diffusion equation, algorithms implemented for GPU with the usage of the single-precision data type were at least three times faster than the CPU-implementation for the number of time steps more than 1280. Data type precision had only slight influence on the solution error with significantly increased execution time when the double-precision data type was used for data storage and processing.
Similar content being viewed by others
References
Atanackovic, T., Stankovic, B.: On a numerical scheme for solving differential equations of fractional order. Mech. Res. Comm. 35(7), 429–438 (2008)
Baffet, D., Hesthaven, J.: A kernel compression scheme for fractional differential equations. SIAM J. Numer. Anal. 55(2), 496–520 (2017)
Bohaienko, V.: A fast finite-difference algorithm for solving space-fractional filtration equation with a generalised Caputo derivative. Comput. Appl. Math. 38(3), 105 (2019)
Bohaienko, V.: Computing psi-caputo fractional derivative values using CUDA 10. In: CEUR Workshop Proceedings, pp. 636–648 (2020)
Bonchis, C., Kaslik, E., Rosu, F.: HPC optimal parallel communication algorithm for the simulation of fractional-order systems. J. Supercomput. 75, 1014–1025 (2019)
Bu, W., Xiao, A., Zeng, W.: Finite difference/finite element methods for distributed-order time fractional diffusion equations. J. Sci. Comput. 72, 422–441 (2017)
Bulavatsky, V.: Some boundary-value problmes of filtration dynamics corresponding to models of fractional diffusion of distributed order. Cyber. Syst. Anal. 58, 65–76 (2022)
Chen, Z., Zheng, S., Zhang, Z.: Sparsity-aware precorrected tensor train algorithm for fast solution of 2-d scattering problems and current flow modeling on unstructured meshes. IEEE Trans. Microwave Theory Tech. 67, 4833–4847 (2019)
Ding, W., Patnaik, S., Sidhardh, S., Semperlotti, F.: Applications of distributed-order fractional operators: a review. Entropy 23, 110 (2021)
Durastante, F.: Efficient solution of time-fractional differential equations with a new adaptive multi-term discretization of the generalized Caputo-Dzherbashyan derivative. Calcolo 56, 36 (2019)
Ford, N., Simpson, A.: The numerical solution of fractional differential equations: speed versus accuracy. Numer. Algo. 26(4), 333–346 (2001)
Golev, A., Penev, A., Stefanova, K., Hristova, S.: Using GPU to speed up calculation of some approximate methods for fractional differential equations. Int. J. Pure Appl. Math. 119(3), 391–401 (2018)
Gorenflo, R., Mainardi, F.: Fractional calculus: integral and differential equations of fractional order, pp. 223–276. Springer Verlag, Wien (1997)
Hu, G., O’Connell, R.: Analytical inversion of symmetric tridiagonal matrices. J. Phys. A Math. Gen. 29, 1511–1513 (1996)
Jia, J., Wang, H., Zheng, X.: A fast numerical scheme for a variably distributed-order time-fractional diffusion equation and its analysis. Comput. Math. Appl. 108, 24–32 (2022)
Kochubei, A.: Distributed order calculus and equations of ultraslow diffusion. J. Math. Anal. Appl. 340(1), 252–281 (2008)
Liao, H., Lyu, P., Vong, S., Zhao, Y.: Stability of fully discrete schemes with interpolation-type fractional formulas for distributed-order subdiffusion equations. Numer. Algorithms 75, 845–878 (2017)
Mainardi, F., Pagnini, G., Gorenflo, R.: Some aspects of fractional diffusion equations of single and distributed order. Appl. Math. Comput. 187, 295–305 (2007)
Podlubny, I.: Fractional Differential Equations. Academic Press, New York (1999)
Sales Teodoro, G., Tenreiro Machado, J., Capelas de Oliveira, E.: A review of definitions of fractional derivatives and other operators. J. Comput. Phys. 388, 195–208 (2019)
Samarskii, A.: The Theory of Difference Schemes. CRC Press, New York (2001)
Samko, S., Kilbas, A., Marichev, O.: Fractional Integrals and Derivatives. Gordon and Breach Science Publishers, New York (1993)
Sun, L.Y., Fang, Z.W., Lei, S.L., Sun, H.W., Zhang, J.L.: A fast algorithm for two-dimensional distributed-order time-space fractional diffusion equations. Appl. Math. Comput. 425, 127095 (2022)
Taghipour, M., Aminikhah, H.: An efficient non-standard finite difference scheme for solving distributed order time fractional reaction-diffusion equation. Int. J. Appl. Comput. Math. 8, 56 (2022)
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Bohaienko, V. Calculation of Distributed-Order Fractional Derivative on Tensor Cores-Enabled GPU. Int J Parallel Prog 51, 256–270 (2023). https://doi.org/10.1007/s10766-023-00754-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10766-023-00754-9