Abstract
With the development of high-performance computing and the expansion of large-scale multiprocessor systems, it is significant to study the reliability of systems. Probabilistic fault diagnosis is of practical value to the reliability analysis of multiprocessor systems. In this paper, we design a linear time diagnosis algorithm with the multiprocessor system whose threshold is set to 3, where the probability that any node is correctly diagnosed in the discrete state can be calculated. Furthermore, we give the probabilities that all nodes of a d-regular and d-connected graph can be correctly diagnosed in the continuous state under the Weibull fault distribution and the Chi-square fault distribution. We prove that they approach to 1, which implies that our diagnosis algorithm can correctly diagnose almost all nodes of the graph.
References
Somani A K, Agarwal V K, Avis D. A generalized theory for system level diagnosis. IEEE Trans. Computers, 1987, C-36(5): 538–546. https://doi.org/10.1109/TC.1987.1676938.
Preparata F P, Metze G, Chien R T. On the connection assignment problem of diagnosable systems. IEEE Trans. Electronic Computers, 1967, EC-16(6): 848–854. https://doi.org/10.1109/PGEC.1967.264748.
Chang N W, Hsieh S Y. Structural properties and conditional diagnosability of star graphs by using the PMC model. IEEE Trans. Parallel and Distributed Systems, 2014, 25(11): 3002–3011. https://doi.org/10.1109/TPDS.2013.290.
Li X Y, Fan J X, Lin C K, Cheng B L, Jia X H. The extra connectivity, extra conditional diagnosability and t/k-diagnosability of the data center network DCell. Theoretical Computer Science, 2019, 766: 16–29. https://doi.org/10.1016/j.tcs.2018.09.014.
Li X Y, Fan J X, Lin C K, Jia X H. Diagnosability evaluation of the data center network DCell. The Computer Journal, 2018, 61(1): 129–143. https://doi.org/10.1093/comjnl/bxx057.
Lin L M, Huang Y Z, Wang X D, Xu L. Restricted connectivity and good-neighbor diagnosability of split-star networks. Theoretical Computer Science, 2020, 824-825: 81–91. https://doi.org/10.1016/j.tcs.2020.04.015.
Lv M J, Fan J X, Zhou J Y, Cheng B L, Jia X H. The extra connectivity and extra diagnosability of regular interconnection networks. Theoretical Computer Science, 2020, 809: 88–102. https://doi.org/10.1016/j.tcs.2019.12.001.
Wang S Y, Wang Z H, Wang M J S, Han W P. g-Good-neighbor conditional diagnosability of star graph networks under PMC model and MM* model. Frontiers of Mathematics in China, 2017, 12(5): 1221–1234. https://doi.org/10.1007/s11464-017-0657-9.
Zhu Q, Zhang J, Li L L. The h-extra connectivity and h-extra conditional diagnosability of bubble-sort star graphs. Discrete Applied Mathematics, 2018, 251: 322–333. https://doi.org/10.1016/j.dam.2018.03.077.
Xu X, Zhou S M, Xu L. Diagnosabilities of regular networks under three-valued comparison models. International Journal of High Performance Computing and Networking, 2017, 10(4/5): 251–258. https://doi.org/10.1504/IJHPCN.2017.086529.
Liang J R, Feng H, Du X J. Intermittent fault diagnosability of interconnection networks. Journal of Computer Science and Technology, 2017, 32(6): 1279–1287. https://doi.org/10.1007/s11390-017-1800-5.
Sun X L, Zhou S M, Lv M J, Liu J F, Lian G Q. Intermittent fault diagnosability of some general regular networks. The Computer Journal, 2020, 63(1): 16–24. https://doi.org/10.1093/comjnl/bxy128.
Blough D M, Sullivan G F, Masson G M. Efficient diagnosis of multiprocessor systems under probabilistic models. IEEE Trans. Computers, 1992, 41(9): 1126–1136. https://doi.org/10.1109/12.165394.
Fussell D, Rangarajan S. Probabilistic diagnosis of multiprocessor systems with arbitrary connectivity. In Proc. the 19th International Symposium on Fault-Tolerant Computing. Digest of Papers, Jun. 1989, pp.560–565. https://doi.org/10.1109/FTCS.1989.105636.
Tang Q Y, Song X Y. Diagnosis of parallel computers arbitrary connectivity. IEEE Trans. Computers, 1999, 48(7): 757–761. https://doi.org/10.1109/12.780885.
Rangarajan S, Fussell D. Diagnosing arbitrarily connected parallel computers with high probability. IEEE Trans. Computers, 1992, 41(5): 606–615. https://doi.org/10.1109/12.142687.
Huang K Y, Agarwal V K, Thulasiraman K. Diagnosis of clustered faults and wafer testing. IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, 1998, 17(2): 136–148. https://doi.org/10.1109/43.681263.
Tang Q Y, Song X Y, Wang Y K. Diagnosis of clustered faults for identical degree topologies. IEEE Trans. Computer-Aided Design of Integrated Circuits and Systems, 1999, 18(8): 1192–1201. https://doi.org/10.1109/43.775637.
Lu X J, Li J P, Seo C J. Probabilistic diagnosis of clustered faults for shared structures. Mathematical and Computer Modelling, 2009, 49(3/4): 623–634. https://doi.org/10.1016/j.mcm.2008.06.011.
Lv M J, Zhou S M, Sun X L, Lian G Q, Liu J F, Wang D J. Probabilistic diagnosis of clustered faults for hypercube-based multiprocessor system. Theoretical Computer Science, 2019, 793: 113–131. https://doi.org/10.1016/j.tcs.2019.06.023.
Akers S B, Krishnamurthy B. A group-theoretic model for symmetric interconnection networks. IEEE Trans. Computers, 1989, 38(4): 555–566. https://doi.org/10.1109/12.21148.
Compeau P E C. Girth of pancake graphs. Discrete Applied Mathematics, 2011, 159(15): 1641–1645. https://doi.org/10.1016/j.dam.2011.06.013.
Song S L, Zhou S M, Li X Y. Conditional diagnosability of burnt pancake networks under the PMC model. The Computer Journal, 2016, 59(1): 91–105. https://doi.org/10.1093/comjnl/bxv066.
Author information
Authors and Affiliations
Corresponding author
Supplementary Information
ESM 1
(PDF 731 kb)
Rights and permissions
About this article
Cite this article
Sun, XL., Fan, JX., Cheng, BL. et al. Probabilistic Fault Diagnosis of Clustered Faults for Multiprocessor Systems. J. Comput. Sci. Technol. 38, 821–833 (2023). https://doi.org/10.1007/s11390-021-1099-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-021-1099-0