Abstract
Cancer is the leading cause of death before the age of 70 years. An important step for reducing the cancer mortality can be its early detection. To improve the early diagnosis of cancer, we propose a novel probability calibration method based on the fuzzy set theory. Our approach was tested on the detection of female breast cancer and lung cancer. These are complicated by a small data set for the first case and by highly imbalanced data for the second case. In both cases, our probability calibration method improved the Log Loss metric (the best result was improved by 48.86%), the Brier score (the best result was improved by 13.24%), and the Precision-Recall metric (the best result was improved by 13.94%). The application field of our algorithm can be extended to any progressive diseases and events without a clearly defined boundary.
Similar content being viewed by others
REFERENCES
H. Sung, J. Ferlay, R. L. Siegel, M. Laversanne, et al., “Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries,” Cancer J. Clin. 71 (3), 209–249 (2021). https://doi.org/10.3322/caac.21660
E. W. Steyerberg, Clinical Prediction Models: A Practical Approach to Development, Validation and Updating (Springer, New York, 2009).
B. Böken, “On the appropriateness of Platt scaling in classifier calibration,” Inf. Syst. 95, 101641 (2021). https://doi.org/10.1016/j.is.2020.101641
N. Chakravarti, “Isotonic median regression: A linear programming approach,” Math. Oper. Res. 14 (2), 303–308 (1989). http://www.jstor.org/stable/3689709
J. Platt, “Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods,” in Advances in Large Margin Classifiers (MIT Press, Cambridge, Mass., 2000).
B. Zadrozny and C. Elkan, “Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers,” in Proceedings of the 18th International Conference on Machine Learning (2001), pp. 609–616.
M. P. Naeini, G. F. Cooper, and M. Hauskrecht, “Obtaining well calibrated probabilities using Bayesian binning,” Proc. AAAI Conf. Artif. Intell. 2015, 2901–2907 (2015). https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4410090/pdf/nihms679964.pdf
H.-J. Zimmermann, Fuzzy Set Theory and Its Applications (Springer, Dordrecht, 2001).
A. Torres and J. J. Nieto, “Fuzzy logic in medicine and bioinformatics,” J. Biomed. Biotechnol. 2006, 091908 (2006). https://doi.org/10.1155/jbb/2006/91908
A. Hassanien, “Fuzzy rough sets hybrid scheme for breast cancer detection,” Image Vision Comput. 25 (2), 172–183 (2007). https://doi.org/10.1016/j.imavis.2006.01.026
S. K. Ghosh, A. Mitra, and A. Ghosh, “A novel intuitionistic fuzzy soft set entrenched mammogram segmentation under Multigranulation approximation for breast cancer detection in early stages,” Expert Syst. Appl. 169, 114329 (2021). https://doi.org/10.1016/j.eswa.2020.114329
S. K. Ghosh, A. Ghosh, and S. Bhattacharyya, “Recognition of cancer mediating biomarkers using rough approximations enabled intuitionistic fuzzy soft sets based similarity measure,” Appl. Soft Comput. 124, 109052 (2022). https://doi.org/10.1016/j.asoc.2022.109052
N. Wang, W. Yao, Y. Zhao, and X. Chen, “Bayesian calibration of computer models based on Takagi–Sugeno fuzzy models,” Comput. Methods Appl. Mech. Eng. 378, 113724 (2021). https://doi.org/10.1016/j.cma.2021.113724
A. V. Pechinkin, O. I. Teskin, G. M. Tsvetkova, et al., Probability Theory, Ed. by V. S. Zarubin and A. P. Krishchenko, 3rd ed. revised (Mosk. Gos. Tekh. Univ. im. N.E. Baumana, Moscow, 2004) [in Russian].
K. Sadegh-Zadeh, “The logic of diagnosis,” in Philosophy of Medicine (North Holland, Amsterdam, 2011), pp. 357–424. https://doi.org/10.1016/b978-0-444-51787-6.50012-x
M. Castaneda, P. den Hollander, N. A. Kuburich, et al., “Mechanisms of cancer metastasis,” Semin. Cancer Biol. 87, 17–31 (2022). https://doi.org/10.1016/j.semcancer.2022.10.006
G. Beliakov, “Fuzzy sets and membership functions based on probabilities,” Inf. Sci. 91 (1–2), 95–111 (1996). https://doi.org/10.1016/0020-0255(95)00291-x
G. Chen and T. T. Pham, Introduction to Fuzzy Sets, Fuzzy Logic, and Fuzzy Control Systems (CRC, Boca Raton, 2000).
G. Aresta, T. Araújo, S. Kwok, et al., “BACH: Grand challenge on breast cancer histology images,” Med. Image Anal. 56, 122–139 (2019). https://doi.org/10.1016/j.media.2019.05.010
S. G. Armato III, G. McLennan, L. Bidaut, et al., “Data from LIDC-IDRI (Version 4) [dataset],” The Cancer Imaging Archive (2015). https://doi.org/10.7937/K9/TCIA.2015.LO9QL9SX
National Lung Screening Trial Research Team, “Data from the National Lung Screening Trial (NLST) (Version 3) [dataset],” The Cancer Imaging Archive (2013). https://doi.org/10.7937/TCIA.HMQ8-J677
P. F. Pinsky, “Lung cancer screening with low-dose CT: A world-wide view,” Transl. Lung Cancer Res. 7 (3), 234–242 (2018). https://doi.org/10.21037/tlcr.2018.05.12
C. Szegedy, V. Vanhoucke, et al. “Rethinking the inception architecture for computer vision,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016). https://doi.org/10.1109/cvpr.2016.308
J. Davis and M. Goadrich, “The relationship between precision-recall and ROC curves,” in Proceedings of the 23rd International Conference on Machine Learning (ACM, 2006). https://doi.org/10.1145/1143844.1143874
A. Küppers, J. Schneider, and A. Haselhoff, “Parametric and multivariate uncertainty calibration for regression and object detection,” Computer Vision—ECCV 2022 Workshops (Springer, Cham, 2023), pp 426–442. https://doi.org/10.1007/978-3-031-25072-9_30
Funding
This work was supported by the ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
The authors of this work declare that they have no conflicts of interest.
Additional information
Publisher’s Note.
Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Filimonova, O.A., Ovsyannikov, A.G. & Biryukova, N.V. Probability Calibration with Fuzzy Set Theory to Improve Early Cancer Detection. Dokl. Math. 108 (Suppl 2), S179–S185 (2023). https://doi.org/10.1134/S106456242370103X
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1134/S106456242370103X