Skip to main content
Log in

Probability Calibration with Fuzzy Set Theory to Improve Early Cancer Detection

  • Published:
Doklady Mathematics Aims and scope Submit manuscript

Abstract

Cancer is the leading cause of death before the age of 70 years. An important step for reducing the cancer mortality can be its early detection. To improve the early diagnosis of cancer, we propose a novel probability calibration method based on the fuzzy set theory. Our approach was tested on the detection of female breast cancer and lung cancer. These are complicated by a small data set for the first case and by highly imbalanced data for the second case. In both cases, our probability calibration method improved the Log Loss metric (the best result was improved by 48.86%), the Brier score (the best result was improved by 13.24%), and the Precision-Recall metric (the best result was improved by 13.94%). The application field of our algorithm can be extended to any progressive diseases and events without a clearly defined boundary.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.
Fig. 3.
Fig. 4.

Similar content being viewed by others

REFERENCES

  1. H. Sung, J. Ferlay, R. L. Siegel, M. Laversanne, et al., “Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries,” Cancer J. Clin. 71 (3), 209–249 (2021). https://doi.org/10.3322/caac.21660

    Article  Google Scholar 

  2. E. W. Steyerberg, Clinical Prediction Models: A Practical Approach to Development, Validation and Updating (Springer, New York, 2009).

    Book  Google Scholar 

  3. B. Böken, “On the appropriateness of Platt scaling in classifier calibration,” Inf. Syst. 95, 101641 (2021). https://doi.org/10.1016/j.is.2020.101641

  4. N. Chakravarti, “Isotonic median regression: A linear programming approach,” Math. Oper. Res. 14 (2), 303–308 (1989). http://www.jstor.org/stable/3689709

    Article  MathSciNet  Google Scholar 

  5. J. Platt, “Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods,” in Advances in Large Margin Classifiers (MIT Press, Cambridge, Mass., 2000).

    Google Scholar 

  6. B. Zadrozny and C. Elkan, “Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers,” in Proceedings of the 18th International Conference on Machine Learning (2001), pp. 609–616.

  7. M. P. Naeini, G. F. Cooper, and M. Hauskrecht, “Obtaining well calibrated probabilities using Bayesian binning,” Proc. AAAI Conf. Artif. Intell. 2015, 2901–2907 (2015). https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4410090/pdf/nihms679964.pdf

  8. H.-J. Zimmermann, Fuzzy Set Theory and Its Applications (Springer, Dordrecht, 2001).

    Book  Google Scholar 

  9. A. Torres and J. J. Nieto, “Fuzzy logic in medicine and bioinformatics,” J. Biomed. Biotechnol. 2006, 091908 (2006). https://doi.org/10.1155/jbb/2006/91908

  10. A. Hassanien, “Fuzzy rough sets hybrid scheme for breast cancer detection,” Image Vision Comput. 25 (2), 172–183 (2007). https://doi.org/10.1016/j.imavis.2006.01.026

    Article  Google Scholar 

  11. S. K. Ghosh, A. Mitra, and A. Ghosh, “A novel intuitionistic fuzzy soft set entrenched mammogram segmentation under Multigranulation approximation for breast cancer detection in early stages,” Expert Syst. Appl. 169, 114329 (2021). https://doi.org/10.1016/j.eswa.2020.114329

  12. S. K. Ghosh, A. Ghosh, and S. Bhattacharyya, “Recognition of cancer mediating biomarkers using rough approximations enabled intuitionistic fuzzy soft sets based similarity measure,” Appl. Soft Comput. 124, 109052 (2022). https://doi.org/10.1016/j.asoc.2022.109052

  13. N. Wang, W. Yao, Y. Zhao, and X. Chen, “Bayesian calibration of computer models based on Takagi–Sugeno fuzzy models,” Comput. Methods Appl. Mech. Eng. 378, 113724 (2021). https://doi.org/10.1016/j.cma.2021.113724

  14. A. V. Pechinkin, O. I. Teskin, G. M. Tsvetkova, et al., Probability Theory, Ed. by V. S. Zarubin and A. P. Krishchenko, 3rd ed. revised (Mosk. Gos. Tekh. Univ. im. N.E. Baumana, Moscow, 2004) [in Russian].

  15. K. Sadegh-Zadeh, “The logic of diagnosis,” in Philosophy of Medicine (North Holland, Amsterdam, 2011), pp. 357–424. https://doi.org/10.1016/b978-0-444-51787-6.50012-x

    Book  Google Scholar 

  16. M. Castaneda, P. den Hollander, N. A. Kuburich, et al., “Mechanisms of cancer metastasis,” Semin. Cancer Biol. 87, 17–31 (2022). https://doi.org/10.1016/j.semcancer.2022.10.006

    Article  Google Scholar 

  17. G. Beliakov, “Fuzzy sets and membership functions based on probabilities,” Inf. Sci. 91 (1–2), 95–111 (1996). https://doi.org/10.1016/0020-0255(95)00291-x

    Article  MathSciNet  Google Scholar 

  18. G. Chen and T. T. Pham, Introduction to Fuzzy Sets, Fuzzy Logic, and Fuzzy Control Systems (CRC, Boca Raton, 2000).

    Book  Google Scholar 

  19. G. Aresta, T. Araújo, S. Kwok, et al., “BACH: Grand challenge on breast cancer histology images,” Med. Image Anal. 56, 122–139 (2019). https://doi.org/10.1016/j.media.2019.05.010

    Article  Google Scholar 

  20. S. G. Armato III, G. McLennan, L. Bidaut, et al., “Data from LIDC-IDRI (Version 4) [dataset],” The Cancer Imaging Archive (2015). https://doi.org/10.7937/K9/TCIA.2015.LO9QL9SX

  21. National Lung Screening Trial Research Team, “Data from the National Lung Screening Trial (NLST) (Version 3) [dataset],” The Cancer Imaging Archive (2013). https://doi.org/10.7937/TCIA.HMQ8-J677

  22. P. F. Pinsky, “Lung cancer screening with low-dose CT: A world-wide view,” Transl. Lung Cancer Res. 7 (3), 234–242 (2018). https://doi.org/10.21037/tlcr.2018.05.12

    Article  Google Scholar 

  23. C. Szegedy, V. Vanhoucke, et al. “Rethinking the inception architecture for computer vision,” in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2016). https://doi.org/10.1109/cvpr.2016.308

  24. J. Davis and M. Goadrich, “The relationship between precision-recall and ROC curves,” in Proceedings of the 23rd International Conference on Machine Learning (ACM, 2006). https://doi.org/10.1145/1143844.1143874

  25. A. Küppers, J. Schneider, and A. Haselhoff, “Parametric and multivariate uncertainty calibration for regression and object detection,” Computer Vision—ECCV 2022 Workshops (Springer, Cham, 2023), pp 426–442. https://doi.org/10.1007/978-3-031-25072-9_30

    Book  Google Scholar 

Download references

Funding

This work was supported by the ongoing institutional funding. No additional grants to carry out or direct this particular research were obtained.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to O. A. Filimonova.

Ethics declarations

The authors of this work declare that they have no conflicts of interest.

Additional information

Publisher’s Note.

Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Filimonova, O.A., Ovsyannikov, A.G. & Biryukova, N.V. Probability Calibration with Fuzzy Set Theory to Improve Early Cancer Detection. Dokl. Math. 108 (Suppl 2), S179–S185 (2023). https://doi.org/10.1134/S106456242370103X

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S106456242370103X

Keywords:

Navigation