Abstract
In this paper, we develop two SVM-based classifiers named stable nested one-class support vector machines (SN-1SVMs) and decoupled margin-moment based SVMs (DMMB-SVMs), to predict the specific type of pancreatic carcinoma using quantitative histopathological signatures of images. For each patient, the diagnosis can produce hundreds of images, which can be used to classify the pancreatic tissues into three classes: chronic pancreatitis, intraductal papillary mucinous neoplasms, and pancreatic carcinoma. The proposed two approaches tackle the classification problems from two different perspectives: the SN-1SVM treats each image as a classification point in a nested fashion to predict malignancy of the tissues, while the DMMB-SVM treats each patient as a classification point by assembling information across images. One attractive feature of the DMMB-SVM is that, in addition to utilizing the mean information, it also takes into account the covariance of features extracted from images for each patient. We conduct numerical experiments to evaluate and compare performance of the two methods. It is observed that the SN-1SVM can take advantage of the data structure more effectively, while the DMMB-SVM demonstrates better computational efficiency and classification accuracy. To further improve interpretability of the final classifier, we also consider the \(\ell _1\)-norm in the DMMB-SVM to handle feature selection.
Similar content being viewed by others
Data availability
The datasets generated during and/or analyzed during the current study are available on reasonable request.
References
Anderson, D., Bjarnadöttir, M.: When is an ounce of prevention worth a pound of cure? Identifying high-risk candidates for case management. IIE Transactions on Healthcare Systems Engineering 6(1), 22–32 (2016)
Ben-Tal, A., Bhadra, S., Bhattacharyya, C., Nath, J.S.: Chance constrained uncertain classification via robust optimization. Math. Program. 127(1), 145–173 (2011)
Bersch, V.P., da Silva, V.D., Osvaldt, A.B., da Costa, M.S., Rohde, L., Mossmann, D.: Digital karyometry in pancreatic adenocarcinoma. Anal. Quant. Cytol. Histol. 25(2), 108–114 (2003)
Bhattacharyya, C., Grate, L.R., Jordan, M.I., El Ghaoui, L., Mian, I.S.: Robust sparse hyperplane classifiers: application to uncertain molecular profiling data. J. Comput. Biol. 11(6), 1073–1089 (2004)
Bi, J., Bennett, K., Embrechts, M., Breneman, C., Song, M.: Dimensionality reduction via sparse support vector machines. J. Mach. Learn. Res. 3(3), 1229–1243 (2003)
Chang, C.-C., Lin, C.-J.: Training \(\nu \)-support vector classifiers: theory and algorithms. Neural. Comput. 13, 2119–2147 (2001)
Glazer, E.S., Bartels, P.H., Prasad, A.R., Yozwiak, M.L., Bartels, H.G., Einspahr, J.G., Alberts, D.S., Krouse, R.S.: Nuclear morphometry identifies a distinct aggressive cellular phenotype in cutaneous squamous cell carcinoma. Cancer Prevention Research 4(11), 1770–1777 (2011)
Glazer, E.S., Zhang, H.H., Hill, K.A., Patel, C., Kha, S.T., Yozwiak, M.L., Bartels, H., Nafissi, N.N., Watkins, J.C., Alberts, D.S., Krouse, R.S.: Evaluating ipmn and pancreatic carcinoma utilizing quantitative histopathology. Cancer Medicine 5(10), 2841–2847 (2016)
Hosseini, R., Chan, H., Kapur, P., Cadeddu, J., Liu, H., Wang, S.: Discriminative spectral pattern analysis for positive margin detection of prostate cancer specimens using light reflectance spectroscopy. IISE Transactions on Healthcare Systems Engineering 8(2), 144–154 (2018)
Krouse, R.S., Alberts, D.S., Prasad, A.R., Bartels, H., Yozwiak, M., Liu, Y., Bartels, P.H.: Progression of skin lesions from normal skin to squamous cell carcinoma. Analytical and quantitative cytology and histology/the International Academy of Cytology and American Society of Cytology 31(1), 17–25 (2009)
Lofberg, J.: Yalmip: A toolbox for modeling and optimization in matlab. In Computer Aided Control Systems Design, 2004 IEEE International Symposium on, pp. 284–289. (2005). IEEE
Platt, J.: Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in Large Margin Classifiers 10(3), 61–74 (1999)
Schölkopf, B., Smola, A.J., Williamson, R.C., Bartlett, P.L.: New support vector algorithms. Neural Comput. 12(5), 1207–1245 (2000)
Schölkopf, B., Platt, J.C., Shawe-Taylor, J., Smola, A.J., Williamson, R.C.: Estimating the support of a high-dimensional distribution. Neural Comput. 13(7), 1443–1471 (2001)
Shivaswamy, P.K., Bhattacharyya, C., Smola, A.J.: Second order cone programming approaches for handling missing and uncertain data. J. Mach. Learn. Res. 7(7), 1283–1314 (2006)
Sturm, J.F.: Using sedumi 1.02, a matlab toolbox for optimization over symmetric cones. Optimization Methods and Software 11(1-4), 625–653 (1999)
Tibshirani, R.: Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society. Series B (Methodological) 267–288 (1996)
Tucker, C., Han, Y., Nembhard, H.B., Lee, W., Lewis, M., Sterling, N., Huang, X.: A data mining methodology for predicting early stage Parkinson’s disease using non-invasive, high-dimensional gait sensor data. IIE Transactions on Healthcare Systems Engineering 5(4), 238–254 (2015)
Wang, X., Fan, N., Pardalos, P.M.: Robust chance-constrained support vector machines with second-order moment information. Ann. Oper. Res. 263(1–2), 45–68 (2015)
Washburn, A.: High-confidence learning from uncertain data with high dimensionality. PhD Dissertation, University of Arizona (2018)
Wu, T., Lin, C., Weng, R.: Probability estimates for multi-class classification by pairwise coupling. J. Mach. Learn. Res. 5, 975–1005 (2004)
Zhong, P., Fukushima, M.: A new multi-class support vector algorithm. Optimisation Methods and Software 21(3), 359–372 (2006)
Acknowledgements
This material is based upon work supported by NSF Grants CMMI #1634282 and CCF #1740858. An allocation of computer time from the UA Research Computing High Performance Computing (HPC) and High Throughput Computing (HTC) at the University of Arizona is gratefully acknowledged. The majority of the materials contained in this paper were modified from the first author’s dissertation [20].
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
Conflict of Interest for all authors - None
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Washburn, A., Fan, N. & Zhang, H.H. Novel SVM-based classification approaches for evaluating pancreatic carcinoma. Ann Math Artif Intell (2023). https://doi.org/10.1007/s10472-023-09888-5
Accepted:
Published:
DOI: https://doi.org/10.1007/s10472-023-09888-5