Abstract
Data privacy has become one of the most important concerns in the big data era. Because clustering has broad applications in machine learning and data analysis, many algorithms and theoretical results have been established for privacy-preserving clustering problems, such as the k-means and k-median problems with privacy protection. However, there is little work on privacy protection in k-center clustering. Our research focuses on the k-center problem, its distributed variant, and the distributed k-center problem under differential privacy constraints. In these problems, differential privacy is used to safeguard individual input elements, so that information about any single individual stays protected during data processing and analysis. We propose an approximation algorithm for each of the three problems, and each achieves a constant-factor approximation ratio.
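For orientation, the two notions named above can be stated in their standard textbook forms. The notation here (point set \(X\), metric \(d\), center set \(S\), randomized algorithm \(\mathcal{M}\), privacy parameter \(\varepsilon\)) is assumed for illustration and may differ from the definitions used in the body of the paper. Given a set \(X\) with metric \(d\) and an integer \(k\), the \(k\)-center problem asks for
\[
\min_{S \subseteq X,\ |S| \le k} \ \max_{x \in X} \ \min_{s \in S} d(x, s),
\]
and a randomized algorithm \(\mathcal{M}\) is \(\varepsilon\)-differentially private if, for all inputs \(D\) and \(D'\) that differ in a single element and for every set \(O\) of possible outputs,
\[
\Pr[\mathcal{M}(D) \in O] \le e^{\varepsilon} \, \Pr[\mathcal{M}(D') \in O].
\]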
Data Availability
No datasets were generated or analysed, because this work is purely theoretical and mathematical.
Acknowledgements
The first two authors are supported by the National Natural Science Foundation of China (No. 12131003). The third author is supported by the Natural Sciences and Engineering Research Council of Canada (NSERC) grant 06446 and the Natural Science Foundation of China (Nos. 11771386, 11728104). The fourth author is supported by the Natural Science Foundation of Shandong Province of China (No. ZR2020MA029).
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
A preliminary version (two-page extended abstract) of this paper appeared in Proceedings of the 9th International Conference on Computational Data and Social Networks (CSoNet), 2020, pp. xviii-xx.
About this article
Cite this article
Yuan, F., Xu, D., Du, D. et al. Differentially private k-center problems. Optim Lett (2024). https://doi.org/10.1007/s11590-023-02090-w
DOI: https://doi.org/10.1007/s11590-023-02090-w