skip to main content
research-article

Fast and Perfect Sampling of Subgraphs and Polymer Systems

Published:22 January 2024Publication History
Skip Abstract Section

Abstract

We give an efficient perfect sampling algorithm for weighted, connected induced subgraphs (or graphlets) of rooted, bounded degree graphs. Our algorithm utilizes a vertex-percolation process with a carefully chosen rejection filter and works under a percolation subcriticality condition. We show that this condition is optimal in the sense that the task of (approximately) sampling weighted rooted graphlets becomes impossible in finite expected time for infinite graphs and intractable for finite graphs when the condition does not hold. We apply our sampling algorithm as a subroutine to give near linear-time perfect sampling algorithms for polymer models and weighted non-rooted graphlets in finite graphs, two widely studied yet very different problems. This new perfect sampling algorithm for polymer models gives improved sampling algorithms for spin systems at low temperatures on expander graphs and unbalanced bipartite graphs, among other applications.

REFERENCES

  1. [1] Agostini Matteo, Bressan Marco, and Haddadan Shahrzad. 2019. Mixing time bounds for graphlet random walks. Inform. Process. Lett. 152 (2019), 105851.Google ScholarGoogle ScholarCross RefCross Ref
  2. [2] Konrad Anand, Andreas Göbel, Marcus Pappik, and Will Perkins. 2023. Perfect sampling for hard spheres from strong spatial mixing. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM’23), Vol. 275. Schloss Dagstuhl – Leibniz-Zentrum für Informatik, Dagstuhl, 38:1–38:18.Google ScholarGoogle Scholar
  3. [3] Konrad Anand and Mark Jerrum. 2022. Perfect sampling in infinite spin systems via strong spatial mixing. SIAM J. Comput. 51, 4 (2022), 1280–1295.Google ScholarGoogle Scholar
  4. [4] Anari Nima, Liu Kuikui, and Gharan Shayan Oveis. 2021. Spectral independence in high-dimensional expanders and applications to the hardcore model. SIAM J. Comput.0 (2021), FOCS20–1.Google ScholarGoogle ScholarCross RefCross Ref
  5. [5] Baskerville Kim, Grassberger Peter, and Paczuski Maya. 2007. Graph animals, subgraph sampling, and motif search in large networks. Physical Review E 76, 3 (2007), 036107.Google ScholarGoogle ScholarCross RefCross Ref
  6. [6] Bhuiyan Mansurul A., Rahman Mahmudur, Rahman Mahmuda, and Hasan Mohammad Al. 2012. Guise: Uniform sampling of graphlets for large graph analysis. In Proceedings of the 2012 IEEE 12th International Conference on Data Mining. IEEE, 91100.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Borgs Christian, Chayes Jennifer, Helmuth Tyler, Perkins Will, and Tetali Prasad. 2020. Efficient sampling and counting algorithms for the Potts model on \(\mathbb {Z}^d\) at all temperatures. In Proceedings of the 52nd Annual ACM SIGACT Symposium on Theory of Computing (STOC’20). 738751.Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. [8] Borgs Christian, Chayes Jennifer, Kahn Jeff, and Lovász László. 2013. Left and right convergence of graphs with bounded degree. Random Structures & Algorithms 42, 1 (2013), 128.Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. [9] Borgs Christian and Imbrie John Z.. 1989. A unified approach to phase diagrams in field theory and statistical mechanics. Communications in Mathematical Physics 123, 2 (1989), 305328.Google ScholarGoogle ScholarCross RefCross Ref
  10. [10] Bressan Marco. 2021. Efficient and near-optimal algorithms for sampling connected subgraphs. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing (STOC’21). 11321143.Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. [11] Bressan Marco, Chierichetti Flavio, Kumar Ravi, Leucci Stefano, and Panconesi Alessandro. 2017. Counting graphlets: Space vs time. In Proceedings of the 10th ACM International Conference on Web Search and Data Mining (WSDM’17). 557566.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. [12] Bressan Marco, Chierichetti Flavio, Kumar Ravi, Leucci Stefano, and Panconesi Alessandro. 2018. Motif counting beyond five nodes. ACM Transactions on Knowledge Discovery from Data (TKDD) 12, 4 (2018), 125.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Bressan Marco, Leucci Stefano, and Panconesi Alessandro. 2021. Faster motif counting via succinct color coding and adaptive sampling. ACM Transactions on Knowledge Discovery from Data (TKDD) 15, 6 (2021), 127.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. [14] Cannon Sarah and Perkins Will. 2020. Counting independent sets in unbalanced bipartite graphs. In Proceedings of the Thirty-First Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’20). 14561466.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Carlson Charles, Davies Ewan, and Kolla Alexandra. 2020. Efficient algorithms for the Potts model on small-set expanders. arXiv:2003.01154. Retrieved from https://arxiv.org/abs/2003.01154Google ScholarGoogle Scholar
  16. [16] Chen Xiaowei, Li Yongkun, Wang Pinghui, and Lui John C. S.. 2016. A general framework for estimating graphlet statistics via random walk. Proceedings of the VLDB Endowment 10, 3 (2016), 253–264.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Chen Zongchen, Galanis Andreas, Goldberg Leslie A, Perkins Will, Stewart James, and Vigoda Eric. 2021. Fast algorithms at low temperatures via Markov chains. Random Structures & Algorithms 58, 2 (2021), 294321.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Chen Zongchen, Galanis Andreas, Štefankovič Daniel, and Vigoda Eric. 2022. Sampling colorings and independent sets of random regular bipartite graphs in the non-uniqueness region. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’22). SIAM, 21982207.Google ScholarGoogle ScholarCross RefCross Ref
  19. [19] Chen Zongchen, Liu Kuikui, and Vigoda Eric. 2020. Rapid mixing of Glauber dynamics up to uniqueness via contraction. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS’20). IEEE, 13071318.Google ScholarGoogle Scholar
  20. [20] Chen Zongchen, Liu Kuikui, and Vigoda Eric. 2021. Optimal mixing of Glauber dynamics: Entropy factorization via high-dimensional expansion. In Proceedings of the 53rd Annual ACM SIGACT Symposium on Theory of Computing (STOC’21). 15371550.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Coulson Matthew, Davies Ewan, Kolla Alexandra, Patel Viresh, and Regts Guus. 2020. Statistical physics approaches to unique games. In Proceedings of the 35th Computational Complexity Conference (CCC’20).Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. [22] Dyer Martin, Goldberg Leslie Ann, Greenhill Catherine, and Jerrum Mark. 2004. The relative complexity of approximate counting problems. Algorithmica 38, 3 (2004), 471500.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Friedli Sacha and Velenik Yvan. 2017. Statistical Mechanics of Lattice Systems: A Concrete Mathematical Introduction. Cambridge University Press.Google ScholarGoogle ScholarCross RefCross Ref
  24. [24] Friedrich Tobias, Göbel Andreas, Krejca Martin S, and Pappik Marcus. 2020. Polymer dynamics via cliques: New conditions for approximations. arXiv:2007.08293. Retrieved from https://arxiv.org/abs/2007.08293Google ScholarGoogle Scholar
  25. [25] Galanis Andreas, Goldberg Leslie Ann, and Stewart James. 2021. Fast algorithms for general spin systems on bipartite expanders. ACM Transactions on Computation Theory (TOCT) 13, 4 (2021), 118.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. [26] Galanis Andreas, Goldberg Leslie Ann, and Stewart James. 2022. Fast mixing via polymers for random graphs with unbounded degree. Information and Computation 285, Part B (2022), 104894.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. [27] Galanis Andreas, Štefankovič Daniel, and Vigoda Eric. 2016. Inapproximability of the partition function for the antiferromagnetic Ising and hard-core models. Combinatorics, Probability and Computing 25, 4 (2016), 500559.Google ScholarGoogle ScholarCross RefCross Ref
  28. [28] Goldreich Oded and Ron Dana. 1997. Property testing in bounded degree graphs. In Proceedings of the 29th Annual ACM Symposium on Theory of Computing (STOC’97). 406415.Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. [29] Grochow Joshua A. and Kellis Manolis. 2007. Network motif discovery using subgraph enumeration and symmetry-breaking. In Annual International Conference on Research in Computational Molecular Biology. Springer, 92106.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Gruber Christian and Kunz Hervé. 1971. General properties of polymer systems. Communications in Mathematical Physics 22, 2 (1971), 133161.Google ScholarGoogle ScholarCross RefCross Ref
  31. [31] Haggstrom Olle and Nelander Karin. 1999. On exact simulation of Markov random fields using coupling from the past. Scandinavian Journal of Statistics 26, 3 (1999), 395411.Google ScholarGoogle ScholarCross RefCross Ref
  32. [32] He Kun, Wang Chunyang, and Yin Yitong. 2022. Sampling Lovász local lemma for general constraint satisfaction solutions in near-linear time. In Proceedings of the 2022 IEEE 63rd Annual Symposium on Foundations of Computer Science (FOCS’22). IEEE, 147158.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] He Kun, Wu Kewen, and Yang Kuan. 2023. Improved bounds for sampling solutions of random CNF formulas. In Proceedings of the 2023 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’23). SIAM, 33303361.Google ScholarGoogle ScholarCross RefCross Ref
  34. [34] Tyler Helmuth, Matthew Jenssen, and Will Perkins. 2023. Finite-size scaling, phase coexistence, and algorithms for the random cluster model on random graphs. In Annales de l’Institut Henri Poincare (B) Probabilites et statistiques, Vol. 59. Institut Henri Poincaré, 817–848.Google ScholarGoogle Scholar
  35. [35] Helmuth Tyler, Perkins Will, and Regts Guus. 2020. Algorithmic Pirogov–Sinai theory. Probability Theory and Related Fields 176, 3 (2020), 851895.Google ScholarGoogle ScholarCross RefCross Ref
  36. [36] Huber Mark. 2004. Perfect sampling using bounding chains. The Annals of Applied Probability 14, 2 (2004), 734753.Google ScholarGoogle ScholarCross RefCross Ref
  37. [37] Janson Svante, Rucinski Andrzej, and Luczak Tomasz. 2011. Random Graphs. John Wiley & Sons.Google ScholarGoogle Scholar
  38. [38] Jenssen Matthew, Keevash Peter, and Perkins Will. 2020. Algorithms for #BIS-hard problems on expander graphs. SIAM J. Comput. 49, 4 (2020), 681710.Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. [39] Jenssen Matthew, Potukuchi Aditya, and Perkins Will. 2022. Approximately counting independent sets in bipartite graphs via graph containers. In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’22). SIAM, 499516.Google ScholarGoogle ScholarCross RefCross Ref
  40. [40] Jha Madhav, Seshadhri C., and Pinar Ali. 2015. Path sampling: A fast and provable method for estimating 4-vertex subgraph counts. In Proceedings of the 24th International Conference on World Wide Web (WWW’15). 495505.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Kahkeshani Reza. 2013. A generalization of the Catalan numbers. Journal of Integer Sequences 16, 2 (2013), 3.Google ScholarGoogle Scholar
  42. [42] Kashtan Nadav, Itzkovitz Shalev, Milo Ron, and Alon Uri. 2004. Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs. Bioinformatics 20, 11 (2004), 17461758.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Koteckỳ Roman and Preiss David. 1986. Cluster expansion for abstract polymer models. Communications in Mathematical Physics 103, 3 (1986), 491498.Google ScholarGoogle ScholarCross RefCross Ref
  44. [44] Kötzing Timo and Krejca Martin S.. 2019. First-hitting times under drift. Theoretical Computer Science 796 (2019), 5169.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Liao Chao, Lin Jiabao, Lu Pinyan, and Mao Zhenyu. 2019. Counting independent sets and colorings on random regular bipartite graphs. In Proceedings of the Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM’19). Schloss Dagstuhl-Leibniz-Zentrum fuer Informatik.Google ScholarGoogle Scholar
  46. [46] Lu Xuesong and Bressan Stéphane. 2012. Sampling connected induced subgraphs uniformly at random. In Proceedings of the International Conference on Scientific and Statistical Database Management. Springer, 195212.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Matsuno Ryuta and Gionis Aristides. 2020. Improved mixing time for k-subgraph sampling. In Proceedings of the 2020 SIAM International Conference on Data Mining. SIAM, 568576.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Paramonov Kirill, Shemetov Dmitry, and Sharpnack James. 2019. Estimating graphlet statistics via lifting. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining. 587595.Google ScholarGoogle ScholarDigital LibraryDigital Library
  49. [49] Propp James Gary and Wilson David Bruce. 1996. Exact sampling with coupled Markov chains and applications to statistical mechanics. Random Structures & Algorithms 9, 1-2 (1996), 223252.Google ScholarGoogle ScholarDigital LibraryDigital Library
  50. [50] Read-McFarland Andrew and Štefankovič Daniel. 2021. The hardness of sampling connected subgraphs. In Proceedings of the Latin American Symposium on Theoretical Informatics. Springer, 464475.Google ScholarGoogle Scholar
  51. [51] Shervashidze Nino, Vishwanathan SVN, Petri Tobias, Mehlhorn Kurt, and Borgwardt Karsten. 2009. Efficient graphlet kernels for large graph comparison. In Proceedings of the Artificial Intelligence and Statistics. PMLR, 488495.Google ScholarGoogle Scholar
  52. [52] Sly Allan. 2010. Computational transition at the uniqueness threshold. In Proceedings of the 2010 IEEE 51st Annual Symposium on Foundations of Computer Science (FOCS’10). IEEE, 287296.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. [53] Sly Allan and Sun Nike. 2012. The computational hardness of counting in two-spin models on d-regular graphs. In Proceedings of the 2012 IEEE 53rd Annual Symposium on Foundations of Computer Science. IEEE, 361369.Google ScholarGoogle ScholarDigital LibraryDigital Library
  54. [54] Weitz Dror. 2006. Counting independent sets up to the tree threshold. In Proceedings of the 38th Annual ACM Symposium on Theory of Computing. 140149.Google ScholarGoogle ScholarDigital LibraryDigital Library
  55. [55] Şerban Nacu and Yuval Peres. 2005. Fast simulation of new coins from old. The Annals of Applied Probability 15, 1A (2005), 93115.Google ScholarGoogle Scholar

Index Terms

  1. Fast and Perfect Sampling of Subgraphs and Polymer Systems

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image ACM Transactions on Algorithms
            ACM Transactions on Algorithms  Volume 20, Issue 1
            January 2024
            297 pages
            ISSN:1549-6325
            EISSN:1549-6333
            DOI:10.1145/3613497
            • Editor:
            • Edith Cohen
            Issue’s Table of Contents

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 22 January 2024
            • Online AM: 10 November 2023
            • Accepted: 23 October 2023
            • Revised: 15 May 2023
            • Received: 1 June 2022
            Published in talg Volume 20, Issue 1

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          Full Text

          View this article in Full Text.

          View Full Text