Skip to main content
Log in

An Empirical Evaluation of Algorithms for Link Prediction

  • Published:
Information Systems Frontiers Aims and scope Submit manuscript

Abstract

Online social networks (OSNs) analysis has been widely used in the field of information systems (IS), thus link prediction, one of the most important core techniques of OSNs analysis, plays a vital role in the development of IS. Despite the recent development of numerous link prediction approaches, there is still a lack of comprehensive studies that measure and evaluate their performance, which hinders the rational selection and full utilization of existing prediction approaches. This study proposes a novel taxonomy of link prediction approaches based on their prediction principles. Furthermore, it selects eighteen representative approaches from various categories to perform an empirical evaluation on six real-world benchmark datasets. The features of different types of predication approaches have been analyzed based evaluation test results. The research provides researchers with improved understandings on link prediction approaches and offers insightful performance related information to practitioners for developing more effective information systems.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data Availability

The data sets supporting the results of this article are included within the article and its additional files.

References

  • Adamic, L. A., & Adar, E. (2003). Friends and neighbors on the web. Social Networks, 25(3), 211–230.

    Article  Google Scholar 

  • Al Hasan, M., & Zaki, M. J. (2011). A survey of link prediction in social networks. Social network data analytics, 2011, 243–275.

    Article  Google Scholar 

  • Awal, G. K., & Bharadwaj, K. (2019). Leveraging collective intelligence for behavioral prediction in signed social networks through evolutionary approach. Information System Frontiers, 21(2), 417–439.

    Article  Google Scholar 

  • Aydin, M. N., & Perdahci, N. Z. (2019). Dynamic network analysis of online interactive platform. Information System Frontiers, 21(2), 229–240.

    Article  Google Scholar 

  • Brin, S., & Page, L. (1998). The anatomy of a large-scale hyper textual web search engine. Computer Networks and ISDN Systems, 30(1), 107–117.

    Article  Google Scholar 

  • Cao, S., Lu, W., & Xu, Q. (2015). Grarep: Learning graph representations with global structural information. In ACM International Conference on Information and Knowledge Management (CIKM 2015) (pp. 891–900). Melbourne, VIC, Australia: ACM.

  • Chung, W., Zhang, Y., & Pan, J. (2023). A Theory-based deep-learning approach to detecting disinformation in financial social media. Information Systems Frontiers, 25(2), 473–492.

    Google Scholar 

  • Daud, N. N., Ab Hamid, S. H., Saadoon, M., Sahran, F., & Anuar, N. B. (2020). Applications of link prediction in social networks: A review. Journal of Network and Computer Applications, 166, 102716.

  • Fang, X., Hu, P. J., Li, Z., & Tsai, W. (2013). Predicting adoption probabilities in social networks. Information Systems Research, 24(1), 128–145.

    Article  Google Scholar 

  • Fogués, R. L., Such, J. M., Espinosa, A., & García-Fornes, A. (2014). BFF: A tool for eliciting tie strength and user communities in social networking services. Information System Frontiers, 16(2), 225–237.

    Article  Google Scholar 

  • Gao, H., & Huang, H. (2018). Deep attributed network embedding. In International Joint Conference on Artificial Intelligence (IJCAI 2018) (pp. 3364–3370). Stockholm, Sweden: IJCAI.

  • Grover, A., & Leskovec, J. (2016). Node2vec: Scalable feature learning for networks. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2016), (pp. 855–864). ACM: San Francisco, CA, USA.

  • Haghani, S., & Keyvanpour, M. R. (2019). A systemic analysis of link prediction in social network. Artificial Intelligence Review, 52(3), 1961–1995.

    Article  Google Scholar 

  • Hanely, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143, 29–36.

    Article  Google Scholar 

  • Huang, T., Zhou, L., Jin, Z., Huang, Y., & Lü, K. (2020a). A multi-component attribute network embedding for link prediction. In Conference on Business Informatics (CBI 2020) (pp. 58–65). Antwerp, Belgium: CBI.

  • Huang, T., Zhou, L., Wang, L., Du, G., & Lü, K. (2020b). Attributed network embedding with community preservation. In International Conference on Data Science and Advanced Analytics (DSAA 2020) (pp. 334–343). Sydney, Australia: DSAA.

  • Jaccard, P. (1901). Étude comparative de la distribution florale dans une portion des Alpes et des Jura. Bulletin of the Chemical Society of Ethiopia, 37, 547–579.

    Google Scholar 

  • Jonathan, L. H., Joseph, A. K., Terveen, L. G., & Riedl, J. (2004). Evaluating collaborative filtering recommender systems. ACM Transactions on Information Systems, 22(1), 5–53.

    Article  Google Scholar 

  • Kane, G. C., & Ransbotham, S. (2016). Research note-content and collaboration: an affiliation network approach to information quality in online peer production communities. Information Systems Research, 27(2), 424–439.

    Article  Google Scholar 

  • Katz, L. (1953). A new status index derived from sociometric analysis. Psychometrika, 18(1), 39–43.

    Article  Google Scholar 

  • Lü, L., & Zhou, T. (2011). Link prediction in complex networks: A survey. Physica a: Statistical Mechanics and Its Applications, 390(6), 1150–1170.

    Article  Google Scholar 

  • Ma, G., Yan, H., Qian, Y., Wang, L., Dang, C., & Zhao, Z. (2021). Path-based estimation for link prediction. International Journal of Machine Learning and Cybernetics, 12, 2443–2458.

    Article  Google Scholar 

  • Ma, X., Tan, S., Xie, X., Zhong, X., & Deng, J. (2022). Joint multi-label learning and feature extraction for temporal link prediction. Pattern Recognition, 121, 108216.

    Article  Google Scholar 

  • Martínez, V., Berzal, F., Cubero, J. (2016). A survey of link prediction in complex networks. ACM Computer Survey, 49(4), 69, 33 page.

  • Matook, S., Brown, S., & Rolf, J. (2015). Forming an intention to act on recommendations given via online social networks. European Journal of Information Systems, 24(1), 76–92.

    Article  Google Scholar 

  • Newman, M. E. (2001). Clustering and preferential attachment in growing networks. Physical Review E, 64(2), 025102.

    Article  Google Scholar 

  • Newman, M. E. J., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69, 026113.

    Article  Google Scholar 

  • Nowell, D. L., & Kleinberg, J. (2007). The link prediction problem for social networks. Journal of the American Society for Information Science and Technique, 58(7), 1019–1031.

    Article  Google Scholar 

  • Perozzi, B., Al-Rfou, R., & Skiena, S. (2014). Deepwalk: Online learning of social representations. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2014) (pp. 701–710). New York City, USA: ACM.

  • Rosvall, M., & Bergstrom, C. T. (2008). Maps of random walks on complex networks reveal community structure. Proceedings of the National Academy of Sciences, 105(4), 1118–1123.

    Article  Google Scholar 

  • Sasidharan, S., Santhanam, R., Brass, D. J., & Sambamurthy, V. (2012). The effects of social network structure on enterprise systems success: a longitudinal multilevel analysis. Information Systems Research, 23(3–1), 658–678.

    Article  Google Scholar 

  • Shi, W., Huang, L., Wang, C. D., Li, J. H., Tang, Y., & Fu, C. (2019). Network embedding via community based variational autoencoder. IEEE Access, 7, 25323–25333.

    Article  Google Scholar 

  • Tang, J., Qu, M., Wang, M., Zhang, M., Yan, J., & Mei, Q. (2015). Line: Large-scale information network embedding. In ACM Web Conference (WWW 2015) (pp. 1067–1077). Florence, Italy: ACM.

  • Tong, H., Faloutsos, C., & Pan, J. Y. (2006). Fast random walk with restart and its applications. In IEEE International Conference on Data Mining (ICDM 2006) (pp. 613–622). Hong Kong, China: IEEE.

  • Traud, A. L., Mucha, P. J., & Porter, M. A. (2012). Social structure of facebook networks. Physica A: Statistical Mechanics and Its Applications, 391(16), 4165–4180.

    Article  Google Scholar 

  • Vaghefi, M. S., & Nazareth, D. L. (2021). Mining online social networks: deriving user preferences through node embedding. Journal of the Association for Information Systems, 22(6), 1625–1658.

    Article  Google Scholar 

  • Wang, X., Cui, P., Wang, J., Pei, J., Zhu, W., & Yang, S. (2017). Community preserving network embedding. In AAAI Conference on Artificial Intelligence (AAAI 2017) (pp. 203–209). San Francisco, California, USA: AAAI.

  • Wang, D., Cui, P., & Zhu, W. (2016). Structural deep network embedding. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2016) (pp. 1225–1234). San Francisco, CA, USA: ACM.

  • Wang, P., Xu, B., Wu, Y., & Zhou, X. (2015). Link prediction in social networks: The state-of-the-art. Science China Information Sciences, 58(1), 1–38.

    Google Scholar 

  • Wenningera, H., Krasnovab, H., & Buxmann, P. (2019). Understanding the role of social networking sites in the subjective well-being of users: A diary study. European Journal of Information Systems, 28(2), 126–148.

    Article  Google Scholar 

  • Xiao, Y., Li, R., Lu, X., & Liu, Y. (2021). Link prediction based on feature representation and fusion. Information Sciences, 548, 1–17.

    Article  Google Scholar 

  • Zhang, Z., Yang, H., Bu, J., Zhou, S., Yu, P., Zhang, J., & Wang, C. (2018). ANRL: Attributed network representation learning via deep neural networks. In International Joint Conference on Artificial Intelligence (IJCAI 2018) (pp. 3155–3161). Stockholm, Sweden: IJCAI.

  • Zhang, Y., Zhang, X., Wang, J., Liang, H., Lei, W., Sun, Z., Jatowt, A., & Yang, Z. (2021). Generalized relation learning with semantic correlation awareness for link prediction. In AAAIConference on Artificial Intelligence (AAAI 2021) (pp. 4679–4687). Virtual Event.

  • Zhao, Z., Gou, Z., Du, Y., Ma, J., Li, T., & Zhang, R. (2022). A novel link prediction algorithm based on inductive matrix completion. Expert Systems with Applications, 188, 116033.

    Article  Google Scholar 

  • Zhou, S., Yang, H., Wang, X., Bu, J., Ester, M., Yu, P., Zhang, J., & Wang, C. (2018). PRRE: Personalized relation ranking embedding for attributed networks. In ACM International Conference on Information and Knowledge Management (CIKM 2018) (pp.823–832). Torino, Italy: ACM.

Download references

Acknowledgements

This research is supported by the National Natural Science Foundation of China (62062066, 61762090, 61966036 and 62276227), Yunnan Fundamental Research Projects (202201AS070015); Yunnan Key Laboratory of Intelligent Systems and Computing (202205AG070003), the Blockchain and Data Security Governance Engineering Research Center of Yunnan Provincial Department of Education, University Key Laboratory of Internet of Things Technology and Application of Yunnan Province.

Funding

a) National Natural Science Foundation of China Award Number: 61966036.

b) National Natural Science Foundation of China Award Number: 61762090.

c) National Natural Science Foundation of Ching Award Number: 62062066.

d) Yunnan Fundamental Research Projects Award Number: 202201AS070015.

Author information

Authors and Affiliations

Authors

Contributions

Lihua Zhou: Conceptualization, Methodology, Software. Tong Huang: Data curation, Writing- Original draft preparation. Lizhen Wang & Hongmei Chen: Writing- Reviewing and Editing. Kevin Lü: Supervision. Guowan Du: Writing the algorithm code.

Corresponding author

Correspondence to Lihua Zhou.

Ethics declarations

Ethics Approval

Not applicable.

Consent to Participate

Not applicable.

Consent for Publication

Written informed consent for publication was obtained from all participants.

Competing Interests

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

Appendix

The “Ave “ in Table 7 means column average, and bold numbers represent the best results in the column.

Table 7 Link prediction performance comparisons of different approaches on all datasets

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Huang, T., Zhou, L., Lü, K. et al. An Empirical Evaluation of Algorithms for Link Prediction. Inf Syst Front (2023). https://doi.org/10.1007/s10796-023-10440-3

Download citation

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10796-023-10440-3

Keywords

Navigation