Abstract:Hidden Naive Bayesian Model (HNB) and Tree Augmented Naive Bayesian Model (TAN) alleviate the strong independence assumption of Local Naive Bayesian Model (LNB) by mining the intrinsic associations between co-neighboring nodes, but ignore that there are both closely correlated nodes and relatively independent nodes in the real network. On this basis, a package criterion is designed, which divides the co-neighboring nodes into correlated co-neighboring nodes and independent co-neighboring nodes according to the degree of association. Then, packaging HNB and TAN respectively, so that the packaged-based hybrid naive Bayesian models are obtained. On FWFW networks with high average number of co-neighbors, the AUC values of the HNB and TAN models after packaging are increased by 12% and 11.6%, respectively. The experimental results show that the proposed method can effectively improve the link prediction performance and has good robustness.
[1] BATOOL K, NIAZI M A. Modeling the internet of things: a hybrid modeling approach using complex networks and agent-based models[J]. Complex Adaptive Systems Modeling, 2017, 5(1): 1-19. [2] HUANG Q J, ZHANG X, WANG X J, et al. The degree-related clustering coefficient and its application to link prediction[J]. Physica A: Statistical Mechanics and Its Applications, 2016, 454: 24-33. [3] YANG Y, LICHTENWALTER R N, CHAWLA N V. Evaluating link prediction methods[J]. Knowledge and Information Systems, 2015, 45(3): 751-782. [4] LI S B, HUANG J W, ZHANG Z G, et al. Similarity-based future common neighbors model for link prediction in complex networks[J]. Scientific Reports, 2018, 8(1): 1-11. [5] MLIKA Z, GOONEWARDENA M, AJIB W, et al. User-base-station association in HetSNets: complexity and efficient algorithms[J]. IEEE Trans on Vehicular Technology, 2017, 66(2): 1484-1495. [6] ZHANG L L, LI J, ZHANG Q L, et al. Domain knowledge-based link prediction in customer-product bipartite graph for product recommendation[J]. International Journal of Information Technology & Decision Making, 2019, 18(1): 311-338. [7] LÜ L Y, ZHOU T. Link Prediction in complex networks: a survey[J]. Physica A: Statistical Mechanics and Its Applications,2011, 390(6): 1150-1170. [8] LORRAIN F, WHITE H C. Structural equivalence of individuals in social networks[J]. The Journal of Mathematical Sociology, 1971, 1(1): 49-80. [9] ADAMIC L A, ADAR E. Friends and neighbors on the Web[J]. Social Networks, 2003, 25(3): 211-230. [10] ZHOU T, LÜ L Y, ZHANG Y C. Predicting missing links via local information[J]. The European Physical Journal B, 2009, 71(4): 623-630. [11] 郁湧, 王莹港, 罗正国, 等. 基于聚类系数和节点中心性的链路预测算法[J]. 清华大学学报(自然科学版), 2022, 62(1): 98-104. YU Y, WANG Y G, LUO Z G, et al. Link prediction algorithm based on clustering coefficient and node centrality[J]. Journal of Tsinghua University(Science and Technology), 2022, 62(1): 98-104. [12] KLEIN D J, RANDI M. Resistance distance[J]. Journal of Mathematical Chemistry, 1993, 12(1): 81-95. [13] BRIN S, PAGE L. The anatomy of a large-scale hypertextual web search engine[J]. Computer Networks and ISDN Systems, 1998, 30(1): 107-117. [14] 吕亚楠, 韩华, 贾承丰, 等. 基于有偏向的重启随机游走链路预测算法[J]. 复杂系统与复杂性科学, 2018, 15(4): 17-24. LÜ Y N, HAN H, JIA C F, et al. Link prediction algorithm based on biased random walk with restart[J]. Complex Systems and Complexity Science, 2018, 15(4): 17-24. [15] KATZ L. A new status index derived from sociometric analysis[J]. Psychometrika, 1953, 18(1): 39-43. [16] LIU Z, ZHANG Q M, Lü L Y, et al. Link prediction in complex networks: a local naive Bayes model[J]. Europhysics Letters, 2011, 96(4): 48007. [17] 伍杰华, 朱岸青, 蔡雪莲, 等. 基于隐朴素贝叶斯模型的社会关系推荐[J]. 计算机应用研究, 2014, 31(5): 1381-1384. WU J H, ZHU A Q, CAI X L, et al. Hidden naÏve Bayesian model for social relation recommendation[J]. Application Research of Computer, 2014, 31(5): 1381-1384. [18] WU J. A generalized tree augmented naive Bayes link prediction model[J]. Journal of computational science, 2018, 27: 206-217. [19] HEYMANS J J, ULANOWIC R E, BONDAVALLI C. Network analysis of the South Florida Everglades graminoid marshes and comparison with nearby cypress ecosystems[J]. Ecological Modelling, 2002, 149(2): 5-23. [20] ALMUNIA J, BASTERRETXEA G, ARISTEGUI J, et al. Benthic-pelagic switching in a coastal subtropical lagoon[J]. Estuarine Coastal and Shelf Science, 1999, 49(3): 363-384. [21] BATAGELJ V, MRVAR A. Pajek-program for large network analysis[J]. Connections, 1998, 21(2): 47-57. [22] WATTS D J, STROGATZ S H. Collective dynamics of ‘small-world’ networks[J]. Nature, 1998, 393(6684): 440-442. [23] ADAMIC L A, GLANCE N. The political blogosphere and the 2004 US election: divided they blog[C]// Proceedings of the 3rd International Workshop on Link Discovery. New York: ACM Press, 2005: 36-43. [24] GUIMERA R, DANOD L, DIAZ-GUILEAR A, et al. Self-similar community structure in a network of human interactions[J]. Physical Review E, 2003, 68(6): 65-73. [25] ZENG G P, ZENG E. On the three-way equivalence of AUC in credit scoring with tied scores[J]. Communications in Statistics-Theory and Methods, 2019, 48(7): 1635-1650. [26] WU Z H, LIN Y F, ZHAO Y J, et al. Improving local clustering based top-L link prediction methods via asymmetric link clustering information[J]. Physica A: Statistical Mechanics and Its Applications, 2018, 492: 1859-1874.