|
|
Subject Words Extraction Algorithm Based on Keyword Co-occurrence Network |
ZHANG Shu’an1, WANG Xi2, DAI Jipeng1, SUI Yi1, SUN Rencheng1
|
1. School of Computer Science and Technology, QingDao University, Qingdao 266071, China; 2. Communication Dispatching Department, Qingdao Emergency Center,Qingdao 266035, China |
|
|
Abstract Aiming at the problems of inaccurate keywords extraction and only considering single correlation in subject words extraction, a subject words extraction algorithm combining integration idea with complex network is proposed. Firstly, the keywords of topic data are extracted through the integration algorithm to improve the accuracy of keywords extraction. Secondly, the traditional word co-occurrence formula is improved to calculate the co-occurrence degree of keywords, and a keywords co-occurrence network is established. Based on the network, the optimal connected subgraph is obtained. At the same time, the importance of keywords is measured by taking the centrality of node degree as the weight, and the subject words are mapped. Finally, the micro-blog topic data set is used to verify the example, which proves that the algorithm is effective and better than the traditional word co-occurrence algorithm, and it is applied in the Qingdao community topic data set.
|
Received: 08 September 2021
Published: 19 April 2023
|
|
|
|
|
No related articles found! |
|
|
|
|