A Topic Text Network Construction Method Based on PL-LDA Model
ZHANG Zhiyuan1,2, HUO Weigang1
1. School of Computer Science and Technology, Civil Aviation University of China, Tianjin 300300, China; 2. College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
Abstract Labeled LDA can mine the probabilities of words under a given topic, but it cannot analyze the associations among these topic words. Although the correlation between word pairs can be calculated with PMI (Pointwise Mutual Information), their relationship to the given topic is lost. Motivated by PMI's operation of counting word pairs within a fixed window, this paper proposes a topic model called PL-LDA (Pointwise Labeled LDA), which computes the joint probabilities of word pairs under a given topic. Experimental results on aviation safety reports show that the model produces results with good interpretability. Based on the output of PL-LDA, this paper constructs a topic text network that gives analysts rich and effective information, reflecting the distribution of topic words and displaying the complex relationships among them.
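The fixed-window pair counting that the abstract attributes to PMI can be illustrated with a minimal sketch. This is plain PMI, not PL-LDA, and it carries no topic information. The function name `windowed_pmi`, the `window` parameter, and the choice to estimate probabilities as window frequencies are assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def windowed_pmi(docs, window=5):
    """Illustrative PMI over fixed-size sliding windows (hypothetical helper).

    docs: list of token lists.
    Returns {(w1, w2): pmi} with w1 < w2 in sorted order.
    """
    word_counts = Counter()   # number of windows containing each word
    pair_counts = Counter()   # number of windows containing both words
    n_windows = 0
    for tokens in docs:
        if not tokens:
            continue
        # Slide the window; a document shorter than the window yields one window.
        for start in range(max(1, len(tokens) - window + 1)):
            seen = set(tokens[start:start + window])
            n_windows += 1
            word_counts.update(seen)
            pair_counts.update(combinations(sorted(seen), 2))
    pmi = {}
    for (w1, w2), n12 in pair_counts.items():
        # PMI = log( p(w1, w2) / (p(w1) * p(w2)) ), assuming each
        # probability is estimated as a fraction of all windows.
        pmi[(w1, w2)] = math.log(
            n12 * n_windows / (word_counts[w1] * word_counts[w2])
        )
    return pmi


docs = [
    ["engine", "failure", "engine", "failure"],
    ["runway", "incursion", "runway", "incursion"],
]
scores = windowed_pmi(docs, window=2)
```

In this toy input, `scores` holds one entry per pair that shares a window, and pairs that never co-occur are absent. The scores say nothing about which topic a pair belongs to, which is the gap the abstract describes.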
Received: 01 May 2015
Published: 24 February 2025