Topic Modeling of Political Dynamics with Shifted Cosine Similarity

Yifan Luo, Tao Wan, Zengchang Qin: Topic Modeling of Political Dynamics with Shifted Cosine Similarity. Integrated Uncertainty in Knowledge Modelling and Decision Making, vol. 13199, Springer, Cham, 2022, ISBN: 978-3-030-98017-7.

Abstract

Topic modeling with community detection can be used to explore the latent semantic structure of documents, we can utilize a network, i.e., a graph to depict the semantic relation between words. In some network based topic models, in order to obtain a network with obvious community structure, the similarity between words (vertices) is essential. Word embeddings trained from a large corpus empirically perform as well as in rich semantic representation, thus this research is intended to construct a novel similarity in a network based topic model (NAM). In this paper, we first intuitively propose a similarity measure based on shifted cosine similarity between word embeddings. This similarity is exploited to replace the similarity based on typical point-wise mutual information (PMI). Secondly, based on different similarity measures, topics of corpus in a global period are induced by NAM. Finally, we use NAM to capture the dynamic changes of political topics in China and interpret the dynamic processes using historical background. Although our similarity measure introduces semantic differences caused by the difference between data sets and has one more parameter, the experimental results show the effectiveness of our new proposed measure.

See all documents refering Cortext Manager

* Information to the authors (GDPR)