Hot topic extraction based on Chinese Microblog's Features topic model
Citations Over TimeTop 10% of 2016 papers
Abstract
Microblog, with its wide participation and convenience, has changed the way that people get news about current events. In recent years, lots of breaking news and hot topics are released by microblog platform firstly, as well as its much wider and more distribution than traditional media platform. Extracting these useful information in real-time will help us to grasp the latest and hottest topics which currently discussed by microblog users. However, due to the micrblog content is short and sparse, traditional topic extraction methods can not be used directly. In this paper, we propose a new topic model named Microblog Features Latent Dirichlet Allocation (MF-LDA) to extract microblog topics. We incorporate five microblog's unique features: support, comment, retweet, publish time and user authority into LDA model. These features are utilized to compute each microblog's attention value, authority value and word frequency. And the higher feature value of a term, the greater probability of a hot topic it be. Experimental results on real datasets demonstrated our MF-LDA model is more efficient and accurate than other methods in hot topic extraction.
Related Papers
- → Topic Modeling on News Articles using Latent Dirichlet Allocation(2022)10 cited
- → Hot topic extraction based on Chinese Microblog's Features topic model(2016)6 cited
- → Making Recommendations on Microblogs through Topic Modeling(2014)4 cited
- → Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey(2017)164 cited
- → Topic Modelling of Swedish Newspaper Articles about Coronavirus: a Case Study using Latent Dirichlet Allocation Method(2023)2 cited