Semi-supervised Dirichlet-Hawkes process with applications of topic detection and tracking in Twitter
Citations Over TimeTop 19% of 2016 papers
Abstract
Understanding ongoing topics and their evolutions in social media is of great importance. Although topic analysis is not a novel research question, social media environment has presented new challenges. First, with insufficient co-occurrence information, short text have undermined many word co-occurrence oriented topic models' applicability. Second, real time message streams make traditional discretized topic tracking methods hard to function. Third, topics' evolution mechanisms are of great importance in social media context, but many studies have ignored them. Forth, topics have more complicated correlation among each other. Considering the existing problems, this paper has proposed a Semi-Supervised Dirichlet-Hawkes Process (SDHP) to deal with topic detection and tracking from social media. The main contributions of this paper are reflected in: (1) SDHP can handle short text problem efficiently; (2) SDHP can track topics from continuous message stream; (3) SDHP can reveal topics' underlying evolution patterns; and (4) SDHP can capture topics' correlations We have evaluated SDHP's ability in both topic detection and tracking in 8 real datasets from Twitter, and the algorithm's performances are very promising.
Related Papers
- → Topic Modeling on News Articles using Latent Dirichlet Allocation(2022)10 cited
- → Hot topic extraction based on Chinese Microblog's Features topic model(2016)6 cited
- → Automatic Topic Clustering Using Latent Dirichlet Allocation with Skip-Gram Model on Final Project Abstracts(2017)2 cited
- → Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey(2017)164 cited
- → Topic Modelling of Swedish Newspaper Articles about Coronavirus: a Case Study using Latent Dirichlet Allocation Method(2023)2 cited