A Novel Hybrid HDP-LDA Model for Sentiment Analysis
Citations over time: top 21% of 2013 papers
Abstract
Sentiment analysis studies public opinion towards an entity and is an important research area in data mining. Many sentiment analysis models have recently been proposed, both supervised and unsupervised. However, the rise of big data has undermined the role of supervised models, and unsupervised ones are drawing increasing attention. Most current unsupervised methods are based on Latent Dirichlet Allocation (LDA) and require the number of aspects to be specified in advance, which makes them subjective. In addition, these methods treat factual words and opinion words alike and assume that each sentence contains only one aspect. These limitations make the existing unsupervised methods unsatisfactory. To address these problems, this paper proposes a novel hybrid Hierarchical Dirichlet Process-Latent Dirichlet Allocation (HDP-LDA) model. The model automatically determines the number of aspects, distinguishes factual words from opinion words, and extracts aspect-specific sentiment words. Experimental results show that the model clearly captures the aspects people mention and the sentiment words they use for each aspect, improving the performance of sentiment analysis. Finally, we compare our model with three influential topic models, namely JST, ASUM, and MaxEnt-LDA, on online restaurant reviews and find that it performs well.
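The contrast between fixing the number of aspects in advance (as in LDA) and letting the data determine it (as in an HDP) can be illustrated with the Chinese restaurant process, a standard representation of the Dirichlet process on which HDPs are built. The following is a minimal sketch only, not the hybrid HDP-LDA model proposed in the paper; the function name `crp_assignments`, the concentration parameter `alpha`, and the idea of treating each cluster as an "aspect" are assumptions made for illustration.

```python
import random


def crp_assignments(n_items, alpha, seed=0):
    """Sample cluster ("aspect") assignments from a Chinese restaurant process.

    Illustrative only: the number of clusters is not specified in advance;
    it grows with the data and is controlled by the concentration `alpha`.
    """
    rng = random.Random(seed)
    counts = []        # counts[k] = number of items already assigned to cluster k
    assignments = []   # assignments[i] = cluster index chosen for item i
    for _ in range(n_items):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is opened with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new cluster
        else:
            counts[k] += 1     # join an existing cluster
        assignments.append(k)
    return assignments, counts


if __name__ == "__main__":
    for alpha in (0.5, 2.0, 10.0):
        _, counts = crp_assignments(500, alpha, seed=1)
        print(f"alpha={alpha}: {len(counts)} clusters")
```

In this sketch the number of clusters is an outcome of sampling rather than an input, which is the property the abstract refers to when it says the model determines the number of aspects automatically. Larger values of `alpha` tend to produce more clusters.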