Topic and Role Discovery in Social Networks with Experiments on Enron and Academic Email
Citations Over TimeTop 1% of 2007 papers
Abstract
Previous work in social network analysis (SNA) has modeled the existence of links from one entity to another, but not the attributes such as language content or topics on those links. We present the Author-Recipient-Topic (ART) model for social network analysis, which learns topic distributions based on the direction-sensitive messages sent between entities. The model builds on Latent Dirichlet Allocation (LDA) and the Author-Topic (AT) model, adding the key attribute that distribution over topics is conditioned distinctly on both the sender and recipient---steering the discovery of topics according to the relationships between people. We give results on both the Enron email corpus and a researcher's email archive, providing evidence not only that clearly relevant topics are discovered, but that the ART model better predicts people's roles and gives lower perplexity on previously unseen messages. We also present the Role-Author-Recipient-Topic (RART) model, an extension to ART that explicitly represents people's roles.
Related Papers
- → Topic Modeling on News Articles using Latent Dirichlet Allocation(2022)10 cited
- → Automatic Topic Clustering Using Latent Dirichlet Allocation with Skip-Gram Model on Final Project Abstracts(2017)2 cited
- → Estimating Word Probabilities with Neural Networks in Latent Dirichlet Allocation(2017)1 cited
- → Latent Dirichlet Allocation (LDA) and Topic modeling: models, applications, a survey(2017)164 cited
- → Topic Modelling of Swedish Newspaper Articles about Coronavirus: a Case Study using Latent Dirichlet Allocation Method(2023)2 cited