Separate and inequal
Citations Over Time: Top 23% of 2008 papers
Abstract
Web pages, like people, are often known by others in a variety of contexts. When those contexts are sufficiently distinct, a page's importance may be better represented by multiple domains of authority, rather than by one that indiscriminately mixes reputations. In this work we determine domains of authority by examining the contexts in which a page is cited. However, we find that it is not enough to determine separate domains of authority; our model additionally determines the local flow of authority based upon the relative similarity of the source and target authority domains. In this way, we differentiate both incoming and outgoing hyperlinks by topicality and importance rather than treating them indiscriminately. We find that this approach compares favorably to other topical ranking methods on two real-world datasets and produces an approximately 10% improvement in precision and quality of the top ten results over PageRank.
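To make the idea of similarity-dependent authority flow concrete, the following is a minimal sketch and not the authors' model. It assumes each page already has a topic-distribution vector (how those vectors are obtained is outside this sketch), uses cosine similarity as a stand-in for "relative similarity of the source and target authority domains", and plugs the resulting link weights into an ordinary damped PageRank-style iteration. The function names `cosine` and `similarity_weighted_rank`, the damping value, and the uniform fallback are all illustrative assumptions.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length topic vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def similarity_weighted_rank(links, topics, damping=0.85, iters=50):
    """PageRank-style scores where each outgoing link is weighted by topical similarity.

    links:  dict mapping a page to an iterable of pages it links to (hypothetical input format)
    topics: dict mapping every page to its topic vector (assumed to be given)
    """
    pages = sorted(topics)
    n = len(pages)

    # Precompute, for each source page, a normalized weight per outgoing link.
    weights = {}
    for src in pages:
        targets = set(links.get(src, ()))
        raw = {dst: cosine(topics[src], topics[dst]) for dst in targets}
        total = sum(raw.values())
        if total > 0:
            weights[src] = {dst: w / total for dst, w in raw.items()}
        elif targets:
            # Assumption: if no target is topically similar, split evenly.
            weights[src] = {dst: 1.0 / len(targets) for dst in targets}
        else:
            weights[src] = {}

    rank = {p: 1.0 / n for p in pages}
    for _ in range(iters):
        # Teleportation share, plus the mass of pages with no outgoing links.
        dangling = sum(rank[p] for p in pages if not weights[p])
        new_rank = {p: (1.0 - damping) / n + damping * dangling / n for p in pages}
        # Distribute each page's score along its similarity-weighted links.
        for src, outs in weights.items():
            for dst, w in outs.items():
                new_rank[dst] += damping * rank[src] * w
        rank = new_rank
    return rank
```

In this sketch a page passes more of its score to targets whose topic vectors resemble its own, so a link between topically unrelated pages carries little authority. The paper's model additionally keeps separate authority scores per domain, which this single-score simplification does not attempt.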