Fast generation of result snippets in web search
Abstract
The presentation of query-biased document snippets on the results pages of search engines has become an expectation of search engine users. In this paper we explore the algorithms and data structures a search engine requires to generate query-biased snippets efficiently. We begin by proposing and analysing a document compression method that reduces snippet generation time by 58% over a baseline using the zlib compression library. These experiments reveal that finding documents on secondary storage dominates the total cost of generating snippets, and so caching documents in RAM is essential for a fast snippet generation process. Using simulation, we examine snippet generation performance for RAM caches of different sizes. Finally, we propose and analyse document reordering and compaction, revealing a scheme that increases the number of document cache hits with only a marginal effect on snippet quality. This scheme effectively doubles the number of documents that can fit in a fixed-size cache.
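To make the closing claim concrete, the following is a minimal sketch of why halving document size can raise the hit rate of a fixed-size cache. It is not the paper's simulation: the LRU eviction policy, the class `ByteBudgetLRU`, the helper `hit_rate`, the document sizes, and the cyclic request stream are all assumptions chosen for illustration.

```python
from collections import OrderedDict


class ByteBudgetLRU:
    """Hypothetical LRU document cache bounded by total bytes, not entry count."""

    def __init__(self, capacity_bytes):
        self.capacity = capacity_bytes
        self.used = 0
        self.docs = OrderedDict()  # doc_id -> size in bytes, oldest first
        self.hits = 0
        self.misses = 0

    def access(self, doc_id, size):
        """Record one request; return True on a cache hit."""
        if doc_id in self.docs:
            self.docs.move_to_end(doc_id)  # mark as most recently used
            self.hits += 1
            return True
        self.misses += 1
        if size > self.capacity:
            return False  # document can never fit, so do not cache it
        # Evict least recently used documents until the new one fits.
        while self.used + size > self.capacity:
            _, evicted_size = self.docs.popitem(last=False)
            self.used -= evicted_size
        self.docs[doc_id] = size
        self.used += size
        return False


def hit_rate(requests, sizes, capacity_bytes):
    """Replay a request stream and return the fraction of cache hits."""
    cache = ByteBudgetLRU(capacity_bytes)
    for doc_id in requests:
        cache.access(doc_id, sizes[doc_id])
    return cache.hits / len(requests)


# Toy workload (assumed): four documents requested in a repeating cycle.
requests = [0, 1, 2, 3] * 5
full_sizes = {d: 100 for d in range(4)}     # uncompacted documents
compact_sizes = {d: 50 for d in range(4)}   # compacted to half the size

full_rate = hit_rate(requests, full_sizes, capacity_bytes=200)
compact_rate = hit_rate(requests, compact_sizes, capacity_bytes=200)
print(full_rate, compact_rate)
```

In this toy setup the 200-byte cache holds only two full-size documents, so the cyclic stream never hits, whereas all four compacted documents fit and every request after the first pass is a hit. Real query logs are far less adversarial, so the gain in practice depends on the request distribution.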