[ספר][B] An introduction to information retrieval

CD Manning - 2009‏ - edl.emi.gov.et
As recently as the 1990s, studies showed that most people preferred getting information
from other people rather than from information retrieval systems. Of course, in that time …

Faster top-k document retrieval using block-max indexes

S Ding, T Suel - Proceedings of the 34th international ACM SIGIR …, 2011‏ - dl.acm.org
Large search engines process thousands of queries per second over billions of documents,
making query processing a major performance bottleneck. An important class of optimization …

Quasi-succinct indices

S Vigna - Proceedings of the sixth ACM international conference …, 2013‏ - dl.acm.org
Compressed inverted indices in use today are based on the idea of gap compression:
documents pointers are stored in increasing order, and the gaps between successive …

Scalability challenges in web search engines

BB Cambazoglu, R Baeza-Yates - Advanced topics in information retrieval, 2011‏ - Springer
Continuous growth of the Web and user bases forces web search engine companies to
make costly investments on very large compute infrastructures. The scalability of these …

Examining the additivity of top-k query processing innovations

J Mackenzie, A Moffat - Proceedings of the 29th ACM International …, 2020‏ - dl.acm.org
Research activity spanning more than five decades has led to index organizations,
compression schemes, and traversal algorithms that allow extremely rapid response to …

Compressing inverted indexes with recursive graph bisection: A reproducibility study

J Mackenzie, A Mallia, M Petri, JS Culpepper… - Advances in Information …, 2019‏ - Springer
Document reordering is an important but often overlooked preprocessing stage in index
construction. Reordering document identifiers in graphs and inverted indexes has been …

Scalable techniques for document identifier assignment in inverted indexes

S Ding, J Attenberg, T Suel - … of the 19th international conference on …, 2010‏ - dl.acm.org
Web search engines depend on the full-text inverted index data structure. Because the
query processing performance is so dependent on the size of the inverted index, a plethora …

Faster index reordering with bipartite graph partitioning

J Mackenzie, M Petri, A Moffat - … of the 44th International ACM SIGIR …, 2021‏ - dl.acm.org
We revisit the Bipartite Graph Partitioning approach to document reordering (Dhulipala et al.,
KDD 2016), and consider a range of algorithmic and heuristic refinements that lead to faster …

Tradeoff options for bipartite graph partitioning

J Mackenzie, M Petri, A Moffat - IEEE Transactions on …, 2022‏ - ieeexplore.ieee.org
Web connectivity graphs and similar linked data such as inverted indexes are important
components of the information access systems provided by social media and web search …

Compact full-text indexing of versioned document collections

J He, H Yan, T Suel - Proceedings of the 18th ACM conference on …, 2009‏ - dl.acm.org
We study the problem of creating highly compressed full-text index structures for versioned
document collections, that is, collections that contain multiple versions of each document …