[BOOK][B] Modern information retrieval

R Baeza-Yates, B Ribeiro-Neto - 1999 - people.ischool.berkeley.edu
Information retrieval (IR) has changed considerably in recent years with the expansion of the
World Wide Web and the advent of modern and inexpensive graphical user interfaces and …

Data-Centric Systems and Applications

MJ Carey, S Ceri, P Bernstein, U Dayal, C Faloutsos… - Italy: Springer, 2006 - Springer
The rapid growth of the Web in the past two decades has made it the largest publicly
accessible data source in the world. Web mining aims to discover useful information or …

[BOOK][B] Data-intensive text processing with MapReduce

J Lin, C Dyer - 2022 - books.google.com
Our world is being revolutionized by data-driven methods: access to large amounts of data
has generated new insights and opened exciting new opportunities in commerce, science …

An experimental study of bitmap compression vs. inverted list compression

J Wang, C Lin, Y Papakonstantinou… - Proceedings of the 2017 …, 2017 - dl.acm.org
Bitmap compression has been studied extensively in the database area and many efficient
compression schemes were proposed, e.g., BBC, WAH, EWAH, and Roaring. Inverted list …

Composite hashing with multiple information sources

D Zhang, F Wang, L Si - Proceedings of the 34th international ACM …, 2011 - dl.acm.org
Similarity search applications with a large amount of text and image data demand an
efficient and effective solution. One useful strategy is to represent the examples in databases …

Earlybird: Real-time search at Twitter

M Busch, K Gade, B Larson, P Lok… - 2012 IEEE 28th …, 2012 - ieeexplore.ieee.org
The web today is increasingly characterized by social and real-time signals, which we
believe represent two frontiers in information retrieval. In this paper, we present Earlybird …

Mining query logs: Turning search usage data into knowledge

F Silvestri - Foundations and Trends® in Information …, 2009 - nowpublishers.com
Web search engines have stored in their logs information about users since they started to
operate. This information often serves many purposes. The primary focus of this survey is on …

Searching web data: An entity retrieval and high-performance indexing model

R Delbru, S Campinas, G Tummarello - Journal of Web Semantics, 2012 - Elsevier
More and more (semi) structured information is becoming available on the web in the form of
documents embedding metadata (e.g., RDF, RDFa, Microformats and others). There are …

[BOOK][B] Web information retrieval

S Ceri, A Bozzon, M Brambilla, E Della Valle… - 2013 - books.google.com
With the proliferation of huge amounts of (heterogeneous) data on the Web, the importance
of information retrieval (IR) has grown considerably over the last few years. Big players in …

Scalability challenges in web search engines

BB Cambazoglu, R Baeza-Yates - Advanced topics in information retrieval, 2011 - Springer
Continuous growth of the Web and user bases forces web search engine companies to
make costly investments on very large compute infrastructures. The scalability of these …