Scalability challenges in web search engines

BB Cambazoglu, R Baeza-Yates - Advanced topics in information retrieval, 2011 - Springer
Continuous growth of the Web and user bases forces web search engine companies to
make costly investments on very large compute infrastructures. The scalability of these …

[HTML][HTML] Power to the learner: Towards human-intuitive and integrative recommendations with open educational resources

S Bulathwela, M Pérez-Ortiz, E Yilmaz, J Shawe-Taylor - Sustainability, 2022 - mdpi.com
Educational recommenders have received much less attention in comparison with e-
commerce-and entertainment-related recommenders, even though efficient intelligent tutors …

EI-LSH: An early-termination driven I/O efficient incremental c-approximate nearest neighbor search

W Liu, H Wang, Y Zhang, W Wang, L Qin, X Lin - The VLDB Journal, 2021 - Springer
Nearest neighbor in high-dimensional space has been widely used in various fields such as
databases, data mining and machine learning. The problem has been well solved in low …

A toolbox for modelling engagement with educational videos

Y Qiu, K Djemili, D Elezi, AS Srazali… - Proceedings of the …, 2024 - ojs.aaai.org
With the advancement and utility of Artificial Intelligence (AI), personalising education to a
global population could be a cornerstone of new educational systems in the future. This …

Efficient term proximity search with term-pair indexes

H Yan, S Shi, F Zhang, T Suel, JR Wen - Proceedings of the 19th ACM …, 2010 - dl.acm.org
There has been a large amount of research on early termination techniques in web search
and information retrieval. Such techniques return the top-k documents without scanning and …

Peek: A large dataset of learner engagement with educational videos

S Bulathwela, M Perez-Ortiz, E Novak, E Yilmaz… - arxiv preprint arxiv …, 2021 - arxiv.org
Educational recommenders have received much less attention in comparison to e-
commerce and entertainment-related recommenders, even though efficient intelligent tutors …

How good is a span of terms? Exploiting proximity to improve web retrieval

KM Svore, PH Kanani, N Khan - … of the 33rd international ACM SIGIR …, 2010 - dl.acm.org
Ranking search results is a fundamental problem in information retrieval. In this paper we
explore whether the use of proximity and phrase information can improve web retrieval …

Compressing term positions in web indexes

H Yan, S Ding, T Suel - Proceedings of the 32nd international ACM …, 2009 - dl.acm.org
Large search engines process thousands of queries per second on billions of pages,
making query processing a major factor in their operating costs. This has led to a lot of …

Fast first-phase candidate generation for cascading rankers

Q Wang, C Dimopoulos, T Suel - … of the 39th International ACM SIGIR …, 2016 - dl.acm.org
Current search engines use very complex ranking functions based on hundreds of features.
While such functions return high-quality results, they create efficiency challenges as it is too …

Upper-bound approximations for dynamic pruning

C Macdonald, I Ounis, N Tonellotto - ACM Transactions on Information …, 2011 - dl.acm.org
Dynamic pruning strategies for information retrieval systems can increase querying
efficiency without decreasing effectiveness by using upper bounds to safely omit scoring …