- Academic Search

A Stupar, S Michel, R Schenkel - Large-Scale Distributed Systems for …, 2010 - Citeseer

We consider the problem of processing K-Nearest Neighbor (KNN) queries over large data
sets where the index is jointly maintained by a set of machines in a computing cluster. The …

Spara Citera Citerat av 147 Relaterade artiklar Alla 10 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] uniroma1.it

Max-cover in map-reduce

F Chierichetti, R Kumar, A Tomkins - Proceedings of the 19th …, 2010 - dl.acm.org

The NP-hard Max-k-cover problem requires selecting k sets from a collection so as to
maximize the size of the union. This classic problem occurs commonly in many settings in …

Spara Citera Citerat av 142 Relaterade artiklar Alla 11 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] gla.ac.uk

MapReduce indexing strategies: Studying scalability and efficiency

R McCreadie, C Macdonald, I Ounis - Information Processing & …, 2012 - Elsevier

In Information Retrieval (IR), the efficient indexing of terabyte-scale and larger corpora is still
a difficult problem. MapReduce has been proposed as a framework for distributing data …

Spara Citera Citerat av 120 Relaterade artiklar Alla 12 versionerna

An efficient data mining framework on Hadoop using Java persistence API

Y Lai, S ZhongZhi - 2010 10th IEEE International Conference …, 2010 - ieeexplore.ieee.org

Data indexing is common in data mining when working with high-dimensional, large-scale
data sets. Hadoop, a cloud computing project using the MapReduce framework in Java, has …

Spara Citera Citerat av 83 Relaterade artiklar Alla 4 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

[PDF][PDF] University of Glasgow at TREC 2009: Experiments with Terrier.

R McCreadie, C Macdonald, I Ounis, J Peng… - TREC, 2009 - academia.edu

In TREC 2009, we extend our Voting Model for the faceted blog distillation, top stories
identification, and related entity finding tasks. Moreover, we experiment with our novel …

Spara Citera Citerat av 66 Relaterade artiklar Alla 7 versionerna Se som HTML-version

DH-TRIE frequent pattern mining on Hadoop using JPA

L Yang, Z Shi, LD Xu, F Liang… - 2011 IEEE International …, 2011 - ieeexplore.ieee.org

The FPgrowth is a famous frequent pattern's algorithm in data mining when working with
high-dimensional, large-scale data sets. It is also known as great complexity on memory for …

Spara Citera Citerat av 32 Relaterade artiklar Alla 3 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] conicet.gov.ar

BSP cost and scalability analysis for MapReduce operations

H Senger, V Gil‐Costa, L Arantes… - Concurrency and …, 2016 - Wiley Online Library

Data abundance poses the need for powerful and easy‐to‐use tools that support processing
large amounts of data. MapReduce has been increasingly adopted for over a decade by …

Spara Citera Citerat av 22 Relaterade artiklar Alla 9 versionerna

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

[PDF][PDF] A new data mining algorithm based on MapReduce and Hadoop

H **nxiang, X Henan - Int. J. Signal Proc. Image Process. Pattern …, 2014 - academia.edu

The goal of data mining is to discover hidden useful information in large databases. Mining
frequent patterns from transaction databases is an important problem in data mining. As the …

Spara Citera Citerat av 25 Relaterade artiklar Alla 3 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] academia.edu

[PDF][PDF] Comparing distributed indexing: To MapReduce or not?

R McCreadie, C Macdonald, I Ounis - Proc. LSDS-IR, 2009 - academia.edu

Information Retrieval (IR) systems require input corpora to be indexed. The advent of
terabyte-scale Web corpora has reinvigorated the need for efficient indexing. In this work, we …

Spara Citera Citerat av 40 Relaterade artiklar Alla 12 versionerna Se som HTML-version

[Free GPT-4]
[DeepSeek]

[PDF] psu.edu

Indexing word sequences for ranked retrieval

S Huston, JS Culpepper, WB Croft - ACM Transactions on Information …, 2014 - dl.acm.org

Formulating and processing phrases and other term dependencies to improve query
effectiveness is an important problem in information retrieval. However, accessing word …

Spara Citera Citerat av 17 Relaterade artiklar Alla 9 versionerna

Skapa alarm

Citera

Avancerad sökning

Har sparats i Mitt bibliotek

On single-pass indexing with MapReduce

[PDF][PDF] Rankreduce–processing k-nearest neighbor queries on top of mapreduce

Max-cover in map-reduce

MapReduce indexing strategies: Studying scalability and efficiency

An efficient data mining framework on Hadoop using Java persistence API

[PDF][PDF] University of Glasgow at TREC 2009: Experiments with Terrier.

DH-TRIE frequent pattern mining on Hadoop using JPA

BSP cost and scalability analysis for MapReduce operations

[PDF][PDF] A new data mining algorithm based on MapReduce and Hadoop

[PDF][PDF] Comparing distributed indexing: To MapReduce or not?

Indexing word sequences for ranked retrieval