Big Data with Cloud Computing: an insight on the computing environment, MapReduce, and programming frameworks

A Fernández, S del Río, V López… - … : Data Mining and …, 2014 - Wiley Online Library
The term 'Big Data'has spread rapidly in the framework of Data Mining and Business
Intelligence. This new scenario can be defined by means of those problems that cannot be …

kNN-IS: An Iterative Spark-based design of the k-Nearest Neighbors classifier for big data

J Maillo, S Ramírez, I Triguero, F Herrera - Knowledge-Based Systems, 2017 - Elsevier
Abstract The k-Nearest Neighbors classifier is a simple yet effective widely renowned
method in data mining. The actual application of this model in the big data domain is not …

MRPR: A MapReduce solution for prototype reduction in big data classification

I Triguero, D Peralta, J Bacardit, S García, F Herrera - neurocomputing, 2015 - Elsevier
In the era of big data, analyzing and extracting knowledge from large-scale data sets is a
very interesting and challenging task. The application of standard data mining tools in such …

Evolutionary feature selection for big data classification: A mapreduce approach

D Peralta, S Del Río, S Ramírez-Gallego… - Mathematical …, 2015 - Wiley Online Library
Nowadays, many disciplines have to deal with big datasets that additionally involve a high
number of features. Feature selection methods aim at eliminating noisy, redundant, or …

Representing web graphs

S Raghavan, H Garcia-Molina - Proceedings 19th International …, 2003 - ieeexplore.ieee.org
A Web repository is a large special-purpose collection of Web pages and associated
indexes. Many useful queries and computations over such repositories involve traversal and …

Hierarchical attribute reduction algorithms for big data using MapReduce

J Qian, P Lv, X Yue, C Liu, Z **g - Knowledge-Based Systems, 2015 - Elsevier
Attribute reduction is one of the important research issues in rough set theory. Most existing
attribute reduction algorithms are now faced with two challenging problems. On one hand …

A mapreduce-based k-nearest neighbor approach for big data classification

J Maillo, I Triguero, F Herrera - 2015 IEEE Trustcom/BigDataSE …, 2015 - ieeexplore.ieee.org
The k-Nearest Neighbor classifier is one of the most well known methods in data mining
because of its effectiveness and simplicity. Due to its way of working, the application of this …

Quickfoil: Scalable inductive logic programming

Q Zeng, JM Patel, D Page - Proceedings of the VLDB Endowment, 2014 - dl.acm.org
Inductive Logic Programming (ILP) is a classic machine learning technique that learns first-
order rules from relational-structured data. However, to-date most ILP systems can only be …

Parallel attribute reduction algorithms using MapReduce

J Qian, D Miao, Z Zhang, X Yue - Information Sciences, 2014 - Elsevier
Attribute reduction is the key technique for knowledge acquisition in rough set theory.
However, it is still a challenging task to perform attribute reduction on massive data. During …

Parallel incremental efficient attribute reduction algorithm based on attribute tree

W Ding, T Qin, X Shen, H Ju, H Wang, J Huang, M Li - Information Sciences, 2022 - Elsevier
Attribute reduction is an important application of rough sets. Efficiently reducing massive
dynamic data sets quickly has always been a major goal of researchers. Traditional …