kNN-IS: an iterative spark-based design of the k-nearest neighbors classifier for big data
Maillo, Jesus and Ramirez, Sergio and Triguero, Isaac and Herrera, Francisco (2016) kNN-IS: an iterative spark-based design of the k-nearest neighbors classifier for big data. Knowledge-Based Systems . ISSN 1872-7409 (In Press)
The k-Nearest Neighbors classifier is a simple yet effective widely renowned method in data mining. The actual application of this model in the big data domain is not feasible due to time and memory restrictions. Several distributed alternatives based on MapReduce have been proposed to enable this method to handle large-scale data. However, their performance can be further improved with new designs that fit with newly arising technologies.
Actions (Archive Staff Only)