Skip to main navigation Skip to search Skip to main content

Density-based Algorithms for Big Data Clustering Using MapReduce Framework

  • Princess Sumaya University for Technology

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

Clustering is used to extract hidden patterns and similar groups from data. Therefore, clustering as a method of unsupervised learning is a crucial technique for big data analysis owing to the massive number of unlabeled objects involved. Density-based algorithms have attracted research interest, because they help to better understand complex patterns in spatial datasets that contain information about data related to co-located objects. Big data clustering is a challenging task, because the volume of data increases exponentially. However, clustering using MapReduce can help answer this challenge. In this context, density-based algorithms in MapReduce have been largely investigated in the past decade to eliminate the problem of big data clustering. Despite the diversity of the algorithms proposed, the field lacks a structured review of the available algorithms and techniques for desirable partitioning, local clustering, and merging. This study formalizes the problem of density-based clustering using MapReduce, proposes a taxonomy to categorize the proposed algorithms, and provides a systematic and comprehensive comparison of these algorithms according to the partitioning technique, type of local clustering, merging technique, and exactness of their implementations. Finally, the study highlights outstanding challenges and opportunities to contribute to the field of density-based clustering using MapReduce.

Original languageEnglish
Article number3403951
JournalACM Computing Surveys
Volume53
Issue number5
DOIs
StatePublished - 30 Sep 2021

Keywords

  • Big data
  • clustering
  • density clustering
  • mapreduce framework

Fingerprint

Dive into the research topics of 'Density-based Algorithms for Big Data Clustering Using MapReduce Framework'. Together they form a unique fingerprint.

Cite this