Abstract: MapReduce-based clustering of a large dataset is gaining rapid importance in the fields of data science. Its main task is to make groups of data from the large dataset such that all data ...