Clustering large datasets using K-means modified inter and intra clustering (KM-I2C) in Hadoop

Abstract Big data has become popular for processing, storing and managing massive volumes of data. The clustering of datasets has become a challenging issue in the field of big data analytics. The K-means algorithm is best suited for finding similarities between entities based on distance measures w...

Full description

Bibliographic Details
Main Authors: Chowdam Sreedhar, Nagulapally Kasiviswanath, Pakanti Chenna Reddy
Format: Article
Language:English
Published: SpringerOpen 2017-09-01
Series:Journal of Big Data
Subjects:
Online Access:http://link.springer.com/article/10.1186/s40537-017-0087-2