The Application of Clustering Algorithm Based on Improved Canopy-Kmeans in Operators Data

Haoqian Mai; Lianglun Cheng

doi:10.12783/dtetr/iceta2016/7005

Abstract

The Kmeans algorithm is commonly used for user segmentation in operator data, but its k value is difficult to determine. The Canopy algorithm can help Kmeans determine k, but its result is strongly affected by the chosen radius. To address both problems, an improved Canopy-Kmeans algorithm is proposed. First, the initial data are divided into K1 coarse clusters by running the Canopy algorithm with a smaller radius. Next, a split or merge method reconstructs the K1 coarse clusters into K2 convergent clusters (K1 ≫ K2). Finally, the K2 cluster centers are used as the initial centers for the Kmeans algorithm. In simulation experiments, the improved Canopy-Kmeans algorithm performed well in running time, clustering results, and squared error.

Keywords

Canopy Kmeans; clustering; split; merge
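
The three stages named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the split or merge criteria, so the code assumes a single-radius Canopy pass, a hypothetical merge rule (repeatedly average the two closest coarse centers until every pair is farther apart than `merge_dist`), and plain Lloyd iterations for Kmeans. The split step is omitted because no criterion for it is stated. All function and parameter names are illustrative.

```python
import numpy as np

def canopy_centers(X, radius):
    """Coarse pass: simplified single-radius Canopy (assumption)."""
    remaining = np.arange(len(X))
    centers = []
    while remaining.size > 0:
        seed = X[remaining[0]]
        dists = np.linalg.norm(X[remaining] - seed, axis=1)
        inside = dists <= radius          # the seed itself is always inside
        centers.append(X[remaining[inside]].mean(axis=0))
        remaining = remaining[~inside]    # claimed points leave the pool
    return np.array(centers)

def merge_centers(centers, merge_dist):
    """Hypothetical merge rule: average the closest pair until all
    remaining pairs are farther apart than merge_dist."""
    centers = list(centers)
    while len(centers) > 1:
        C = np.array(centers)
        d = np.linalg.norm(C[:, None] - C[None, :], axis=2)
        np.fill_diagonal(d, np.inf)
        i, j = np.unravel_index(np.argmin(d), d.shape)
        if d[i, j] > merge_dist:
            break
        merged = (C[i] + C[j]) / 2
        centers = [c for k, c in enumerate(centers) if k not in (i, j)]
        centers.append(merged)
    return np.array(centers)

def kmeans(X, init_centers, max_iter=100):
    """Plain Lloyd iterations starting from the supplied centers."""
    centers = init_centers.copy()
    for _ in range(max_iter):
        d = np.linalg.norm(X[:, None] - centers[None, :], axis=2)
        labels = d.argmin(axis=1)
        new_centers = np.array([
            X[labels == k].mean(axis=0) if np.any(labels == k) else centers[k]
            for k in range(len(centers))
        ])
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return centers, labels

def improved_canopy_kmeans(X, radius, merge_dist):
    coarse = canopy_centers(X, radius)          # K1 coarse centers
    merged = merge_centers(coarse, merge_dist)  # K2 merged centers
    centers, labels = kmeans(X, merged)         # Kmeans seeded with K2 centers
    return centers, labels, len(coarse), len(merged)
```

Calling `improved_canopy_kmeans(X, radius, merge_dist)` on an `(n, d)` array returns the final centers, the point labels, and the K1 and K2 counts. In this sketch the small `radius` only controls how fine the coarse pass is, while `merge_dist` (an assumed parameter) decides how many clusters survive.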
