Perform clustering of the unlabeled data set. You could use provided initial centroids set or generate your own. Also there could be considered next stopping criteria:
- maximal number of iterations: 100,
- cluster are consistent (no changes in group matrix or centroids on current iteration, which mean that the clusters are balanced).