Sequence Clustering Algorithm Based on Weighed Sequential Pattern Similarity
Indonesian Journal of Electrical Engineering and Computer Science

Abstract
Sequence clustering has become an active issue in the current scientific community. However, the clustering quality is affected heavily by selecting initial clustering centers randomly. In this paper, a new sequence similarity measurement based on weighed sequential patterns is defined. SCWSPS (Sequence Clustering Algorithm Based on Weighed Sequential Pattern Similarity) algorithm is proposed. Sequences with the largest weighted similarity are chosen as the merge objects. The last K-1 synthesis results are deleted from sequence database. Others sequences are divided into K clusters. Moreover, in each cluster, the sequence which has the largest sum of similarities with other sequences is viewed as the updated center. The experimental results and analysis show that the performance of SCWSPS is better than KSPAM and K-means in clustering quality. When the sequence scale is very large, the execution efficiency of SCWSPS is slightly worse than KSPAM and K-means.
Discover Our Library
Embark on a journey through our expansive collection of articles and let curiosity lead your path to innovation.
