-
Density peak clustering algorithm optimized by natural neighbor search
- ZHANG Chunhao, XIE Bin, XU Tongtong, ZHANG Ximei
-
2025, 60(1):
29-44.
doi:10.6040/j.issn.1671-9352.7.2023.4130
-
Abstract
(
52 )
PDF (4969KB)
(
15
)
Save
-
References |
Related Articles |
Metrics
We combine the natural neighbor search algorithm to improve a series of problems of the density peaks clustering(CFSFDP)algorithm, and propose the NaN-CFSFDP algorithm. First, an outlier samples detection method is proposed based on the natural neighbor search algorithm. Then, for the problem that the truncation distance dc is difficult to be taken accurately manually in the CFSFDP algorithm, the calculation of dc is improved in combination with the natural neighbor search algorithm, and the automatic taking of dc is realized. The metric rule of the sample density of the CFSFDP algorithm is redesigned and unified to make it pay more attention to the local information of each sample. Finally, to address the problem that the density peak points in the dataset may be concentrated in dense clusters due to the large density difference between clusters, which leads to cluster loss, the concepts of shared natural neighbors for samples and shared natural neighbors for clusters are proposed to construct a new cluster fusion algorithm. Experimental results on synthetic and real datasets show that the algorithm outperforms or is at least comparable to the comparative method in terms of clustering performance in most cases and has fewer parameters compared to CFSFDP algorithm and its improvements.