[1]李 遥,荀亚玲.基于共享最近邻的客户交易数据聚类算法[J].计算机技术与发展,2022,32(01):73-78.[doi:10. 3969 / j. issn. 1673-629X. 2022. 01. 013]
 LI Yao,XUN Ya-ling.A Customer Transaction Data Clustering Algorithm Based onShared Nearest Neighbors[J].,2022,32(01):73-78.[doi:10. 3969 / j. issn. 1673-629X. 2022. 01. 013]
点击复制

基于共享最近邻的客户交易数据聚类算法()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
32
期数:
2022年01期
页码:
73-78
栏目:
大数据分析与挖掘
出版日期:
2022-01-10

文章信息/Info

Title:
A Customer Transaction Data Clustering Algorithm Based onShared Nearest Neighbors
文章编号:
1673-629X(2022)01-0073-06
作者:
李 遥荀亚玲
太原科技大学 计算机科学与技术学院,山西 太原 030024
Author(s):
LI YaoXUN Ya-ling
School of Computer Science and Technology,Taiyuan University of Science and Technology,Taiyuan 030024,China
关键词:
聚类交易数据客户细分交易树共享最近邻
Keywords:
clusteringtransaction datacustomer segmentationpurchase treeshared nearest neighbor
分类号:
TP301. 6
DOI:
10. 3969 / j. issn. 1673-629X. 2022. 01. 013
摘要:
利用客户交易数据聚类分析,可得到更优异的客户细分效果,有助于企业更详实地了解消费者,制定精准的营销策略。 PurTreeClust 是一种新型的客户交易数据聚类算法,定义了一种新型的度量方式 PurTree 距离,可以很好地分析处理具有层次树结构的交易数据,但未考虑近邻点的影响,仅将交易树分配到距离最近的聚类中心所属类簇,容易出现错误的交易树分配。 该文利用交易树之间的共享最近邻信息,提出一种客户交易数据聚类算法。 该算法在聚类分配时,充分利用共享最近邻,首先分配类簇的从属交易树,然后分配类簇的可能从属交易树,实现聚类分配,可发现更加紧凑清晰的类簇,并避免了交易树错误分配,改善了客户细分效果。 最后采用 6 个真实客户交易数据集进行实验,验证了该算法的有效性。
Abstract:
By clustering analysis of customer transaction data,better customer segmentation effect can be obtained,which is helpful for enterprises to have a more detailed understanding of consumers and develop accurate marketing strategies. As a new clustering algorithm forcustomer transaction data,PurTreeClust defines a new measurement method,PurTree distance,which can analyze and process transactiondata with hierarchical tree structure. However,without considering the influence of neighboring points,only the purchase tree is allocatedto the class cluster belonging to the nearest cluster center,so the wrong purchase tree allocation is prone to occur. We propose a clusteringalgorithm for customer transaction data using the shared nearest neighbors information among purchase trees. The algorithm makes fulluse of the shared nearest neighbors to achieve cluster allocation. Firstly,the subordinate purchase tree of the cluster is allocated,and thenthe possible subordinate purchase tree of the cluster is allocated to realize cluster allocation. It can find more compact and clear clusters,avoid the wrong allocation of the purchase tree, and improve the effect of customer segmentation. Finally, experiments on six realcustomer transaction datasets verify that the proposed algorithm is more effective.

相似文献/References:

[1]蒋璐璐 王适 王宝成 李慧敏 李鑫慧.一种改进的标记分水岭遥感图像分割方法[J].计算机技术与发展,2010,(01):36.
 JIANG Lu-lu,WANG Shi,WANG Bao-cheng,et al.Segmentation of Remote Sensing Image Based on an Improved Labeling Watershed Algorithm[J].,2010,(01):36.
[2]张甜 罗眉 孟晓红 赵宗涛.一种基于状态特征的航天发射故障诊断技术[J].计算机技术与发展,2010,(01):93.
 ZHANG Tian,LUO Mei,MENG Xiao-hong,et al.A Technology in Fault Diagnosis of Spaceflight Launch Based on State Character[J].,2010,(01):93.
[3]王会颖 章义刚.求解聚类问题的改进人工鱼群算法[J].计算机技术与发展,2010,(03):84.
 WANG Hui-ying,ZHANG Yi-gang.An Improved Artificial Fish- Swarm Algorithm of Solving Clustering Analysis Problem[J].,2010,(01):84.
[4]赵敏 倪志伟 刘斌.K—means与朴素贝叶斯在商务智能中的应用[J].计算机技术与发展,2010,(04):179.
 ZHAO Min,NI Zhi-wei,LIU Bin.Application Research of K - Means Clustering and Naive Bayesian Algorithm in Business Intelligence[J].,2010,(01):179.
[5]吴楠 胡学钢.基于聚类分区的序列模式挖掘算法研究[J].计算机技术与发展,2010,(06):109.
 WU Nan,HU Xue-gang.Research on Clustering Partition-Based Approach of Sequential Pattern Mining[J].,2010,(01):109.
[6]耿波 仲红 徐杰 闫娜娜.用关联分析法对负荷预测结果进行二次处理[J].计算机技术与发展,2008,(04):171.
 GENG Bo,ZHONG Hong,XU Jie,et al.Using Correlation Analysis to Treat Load Forecasting Results[J].,2008,(01):171.
[7]游芳 姜建国 张坤.基于二维属性的高维数据聚类算法研究[J].计算机技术与发展,2009,(05):111.
 YOU Fang,JIANG Jian-guo,ZHANG Kun.Cluster- Algorithm Studies Based on Two- Dimensional Attribute Higher - Dimension Data[J].,2009,(01):111.
[8]刘淑英 程国建 彭方.人工神经生长细胞结构网络在医疗诊断的应用[J].计算机技术与发展,2009,(05):231.
 LIU Shu-ying,CHENG Guo-jian,PENG Fang.Applications of Growing Cell Structures of Artificial Neural Network for Medical Diagnosis[J].,2009,(01):231.
[9]范新 沈闻 丁泉勋 沈洁.基于正例和未标文档的半监督分类研究[J].计算机技术与发展,2009,(06):58.
 FAN Xin,SHEN Wen,DING Quan-xun,et al.Research on Semi- Supervised Classification Based on Positive and Unlabeled Text Document[J].,2009,(01):58.
[10]王园园 倪志伟 赵裕啸 伍章俊.基于决策树的模糊聚类评价算法及其应用[J].计算机技术与发展,2009,(09):232.
 WANG Yuan-yuan,NI Zhi-wei,ZHAO Yu-xiao,et al.Fuzzy Clustering Evaluation Algorithm Based on Decision Tree and Application[J].,2009,(01):232.

更新日期/Last Update: 2022-01-10