[1]黄韬 刘胜辉 谭艳娜.基于k-means聚类算法的研究[J].计算机技术与发展,2011,(07):54-57.
 HUANG Tao,LIU Sheng-hui,TAN Yan-na.Research of Clustering Algorithm Based on K-means[J].,2011,(07):54-57.
点击复制

基于k-means聚类算法的研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2011年07期
页码:
54-57
栏目:
智能、算法、系统工程
出版日期:
1900-01-01

文章信息/Info

Title:
Research of Clustering Algorithm Based on K-means
文章编号:
1673-629X(2011)07-0054-04
作者:
黄韬 刘胜辉 谭艳娜
哈尔滨理工大学计算机科学与技术学院
Author(s):
HUANG TaoLIU Sheng-huiTAN Yan-na
Sch.of Computer Sci.and Tech.,Harbin Univ.of Sci.and Tech
关键词:
数据挖掘聚类算法k-means算法
Keywords:
data mining clustering algorithm k-means algorithm
分类号:
TP301.6
文献标志码:
A
摘要:
分析研究聚类分析方法,对多种聚类分析算法进行分析比较,讨论各自的优点和不足,同时针对原k-means算法的聚类结果受随机选取初始聚类中心的影响较大的缺点,提出一种改进算法。通过将对数据集的多次采样,选取最终较优的初始聚类中心,使得改进后的算法受初始聚类中心选择的影响度大大降低;同时,在选取初始聚类中心后,对初值进行数据标准化处理,使聚类效果进一步提高。通过UCI数据集上的数据对新算法Hk-means进行检测,结果显示Hk-means算法比原始的k-means算法在聚类效果上有显著的提高,并对相关领域有借鉴意义
Abstract:
Analyze and research the method of cluster analysis,analyze and compare many kinds of algorithms of cluster analysis,discuss their respective strengths and weaknesses.At the same time,according to the weaknesses of the cluster result of original k-means algorithm is significant influence by selecting the initial cluster centers randomly,a modified algorithm is proposed.Through taking sample many times to data set,choose final superior cluster center,bring down the impact of initial cluster centers to improved algorithm greatly.Simultaneously,the initial data is standadized once the initial cluster center is selected,makes cluster effect improved furthermore.Detecting new algorithm Hk-means through the date of UCI data set,the result shows that Hk-means algorithm is more prominent improved than initial k-means algorithm in cluster effect,and it's useful for conference to relative field

相似文献/References:

[1]项响琴 汪彩梅.基于聚类高维空间算法的离群数据挖掘技术研究[J].计算机技术与发展,2010,(01):120.
 XIANG Xiang-qin,WANG Cai-mei.Study of Outlier Data Mining Based on CLIQUE Algorithm[J].,2010,(07):120.
[2]李雷 丁亚丽 罗红旗.基于规则约束制导的入侵检测研究[J].计算机技术与发展,2010,(03):143.
 LI Lei,DING Ya-li,LUO Hong-qi.Intrusion Detection Technology Research Based on Homing - Constraint Rule[J].,2010,(07):143.
[3]吉同路 柏永飞 王立松.住宅与房地产电子政务中数据挖掘的应用研究[J].计算机技术与发展,2010,(01):235.
 JI Tong-lu,BAI Yong-fei,WANG Li-song.Study and Application of Data Mining in E-government of House and Real Estate Industry[J].,2010,(07):235.
[4]杨静 张楠男 李建 刘延明 梁美红.决策树算法的研究与应用[J].计算机技术与发展,2010,(02):114.
 YANG Jing,ZHANG Nan-nan,LI Jian,et al.Research and Application of Decision Tree Algorithm[J].,2010,(07):114.
[5]赵裕啸 倪志伟 王园园 伍章俊.SQL Server 2005数据挖掘技术在证券客户忠诚度的应用[J].计算机技术与发展,2010,(02):229.
 ZHAO Yu-xiao,NI Zhi-wei,WANG Yuan-yuan,et al.Application of Data Mining Technology of SQL Server 2005 in Customer Loyalty Model in Securities Industry[J].,2010,(07):229.
[6]张笑达 徐立臻.一种改进的基于矩阵的频繁项集挖掘算法[J].计算机技术与发展,2010,(04):93.
 ZHANG Xiao-da,XU Li-zhen.An Advanced Frequent Itemsets Mining Algorithm Based on Matrix[J].,2010,(07):93.
[7]王爱平 王占凤 陶嗣干 燕飞飞.数据挖掘中常用关联规则挖掘算法[J].计算机技术与发展,2010,(04):105.
 WANG Ai-ping,WANG Zhan-feng,TAO Si-gan,et al.Common Algorithms of Association Rules Mining in Data Mining[J].,2010,(07):105.
[8]张广路 雷景生 吴兴惠.一种改进的Apriori关联规则挖掘算法(英文)[J].计算机技术与发展,2010,(06):84.
 ZHANG Guang-lu,LEI Jing-sheng,WU Xing-hui.An Improved Apriori Algorithm for Mining Association Rules[J].,2010,(07):84.
[9]吴楠 胡学钢.基于聚类分区的序列模式挖掘算法研究[J].计算机技术与发展,2010,(06):109.
 WU Nan,HU Xue-gang.Research on Clustering Partition-Based Approach of Sequential Pattern Mining[J].,2010,(07):109.
[10]吴青 傅秀芬.水平分布数据库的正负关联规则挖掘[J].计算机技术与发展,2010,(06):113.
 WU Qing,FU Xiu-fen.Positive and Negative Association Rules Mining on Horizontally Partitioned Database[J].,2010,(07):113.
[11]耿筱媛 张燕平 闫屹.改进的K—means算法在电信客户细分中的应用[J].计算机技术与发展,2008,(05):163.
 GENG Xiao-yuan,ZHANG Yan-ping,YAN Yi.Application of Improved K - means Algorithm Subdivision of Telecom Clients[J].,2008,(07):163.
[12]王鑫 王洪国 王珺 王金枝[].数据挖掘中聚类方法比较研究[J].计算机技术与发展,2006,(10):20.
 WANG Xin,WANG Hong-guo,WANG Jun,et al.Comparison of Clustering Methods in Data Mining[J].,2006,(07):20.
[13]刘华春,候向宁,杨忠. 基于改进K均值算法的入侵检测系统设计[J].计算机技术与发展,2016,26(01):101.
 LIU Hua-chun,HOU Xiang-ning,YANG Zhong. Design of Intrusion Detection System Based on Improved K-means Algorithm[J].,2016,26(07):101.
[14]李 博,李 霞,张 晓,等.MD-KNN 算法在高校精准资助中的应用[J].计算机技术与发展,2020,30(07):91.[doi:10. 3969 / j. issn. 1673-629X. 2020. 07. 020]
 LI Bo,LI Xia,ZHANG Xiao,et al.Application of MD-KNN in Accurate Subsidy of Colleges[J].,2020,30(07):91.[doi:10. 3969 / j. issn. 1673-629X. 2020. 07. 020]
[15]宋冬冬,王 楠,田树耀,等.基于聚类算法的车辆数据挖掘及可视化研究[J].计算机技术与发展,2020,30(10):204.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 036]
 SONG Dong-dong,WANG Nan,TIAN Shu-yao,et al.Research on Vehicle Data Mining and Visualization Based on Clustering Algorithm[J].,2020,30(07):204.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 036]

备注/Memo

备注/Memo:
哈尔滨市后备带头人基金项目(2004AFXXJ039)黄韬(1982-),男,黑龙江人,硕士研究生,研究方向为企业智能计算;刘胜辉,教授,硕士研究生导师,研究方向为计算机集成制造系统,企业智能计算
更新日期/Last Update: 1900-01-01