«上一篇/Previous Article|本期目录/Table of Contents|下一篇/Next Article»

[1]周爱武陈宝楼王琰.K-Means算法的研究与改进[J].计算机技术与发展,2012,(10):101-104.
　ZHOU Ai-wu,CHEN Bao-lou,WANG Yan.Research and Improvement of K-Means Algorithm[J].,2012,(10):101-104.
点击复制

K-Means算法的研究与改进()

分享到：

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:: 2012年10期

页码:: 101-104

栏目:: 智能、算法、系统工程

出版日期:: 1900-01-01

文章信息/Info

Title:: Research and Improvement of K-Means Algorithm

文章编号:: 1673-629X（2012）10-0101-04

作者:: 周爱武陈宝楼王琰; 安徽大学计算机科学与技术学院

Author(s):: ZHOU Ai-wu; CHEN Bao-lou; WANG Yan; College of Computer Science and Technology, Anhui University

关键词:: K—Means算法; 孤立点; 初始聚类中心

Keywords:: K-Means; outlier; initial clustering centers

分类号:: TP301.6

文献标志码:: A

摘要:: K—Means算法是一种基于划分方法的经典聚类算法，已经在很多领域得到广泛的应用。虽然该算法有很多优点，但其也存在自身的局限性，比如需要用户输入聚类簇个数，初始聚类中心是随机性选择的，算法容易陷入局部最优解，对孤立点比较敏感等。文中首先应用统计学中的标准分数对样本进行孤立点分析，然后提出一种新的初始聚类中心确定策略。对改进的算法和原算法分别做实验进行比较，实验结果表明，改进的算法在准确率、收敛速度和稳定性方面都有很大的提高

Abstract:: K-Means algorithm is a classic clustering algorithm based on the classification method has been widely applied in many fields. Although the algorithm has many advantages,there are also their own limitations,such as user input the number of clusters,initial cluster centers is random selection,the algorithm is easy to fall into local optimal solution is more sensitive to outlier and so on. It firstly analyses sample outlier by statistics standard scores,and then puts forward a new strategy to determine the initial clustering centers. Improved algorithm and the original algorithm were doing experiments to compare,the experimental results show that the improved algorithm＇s accuracyrate,convergence speed and stability are improved greatly

相似文献/References:

[1]李睿肖维民.基于孤立点挖掘的异常检测研究[J].计算机技术与发展,2009,(06):168.
　LI Rui,XIAO Wei-min.Research on Anomaly Intrusion Detection Based on Outlier Mining[J].,2009,(10):168.
[2]方杰张结魁周军.基于有向带权图的页面聚类算法研究[J].计算机技术与发展,2009,(09):49.
　FANG Jie,ZHANG Jie-kui,ZHOU Jun.Study on Page Clustering Algorithms Based on Weighted Directed Graph[J].,2009,(10):49.
[3]张义超卢英李炜.RBF网络隐含层节点的优化[J].计算机技术与发展,2009,(01):103.
　ZHANG Yi-chao,LU Ying,LI Wei.RBF Network of Hidden Layer Nodes Optimization[J].,2009,(10):103.
[4]张亚萍胡学钢.基于K-means的朴素贝叶斯分类算法的研究[J].计算机技术与发展,2007,(11):33.
　ZHANG Ya-ping,HU Xue-gang.Research of Naive Bayesian Classification Based on K- means Method[J].,2007,(10):33.
[5]周爱武于亚飞.K-Means聚类算法的研究[J].计算机技术与发展,2011,(02):62.
　ZHOU Ai-wu,YU Ya-fei.The Research about Clustering Algorithm of K-Means[J].,2011,(10):62.

备注/Memo

备注/Memo:: 安徽省教育科研重点计划项目（KJ2009A57）周爱武（1965-），女，副教授，研究方向为数据库与web技术、数据仓库与数据挖掘、信息系统安全；陈宝楼（1987-），男，硕士研究生，研究方向为数据库与web技术、数据挖掘

常用功能

工具/Tools

统计/Statistics

摘要浏览/Viewed637
全文下载/Downloads283
评论/Comments

更新日期/Last Update: 1900-01-01