[1]朱云贺 张春海 张博.基于数据分段的K-means的优化研究[J].计算机技术与发展,2010,(11):130-132.
 ZHU Yun-he,ZHANG Chun-hai,ZHANG Bo.Optimizing Research on K-means Based on Data Partition[J].,2010,(11):130-132.
点击复制

基于数据分段的K-means的优化研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2010年11期
页码:
130-132
栏目:
智能、算法、系统工程
出版日期:
1900-01-01

文章信息/Info

Title:
Optimizing Research on K-means Based on Data Partition
文章编号:
1673-629X(2010)11-0130-03
作者:
朱云贺 张春海 张博
中国海洋大学信息科学与工程学院
Author(s):
ZHU Yun-heZHANG Chun-haiZHANG Bo
College of Information Science & Engineering,Ocean University of China
关键词:
聚类K-meansPK-means聚类中心
Keywords:
clustering K-means PK-means clustering center
分类号:
TP39
文献标志码:
A
摘要:
K-means聚类算法是一种主流的迭代下降聚类算法,收敛于局部最优化状态。由于K-means随机选取k个初始聚类中心,使得聚类结果的有效性随初始输入而波动,为此文中采取一种预处理的方式来选取初始聚类中心。首先在某种范数的意义下,确定相隔最远的两个数据点之间的距离,然后采用数据分段的方法,将数据集分成k段,在每段中选取一个中心,以此来减小聚类结果随初始输入的波动。实验显示优化后的K-means有效地消除了初始输入的影响,并显著地减少了算法迭代次数和聚类误差
Abstract:
The K-means clustering algorithm is one kind of mainstream iterative drop clustering algorithm,which restrains in the partial optimized state.Because K-means randomly selects initial clustering center,which result in the result of clustering is obviously fluctuate along with the initial input.Thus this paper adopts the pretreatment way to select the initial clustering center.First under one kind of norm,calculate out the farthest distance,then use the method of data partition to divide the data set into k section and select a center in each section.The experiment demonstrates the optimizing K-means eliminate the initial input influence,effectively reduced the iteration number of times and clustering error

相似文献/References:

[1]蒋璐璐 王适 王宝成 李慧敏 李鑫慧.一种改进的标记分水岭遥感图像分割方法[J].计算机技术与发展,2010,(01):36.
 JIANG Lu-lu,WANG Shi,WANG Bao-cheng,et al.Segmentation of Remote Sensing Image Based on an Improved Labeling Watershed Algorithm[J].,2010,(11):36.
[2]张甜 罗眉 孟晓红 赵宗涛.一种基于状态特征的航天发射故障诊断技术[J].计算机技术与发展,2010,(01):93.
 ZHANG Tian,LUO Mei,MENG Xiao-hong,et al.A Technology in Fault Diagnosis of Spaceflight Launch Based on State Character[J].,2010,(11):93.
[3]王会颖 章义刚.求解聚类问题的改进人工鱼群算法[J].计算机技术与发展,2010,(03):84.
 WANG Hui-ying,ZHANG Yi-gang.An Improved Artificial Fish- Swarm Algorithm of Solving Clustering Analysis Problem[J].,2010,(11):84.
[4]赵敏 倪志伟 刘斌.K—means与朴素贝叶斯在商务智能中的应用[J].计算机技术与发展,2010,(04):179.
 ZHAO Min,NI Zhi-wei,LIU Bin.Application Research of K - Means Clustering and Naive Bayesian Algorithm in Business Intelligence[J].,2010,(11):179.
[5]吴楠 胡学钢.基于聚类分区的序列模式挖掘算法研究[J].计算机技术与发展,2010,(06):109.
 WU Nan,HU Xue-gang.Research on Clustering Partition-Based Approach of Sequential Pattern Mining[J].,2010,(11):109.
[6]耿波 仲红 徐杰 闫娜娜.用关联分析法对负荷预测结果进行二次处理[J].计算机技术与发展,2008,(04):171.
 GENG Bo,ZHONG Hong,XU Jie,et al.Using Correlation Analysis to Treat Load Forecasting Results[J].,2008,(11):171.
[7]游芳 姜建国 张坤.基于二维属性的高维数据聚类算法研究[J].计算机技术与发展,2009,(05):111.
 YOU Fang,JIANG Jian-guo,ZHANG Kun.Cluster- Algorithm Studies Based on Two- Dimensional Attribute Higher - Dimension Data[J].,2009,(11):111.
[8]刘淑英 程国建 彭方.人工神经生长细胞结构网络在医疗诊断的应用[J].计算机技术与发展,2009,(05):231.
 LIU Shu-ying,CHENG Guo-jian,PENG Fang.Applications of Growing Cell Structures of Artificial Neural Network for Medical Diagnosis[J].,2009,(11):231.
[9]王园园 倪志伟 赵裕啸 伍章俊.基于决策树的模糊聚类评价算法及其应用[J].计算机技术与发展,2009,(09):232.
 WANG Yuan-yuan,NI Zhi-wei,ZHAO Yu-xiao,et al.Fuzzy Clustering Evaluation Algorithm Based on Decision Tree and Application[J].,2009,(11):232.
[10]段凤玲 李龙澍 曹文婷.具有多态特征和聚类处理的蚁群算法[J].计算机技术与发展,2009,(12):77.
 DUAN Feng-ling,LI Long-shu,CAO Wen-ting.Ant Colony Algorithm with Polymorphism and Clustering Processing[J].,2009,(11):77.
[11]范新 沈闻 丁泉勋 沈洁.基于正例和未标文档的半监督分类研究[J].计算机技术与发展,2009,(06):58.
 FAN Xin,SHEN Wen,DING Quan-xun,et al.Research on Semi- Supervised Classification Based on Positive and Unlabeled Text Document[J].,2009,(11):58.
[12]胡磊,蔡红霞,俞涛. 双重聚类的协同过滤算法在智能家居中的应用[J].计算机技术与发展,2017,27(02):100.
 HU Lei,CAI Hong-xia,YU Tao. Application of Collaborative Filtering Recommendation Based on Double Clustering in Smart Home System[J].,2017,27(11):100.
[13]鲍黎明,黄刚. 基于多叉树确定K值的动态K-means聚类算法[J].计算机技术与发展,2017,27(06):41.
 BAO Li-ming,HUANG Gang. A Dynamic Clustering Algorithm of K-means Based onMulti-branches Tree for K-values[J].,2017,27(11):41.
[14]王义武,杨余旺,于天鹏,等.基于 Spark 平台的 K-means 算法的设计与优化[J].计算机技术与发展,2019,29(03):72.[doi:10.3969/ j. issn.1673-629X.2019.03.015]
 WANG Yi-wu,YANG Yu-wang,YU Tian-peng,et al.Design and Optimization of K-means Algorithm Based on Spark Platform[J].,2019,29(11):72.[doi:10.3969/ j. issn.1673-629X.2019.03.015]
[15]贺艳芳,梁书田.优化加权多视角 K-means 聚类算法[J].计算机技术与发展,2019,29(03):81.[doi:10.3969/ j. issn.1673-629X.2019.03.017]
 HE Yan-fang,LIANG Shu-tian.Optimizing Weighted Multi-view K-means Clustering Algorithm[J].,2019,29(11):81.[doi:10.3969/ j. issn.1673-629X.2019.03.017]
[16]辛壮,万良,李均涛.改进的聚类算法在网络异常行为检测中的应用[J].计算机技术与发展,2019,29(03):111.[doi:10.3969/ j. issn.1673-629X.2019.03.024]
 XIN Zhuang,WAN Liang,LI Jun-tao.Application of Improved Clustering Algorithm in Network Abnormal Behavior Detection[J].,2019,29(11):111.[doi:10.3969/ j. issn.1673-629X.2019.03.024]

备注/Memo

备注/Memo:
青岛市科技计划项目(08-1-3-2-jcb)朱云贺(1986-),男,山东菏泽人,硕士研究生,研究方向为软件工程、数据库理论与应用;张春海,教授,硕士生导师,研究方向数据库理论及应用、软件工程、工作流、构件计算
更新日期/Last Update: 1900-01-01