[1]伊瑶瑶,茅苏. Hadoop下的关联规则分析研究[J].计算机技术与发展,2015,25(09):84-88.
 YI Yao-yao,MAO Su. Research on Association Rules Analysis under Hadoop Platform[J].,2015,25(09):84-88.
点击复制

 Hadoop下的关联规则分析研究()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
25
期数:
2015年09期
页码:
84-88
栏目:
智能、算法、系统工程
出版日期:
2015-09-10

文章信息/Info

Title:
 Research on Association Rules Analysis under Hadoop Platform
文章编号:
1673-629X(2015)09-0084-05
作者:
 伊瑶瑶茅苏
 1.南京邮电大学 计算机学院;2.江苏省无线传感网高技术研究重点实验室;3.宽带无线通信与传感网技术教育部重点实验室
Author(s):
 YI Yao-yaoMAO Su
关键词:
 关联规则支持度置信度频繁项集
Keywords:
association rulessupport degreeconfidence degreefrequent item sets
分类号:
TP301.6
文献标志码:
A
摘要:
 Apriori算法是关联规则挖掘中最基本也是最重要的算法之一。但现有的Apriori算法存在对数据库的扫描次数过多,产生了大量的候选项集合,算法执行效率较低,运行时间显著增加等问题。文中针对现有Apriori扫描数据库过于频繁的问题,在Hadoop平台下对Apriori算法进行改进,提出一种改进算法H-Apriori算法。利用并行方法计算频繁项集,该算法将原始数据集按字母排序,减少频繁项集的计算开销,避免反复扫描数据库带来的时间上的消耗,从而提高算法的执行效率。通过与传统Apriori算法的执行时间相比较,实验结果表明,提出的改进算法H-Apriori明显减少了访问数据库的时间,有较高的执行效率。
Abstract:
 Apriori algorithm is one of the most basic and important algorithms in association rules mining. But the existing Apriori algo-rithm scans database many times,generating a large number of candidate item sets,having a low execution efficiency,increasing running time significantly. Based on existing Apriori algorithm scanning database many times,put forward a new method,H-Apriori,which calcu-lates frequent items of association rules under Hadoop platform for improving efficiency of algorithm,reducing time accessing database, and sorts the original datasets,reducing frequent datasets’ computing cost,avoiding the time consumption by repeated scanning database, improving the efficiency of the algorithm. The experimental results show that the improved algorithm H-Apriori significantly reduces the amount of time to access the database with high execution efficiency.

相似文献/References:

[1]李雷 丁亚丽 罗红旗.基于规则约束制导的入侵检测研究[J].计算机技术与发展,2010,(03):143.
 LI Lei,DING Ya-li,LUO Hong-qi.Intrusion Detection Technology Research Based on Homing - Constraint Rule[J].,2010,(09):143.
[2]王爱平 王占凤 陶嗣干 燕飞飞.数据挖掘中常用关联规则挖掘算法[J].计算机技术与发展,2010,(04):105.
 WANG Ai-ping,WANG Zhan-feng,TAO Si-gan,et al.Common Algorithms of Association Rules Mining in Data Mining[J].,2010,(09):105.
[3]张广路 雷景生 吴兴惠.一种改进的Apriori关联规则挖掘算法(英文)[J].计算机技术与发展,2010,(06):84.
 ZHANG Guang-lu,LEI Jing-sheng,WU Xing-hui.An Improved Apriori Algorithm for Mining Association Rules[J].,2010,(09):84.
[4]耿波 仲红 徐杰 闫娜娜.用关联分析法对负荷预测结果进行二次处理[J].计算机技术与发展,2008,(04):171.
 GENG Bo,ZHONG Hong,XU Jie,et al.Using Correlation Analysis to Treat Load Forecasting Results[J].,2008,(09):171.
[5]文拯 梁建武 陈英.关联规则算法的研究[J].计算机技术与发展,2009,(05):56.
 WEN Zheng,LIANG Jian-wu,CHEN Ying.Research of Association Rules Algorithm[J].,2009,(09):56.
[6]王晓宇 秦锋 程泽凯 邹洪侠.关联规则挖掘技术的研究与应用[J].计算机技术与发展,2009,(05):220.
 WANG Xiao-yu,QIN Feng,CHENG Ze-kai,et al.Investigation and Application of Association Rules Mining[J].,2009,(09):220.
[7]陈伟.Apriori算法的优化方法[J].计算机技术与发展,2009,(06):80.
 CHEN Wei.Method of Apriori Algorithm Optimization[J].,2009,(09):80.
[8]吕刚[] 郑诚.基于本体的关联规则在电子商务中的应用[J].计算机技术与发展,2009,(06):250.
 LU Gang,ZHENG Cheng.Association Rules with Ontological Information in E- Commerce[J].,2009,(09):250.
[9]郑春香 韩承双.关联规则研究及在远程教育考试系统中的应用[J].计算机技术与发展,2009,(08):186.
 ZHENG Chun-xiang,HAN Cheng-shuang.Research on Association Rule Mining and Application of Long- Distance Education System[J].,2009,(09):186.
[10]郑春香 韩承双 董甲东.关联规则技术在教学评价中的应用[J].计算机技术与发展,2009,(09):215.
 ZHENG Chun-xiang,HAN Cheng-shuang,DONG Jia-dong.Application of Association Rule Mining in Teaching Appraisal[J].,2009,(09):215.
[11]顾伟[][],傅德胜[][],蔡玮[]. 基于命题逻辑的关联规则挖掘算法[J].计算机技术与发展,2015,25(03):91.
 GU Wei[][],FU De-sheng[][],CAI Wei[]. Association Rules Mining Algorithm Based on Propositional Logic[J].,2015,25(09):91.
[12]吴红星,王浩. 基于Apriori改进算法的企业Web日志挖掘研究[J].计算机技术与发展,2015,25(04):43.
 WU Hong-xing,WANG Hao. Research on Enterprise Web Log Mining Based on Improved Apriori Algorithm[J].,2015,25(09):43.
[13]杨成,杜秀春,康文杰. 基于关联规则挖掘的关键基础设施安全事件分析[J].计算机技术与发展,2015,25(10):154.
 YANG Cheng,DU Xiu-chun,KANG Wen-jie. Analysis of Critical Infrastructure Reports Based on Association Rules Mining[J].,2015,25(09):154.
[14]田亚凯,陈小惠. 改进关联规则算法在医疗监控中的应用[J].计算机技术与发展,2015,25(10):183.
 TIAN Ya-kai,CHEN Xiao-hui. Application of an Improved Algorithm of Association Rules in Health Monitoring Center[J].,2015,25(09):183.
[15]刘木林,朱庆华. 基于Hadoop的关联规则挖掘算法研究--以Apriori算法为例[J].计算机技术与发展,2016,26(07):1.
 LIU Mu-lin,ZHU Qing-hua. Research on Association Rules Mining Algorithm Based on Hadoop-Taking Apriori as an Example[J].,2016,26(09):1.
[16]张珏[][],陈莉[],田建学[]. 面向零售业的关联规则挖掘的研究与实现[J].计算机技术与发展,2016,26(10):146.
 ZHANG Jue[][],CHEN Li[],TIAN Jian-xue[]. Research and Realization of Association Rules Mining in Supermarket[J].,2016,26(09):146.
[17]张永梅,许静,郭莎. 基于堆排序的重要关联规则挖掘算法研究[J].计算机技术与发展,2016,26(12):45.
 ZHANG Yong-mei,XU Jing,GUO Sha. Research on Association Rules Mining Algorithm for Main Target[J].,2016,26(09):45.
[18]秦军[],郝天曙[],董倩倩[]. 基于MapReduce的Apriori算法并行化改进[J].计算机技术与发展,2017,27(04):64.
 QIN Jun[],HAO Tian-shu[],DONG Qian-qian[]. Parallel Improvement of Apriori Algorithm Based on MapReduce[J].,2017,27(09):64.
[19]施海鹰. 基于关联规则挖掘的分类随机游走算法[J].计算机技术与发展,2017,27(09):1.
 SHI Hai-ying. Random-walk Classification Algorithm with Association Rules Mining[J].,2017,27(09):1.

更新日期/Last Update: 2015-10-16