[1]张 润,冯云霞.基于改进 Apriori 算法的肺癌致病因素研究[J].计算机技术与发展,2020,30(02):143-147.[doi:10. 3969 / j. issn. 1673-629X. 2020. 02. 028]
 ZHANG Run,FENG Yun-xia.Research on Pathogenic Factors of Lung Cancer Based on Improved Apriori Algorithm[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2020,30(02):143-147.[doi:10. 3969 / j. issn. 1673-629X. 2020. 02. 028]
点击复制

基于改进 Apriori 算法的肺癌致病因素研究()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
30
期数:
2020年02期
页码:
143-147
栏目:
应用开发研究
出版日期:
2020-02-10

文章信息/Info

Title:
Research on Pathogenic Factors of Lung Cancer Based on Improved Apriori Algorithm
文章编号:
1673-629X(2020)02--0143-05
作者:
张 润冯云霞
青岛科技大学 信息科学技术学院,山东 青岛 266000
Author(s):
ZHANG RunFENG Yun-xia
School of Information Science and Technology,Qingdao University of Science and Technology, Qingdao 266000,China
关键词:
关联规则Apriori算法Hadoop肺癌
Keywords:
association rulesApriori algorithmHadooplung cancer
分类号:
TP391
DOI:
10. 3969 / j. issn. 1673-629X. 2020. 02. 028
摘要:
随着人民生活水平的不断提高,肿瘤疾病的人数在不断增多,其中肺癌是21世纪严重危害人类健康的重大疾病。 面向肺癌电子病历如此庞大的数据量时,传统Apriori算法的串行计算方式需要频繁扫描数据库,会消耗巨大的内存占用 量。 对此,提出一种基于改进Apriori算法的肺癌风险评估因素分析的方法。 运用Hadoop平台实现并行Apriori算法的优 化,应用HBase文件存储系统对海量数据分布式存储以及MapReduce框架进行分布式计算,最后给出基于Hadoop平台和 MapReduce分布式计算模型的执行流程和测试结果。 实验结果表明,改进算法在处理大数据及时有较好的执行效率以及 良好的可扩展性,得出了肺癌的疾病模式与致病因素之间的隐匿规则,从而验证了改进后的Apriori算法对于辅助肺癌临 床实验具有重要的意义。
Abstract:
With the continuous improvement of people’s living standards,the number of cancer diseases is increasing,among which lung cancer is a serious threat to human health in the 21st century. Faced with such a large data volume in electronic medical records of lung cancer,the serial calculation method of traditional Apriori algorithm requires frequent scanning of the database,which will consume huge memory consumption. To this end,a method based on improved Apriori algorithm for lung cancer risk assessment factor analysis is proposed. The Hadoop platform isused to optimize the parallel Apriori algorithm. The HBase file storage system is used to distribute distributed data and the MapReduce framework. Finally,the execution flow and test results based on Hadoop platform and MapReduce distributed computing model are given. The experiment shows that the improved algorithm has better execution efficiency and scalability in dealing with big data in time,and obtains the hidden rules between the disease pattern and the pathogenic factors of lung cancer,thus verifying the improved Apriori algorithm is of great significance for assisting clinical trials to assist lung cancer.

相似文献/References:

[1]李雷 丁亚丽 罗红旗.基于规则约束制导的入侵检测研究[J].计算机技术与发展,2010,(03):143.
 LI Lei,DING Ya-li,LUO Hong-qi.Intrusion Detection Technology Research Based on Homing - Constraint Rule[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2010,(02):143.
[2]王爱平 王占凤 陶嗣干 燕飞飞.数据挖掘中常用关联规则挖掘算法[J].计算机技术与发展,2010,(04):105.
 WANG Ai-ping,WANG Zhan-feng,TAO Si-gan,et al.Common Algorithms of Association Rules Mining in Data Mining[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2010,(02):105.
[3]王晓宇 秦锋 程泽凯 邹洪侠.关联规则挖掘技术的研究与应用[J].计算机技术与发展,2009,(05):220.
 WANG Xiao-yu,QIN Feng,CHENG Ze-kai,et al.Investigation and Application of Association Rules Mining[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):220.
[4]吕刚[] 郑诚.基于本体的关联规则在电子商务中的应用[J].计算机技术与发展,2009,(06):250.
 LU Gang,ZHENG Cheng.Association Rules with Ontological Information in E- Commerce[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):250.
[5]耿波 仲红 徐杰 闫娜娜.隐私保护的关联规则挖掘研究[J].计算机技术与发展,2009,(02):81.
 GENG Bo,ZHONG Hong,XU Jie,et al.Study on Privacy- Preserving Association Rule Mining[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):81.
[6]刘军锋 李景文 陈大克 邓晓斌.一种改进的关联规则自顶向下算法[J].计算机技术与发展,2008,(02):136.
 LIU Jun-feng,LI Jing-wen,CHEN Da-ke,et al.An Improved Top to Bottom Algorithm for Mining Association Rules[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):136.
[7]王伟 高亮 吴涛.基于遗传算法的长频繁项集挖掘方法[J].计算机技术与发展,2008,(04):19.
 WANG Wei,GAO Liang,WU Tao.A Method of Mining Long Frequent Itemset Based on Genetic Algorithm[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):19.
[8]王鸿铭 沈夏炯 李国雁 臧国轻.基于等价类的关联规则挖掘矩阵算法[J].计算机技术与发展,2008,(04):55.
 WANG Hong-ming,SHEN Xia-jiong,LI Guo-yan,et al.An Association Mining Matrix Algorithm Based on Equivalence Class[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):55.
[9]耿波 仲红 徐杰 闫娜娜.用关联分析法对负荷预测结果进行二次处理[J].计算机技术与发展,2008,(04):171.
 GENG Bo,ZHONG Hong,XU Jie,et al.Using Correlation Analysis to Treat Load Forecasting Results[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):171.
[10]黄名选 陈燕红 张师超[].基于关联规则挖掘的查询扩展检索性能研究[J].计算机技术与发展,2008,(10):103.
 HUANG Ming-xuan,CHEN Yan-hong,ZHANG Shi-chao.Studies on Retrieval Performance of Query Expansion Based on Association Rules Mining[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):103.
[11]张广路 雷景生 吴兴惠.一种改进的Apriori关联规则挖掘算法(英文)[J].计算机技术与发展,2010,(06):84.
 ZHANG Guang-lu,LEI Jing-sheng,WU Xing-hui.An Improved Apriori Algorithm for Mining Association Rules[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2010,(02):84.
[12]文拯 梁建武 陈英.关联规则算法的研究[J].计算机技术与发展,2009,(05):56.
 WEN Zheng,LIANG Jian-wu,CHEN Ying.Research of Association Rules Algorithm[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):56.
[13]陈伟.Apriori算法的优化方法[J].计算机技术与发展,2009,(06):80.
 CHEN Wei.Method of Apriori Algorithm Optimization[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):80.
[14]郑春香 韩承双.关联规则研究及在远程教育考试系统中的应用[J].计算机技术与发展,2009,(08):186.
 ZHENG Chun-xiang,HAN Cheng-shuang.Research on Association Rule Mining and Application of Long- Distance Education System[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):186.
[15]郑春香 韩承双 董甲东.关联规则技术在教学评价中的应用[J].计算机技术与发展,2009,(09):215.
 ZHENG Chun-xiang,HAN Cheng-shuang,DONG Jia-dong.Application of Association Rule Mining in Teaching Appraisal[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):215.
[16]向哲 林国龙 杨斌.兴趣度在增量的关联规则挖掘中的研究[J].计算机技术与发展,2009,(10):33.
 XIANG Zhe,LIN Guo-long,YANG Bin.Interestingness Research of Association Rules in Incremental Mining Data[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):33.
[17]王敏 刘希玉.Apriori算法在税务系统中的应用[J].计算机技术与发展,2009,(11):175.
 WANG Min,LIU Xi-yu.Application of Apriori Algorithm in Tax System[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):175.
[18]董彩云 刘培华.数据挖掘技术在远程教育教学中的应用[J].计算机技术与发展,2009,(02):179.
 DONG Cai-yun,LIU Pei-hua.Application of Data Mining Technology in Instance Education[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2009,(02):179.
[19]袁万莲 郑诚 翟明清.一种改进的Apriori算法[J].计算机技术与发展,2008,(05):51.
 YUAN Wan-lian,ZHENG Cheng,ZHAI Ming-qing.An Improvement on Apriori Algorithm[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):51.
[20]陈永平 刘俞 苏新.基于兴趣度的Web页面关联规则的研究[J].计算机技术与发展,2008,(05):86.
 CHEN Yong-ping,LIU Yu,SU Xin.Research of Web Page Association Rules Based on Interesting[J].COMPUTER TECHNOLOGY AND DEVELOPMENT,2008,(02):86.

更新日期/Last Update: 2020-02-10