[1]马婉贞,钱育蓉. 基于标签匹配的协同过滤推荐算法研究[J].计算机技术与发展,2017,27(07):25-28.
 MA Wan-zhen,QIAN Yu-rong. Investigation on Collaborative Filtering Recommendation Algorithm with Tag Matching[J].,2017,27(07):25-28.
点击复制

 基于标签匹配的协同过滤推荐算法研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
27
期数:
2017年07期
页码:
25-28
栏目:
智能、算法、系统工程
出版日期:
2017-07-10

文章信息/Info

Title:
 Investigation on Collaborative Filtering Recommendation Algorithm with Tag Matching
文章编号:
1673-629X(2017)07-0025-04
作者:
 马婉贞钱育蓉
 新疆大学 软件学院
Author(s):
 MA Wan-zhenQIAN Yu-rong
关键词:
 协同过滤算法标签计算HadoopMapReduce标签匹配
Keywords:
 collaborative filtering algorithmtag computingHadoopMapReducetag matching
分类号:
TP301.6
文献标志码:
A
摘要:
 随着微博用户数量的上升,微博信息量成倍增长,基于冗杂的微博信息向微博用户快速推荐感兴趣的好友是不容回避的技术问题.针对这一问题,基于微博大数据,以Hadoop为平台,HBase为基础,MapReduce为编程框架,提出了基于Apriori算法与Item-based协同过滤算法的组合算法,并构建了推荐好友系统.该系统通过Apriori算法对冗杂的微博内容记录进行频繁项集的计算,得出能表达用户喜好的标签,以提升系统的时间性能;通过Item-based算法对标签进行匹配推荐,以缩短系统的推荐时间以及资源占用率.为了验证所构建系统的有效性和可靠性,分别进行了两组对比实验,第一组实验为添加了Apriori算法的协同过滤算法与传统协同过滤算法在时间性能方面的对比测试,第二组实验则为Apriori算法混合Item-based协同过滤算法与混合K-means算法的对比测试.实验结果表明,在庞大的微博容量下,与传统协同过滤算法相比,所提出算法的运行时间缩短了24%~44%;与混合K-means聚类算法相比,所提出算法在算法运行时间和CPU占用率均有1.2~1.5倍的提升.可见,提出的算法可显著缩短推荐时间,减少资源消耗率,提高推荐效率.
Abstract:
 With the rising of micro-blogging users,microblog information capacity has grown rapidly.Fast recommendation of interested friends for micro-blogging users based on the jumbled microblog information becomes inevitable problem.Therefore faced with massive data of microblog,with Hadoop as platform and MapReduce as program frame and based on HBase,a hybrid algorithm of Apriori & Item-based collaborative filtering recommendation algorithm has been proposed and a recommended friends system has been established,in which system computation of frequent item set with massive microblog content records has been conducted to express users’ favorites with tags for promotion of its time performances via Apriori algorithm and thus recommendation of tags has been matched via Item-based algorithm for decrease of recommendation time and occupancy rate of system resource.In order to verify its effectiveness and reliability,two groups of contrast experiments have been conducted,in which the first one involves contrast tests of time performances with collaborative filtering algorithm based on Apriori algorithm vs traditional collaborative filtering algorithm and the other one is composed of contrast tests of hybrid algorithm combined Apriori algorithm with Item-based collaborative filtering algorithm vs hybrid K-means algorithm.The results of contrast experiments show that in large micro-blogging capacity,compared with hybrid K-means clustering algorithm,the proposed algorithm has decreased the running time by 24%~44% and has lifted 1.2~1.5 times in operation time and CPU occupancy rate.Obviously,the time and recommended resource consumption can be greatly reduced and efficiency recommended improved for proposed algorithm.

相似文献/References:

[1]张志宏,吴庆波,邵立松,等.基于飞腾平台TOE协议栈的设计与实现[J].计算机技术与发展,2014,24(07):1.
 ZHANG Zhi-hong,WU Qing-bo,SHAO Li-song,et al. Design and Implementation of TCP/IP Offload Engine Protocol Stack Based on FT Platform[J].,2014,24(07):1.
[2]梁文快,李毅. 改进的基因表达算法对航班优化排序问题研究[J].计算机技术与发展,2014,24(07):5.
 LIANG Wen-kuai,LI Yi. Research on Optimization of Flight Scheduling Problem Based on Improved Gene Expression Algorithm[J].,2014,24(07):5.
[3]黄静,王枫,谢志新,等. EAST文档管理系统的设计与实现[J].计算机技术与发展,2014,24(07):13.
 HUANG Jing,WANG Feng,XIE Zhi-xin,et al. Design and Implementation of EAST Document Management System[J].,2014,24(07):13.
[4]侯善江[],张代远[][][]. 基于样条权函数神经网络P2P流量识别方法[J].计算机技术与发展,2014,24(07):21.
 HOU Shan-jiang[],ZHANG Dai-yuan[][][]. P2P Traffic Identification Based on Spline Weight Function Neural Network[J].,2014,24(07):21.
[5]李璨,耿国华,李康,等. 一种基于三维模型的文物碎片线图生成方法[J].计算机技术与发展,2014,24(07):25.
 LI Can,GENG Guo-hua,LI Kang,et al. A Method of Obtaining Cultural Debris’ s Line Chart Based on Three-dimensional Model[J].,2014,24(07):25.
[6]翁鹤,皮德常. 混沌RBF神经网络异常检测算法[J].计算机技术与发展,2014,24(07):29.
 WENG He,PI De-chang. Chaotic RBF Neural Network Anomaly Detection Algorithm[J].,2014,24(07):29.
[7]刘茜[],荆晓远[],李文倩[],等. 基于流形学习的正交稀疏保留投影[J].计算机技术与发展,2014,24(07):34.
 LIU Qian[],JING Xiao-yuan[,LI Wen-qian[],et al. Orthogonal Sparsity Preserving Projections Based on Manifold Learning[J].,2014,24(07):34.
[8]尚福华,李想,巩淼. 基于模糊框架-产生式知识表示及推理研究[J].计算机技术与发展,2014,24(07):38.
 SHANG Fu-hua,LI Xiang,GONG Miao. Research on Knowledge Representation and Inference Based on Fuzzy Framework-production[J].,2014,24(07):38.
[9]叶偲,李良福,肖樟树. 一种去除运动目标重影的图像镶嵌方法研究[J].计算机技术与发展,2014,24(07):43.
 YE Si,LI Liang-fu,XIAO Zhang-shu. Research of an Image Mosaic Method for Removing Ghost of Moving Targets[J].,2014,24(07):43.
[10]余松平[][],蔡志平[],吴建进[],等. GSM-R信令监测选择录音系统设计与实现[J].计算机技术与发展,2014,24(07):47.
 YU Song-ping[][],CAI Zhi-ping[] WU Jian-jin[],GU Feng-zhi[]. Design and Implementation of an Optional Voice Recording System Based on GSM-R Signaling Monitoring[J].,2014,24(07):47.
[11]叶树鑫[],何聚厚[][]. 协作学习中基于协同过滤的学习资源推荐研究[J].计算机技术与发展,2014,24(10):63.
 YE Shu-xin[],HE Ju-hou[][]. esearch on Learning Material Recommendation Based on Collaborative Filtering Algorithm in Cooperative Learning[J].,2014,24(07):63.
[12]谢人强,陈震. 基于共同评分项和权重计算的推荐算法研究[J].计算机技术与发展,2016,26(09):69.
 XIE Ren-qiang,CHEN Zhen. Research on Recommendation Algorithm Based on Co-rating and Weight Calculation[J].,2016,26(07):69.

更新日期/Last Update: 2017-08-22