[1]于 澍,曹 琦,刘 涛.基于随机森林的微博互动特征分析[J].计算机技术与发展,2019,29(10):51-54.[doi:10. 3969 / j. issn. 1673-629X. 2019. 10. 011]
 YU Shu,CAO Qi,LIU Tao.Analysis of Interactive Characteristics of Weibo Based on Random Forest[J].,2019,29(10):51-54.[doi:10. 3969 / j. issn. 1673-629X. 2019. 10. 011]
点击复制

基于随机森林的微博互动特征分析()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
29
期数:
2019年10期
页码:
51-54
栏目:
应用开发研究
出版日期:
2019-10-10

文章信息/Info

Title:
Analysis of Interactive Characteristics of Weibo Based on Random Forest
文章编号:
1673-629X(2019)10-0051-04
作者:
于 澍曹 琦刘 涛
东北石油大学 计算机与信息技术学院,黑龙江 大庆 163318
Author(s):
YU ShuCAO QiLIU Tao
School of Computer and Information Technology,Northeast Petroleum University,Daqing 163318,China
关键词:
数据挖掘随机森林机器学习数据分析决策树
Keywords:
data miningrandom forestmachine learningdata analysisdecision tree
分类号:
TP181
DOI:
10. 3969 / j. issn. 1673-629X. 2019. 10. 011
摘要:
微博凭借其开放性、低门槛已成为最常用的社交媒体平台之一,其海量数据背后蕴藏着巨大的价值亟待研究。 而准确地判断微博的传播趋势,降低不良微博带来的影响已成为当前面临的主要问题。 文中以新浪微博为研究对象,将随机森林算法与数据分析处理相结合,对微博的博文发布一周后的转评赞行为进行预测,将数据特征分为三类并分析了每类特征对预测结果的影响。 首先,简述了决策树及随机森林算法的原理;其次,对微博数据进行分析,将提取的特征分为用户特征、时间特征和文本类特征三类;最后,通过三组对比实验验证了随机森林算法在微博互动预测上的可行性,并分析了三类特征对预测结果的影响。 实验结果表明,用户特征对预测准确率的影响较大。
Abstract:
Weibo has become one of the most commonly used social media platforms due to its openness and low threshold,and the huge value behind its massive data needs to be studied. To accurately judge the spread trend of Weibo and reduce the impact of bad Weibo has become the main problem. Taking Sina Weibo as the research object,we combine random forest algorithm with data analysis and processing to predict the behavior of the review and praise of Weibo after one week of blog post release. We divide data features into three categories and analyze the influence of each type of features on the predicted results. Firstly,the principle of decision tree and random forest algorithm is briefly described. Secondly,the microblog data is analyzed,and the extracted features are divided into three categories:user feature,time feature and text class feature. Finally,three sets of contrast experiments are verified. The feasibility of the random forest algorithm in the interactive prediction of Weibo,and the influence of the three types of features on the prediction results are analyzed. The experiment shows that the user feature has a greater impact on the accuracy of prediction.

相似文献/References:

[1]项响琴 汪彩梅.基于聚类高维空间算法的离群数据挖掘技术研究[J].计算机技术与发展,2010,(01):120.
 XIANG Xiang-qin,WANG Cai-mei.Study of Outlier Data Mining Based on CLIQUE Algorithm[J].,2010,(10):120.
[2]李雷 丁亚丽 罗红旗.基于规则约束制导的入侵检测研究[J].计算机技术与发展,2010,(03):143.
 LI Lei,DING Ya-li,LUO Hong-qi.Intrusion Detection Technology Research Based on Homing - Constraint Rule[J].,2010,(10):143.
[3]吉同路 柏永飞 王立松.住宅与房地产电子政务中数据挖掘的应用研究[J].计算机技术与发展,2010,(01):235.
 JI Tong-lu,BAI Yong-fei,WANG Li-song.Study and Application of Data Mining in E-government of House and Real Estate Industry[J].,2010,(10):235.
[4]杨静 张楠男 李建 刘延明 梁美红.决策树算法的研究与应用[J].计算机技术与发展,2010,(02):114.
 YANG Jing,ZHANG Nan-nan,LI Jian,et al.Research and Application of Decision Tree Algorithm[J].,2010,(10):114.
[5]赵裕啸 倪志伟 王园园 伍章俊.SQL Server 2005数据挖掘技术在证券客户忠诚度的应用[J].计算机技术与发展,2010,(02):229.
 ZHAO Yu-xiao,NI Zhi-wei,WANG Yuan-yuan,et al.Application of Data Mining Technology of SQL Server 2005 in Customer Loyalty Model in Securities Industry[J].,2010,(10):229.
[6]张笑达 徐立臻.一种改进的基于矩阵的频繁项集挖掘算法[J].计算机技术与发展,2010,(04):93.
 ZHANG Xiao-da,XU Li-zhen.An Advanced Frequent Itemsets Mining Algorithm Based on Matrix[J].,2010,(10):93.
[7]王爱平 王占凤 陶嗣干 燕飞飞.数据挖掘中常用关联规则挖掘算法[J].计算机技术与发展,2010,(04):105.
 WANG Ai-ping,WANG Zhan-feng,TAO Si-gan,et al.Common Algorithms of Association Rules Mining in Data Mining[J].,2010,(10):105.
[8]张广路 雷景生 吴兴惠.一种改进的Apriori关联规则挖掘算法(英文)[J].计算机技术与发展,2010,(06):84.
 ZHANG Guang-lu,LEI Jing-sheng,WU Xing-hui.An Improved Apriori Algorithm for Mining Association Rules[J].,2010,(10):84.
[9]吴楠 胡学钢.基于聚类分区的序列模式挖掘算法研究[J].计算机技术与发展,2010,(06):109.
 WU Nan,HU Xue-gang.Research on Clustering Partition-Based Approach of Sequential Pattern Mining[J].,2010,(10):109.
[10]吴青 傅秀芬.水平分布数据库的正负关联规则挖掘[J].计算机技术与发展,2010,(06):113.
 WU Qing,FU Xiu-fen.Positive and Negative Association Rules Mining on Horizontally Partitioned Database[J].,2010,(10):113.

更新日期/Last Update: 2019-10-10