[1]李梦洁,邵曦.基于文本属性的微博用户相似度研究[J].计算机技术与发展,2018,28(05):17-22.[doi:10.3969/j.issn.1673-629X.2018.05.005]
 LI Meng-jie,SHAO Xi. Research on Micro-blog User Similarity Based on Text Similarity[J].,2018,28(05):17-22.[doi:10.3969/j.issn.1673-629X.2018.05.005]
点击复制

基于文本属性的微博用户相似度研究()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
28
期数:
2018年05期
页码:
17-22
栏目:
智能、算法、系统工程
出版日期:
2018-05-10

文章信息/Info

Title:
 Research on Micro-blog User Similarity Based on Text Similarity
文章编号:
1673-629X(2018)05-0017-06
作者:
李梦洁邵曦
南京邮电大学 通信与信息工程学院,江苏 南京 210003
Author(s):
LI Meng-jieSHAO Xi
School of Communication and Information Engineering,Nanjing University of Posts and Telecommunications,Nanjing 210003,China
关键词:
微博社交网络用户相似度文本相似度余弦相似度层次分析法
Keywords:
 Key words:Micro-blogsocial networkuser similaritytext similaritycosine similarityanalytic hierarchy process
分类号:
TP301
DOI:
10.3969/j.issn.1673-629X.2018.05.005
文献标志码:
A
摘要:
传统的相似度计算方法忽略了用户主观输出的微博文本信息,而这正是体现用户兴趣点的重要元素,只有结合了用户自身信息及其在社交平台上的互动内容,才能相对全面描述一个用户,由此提出基于文本属性的相似用户计算方法。把相似度主要划分为背景相似、兴趣相似两个角度,其中兴趣相似度主要由文本相似度决定,分词并采用 TF-IDF 变换后进行余弦相似度计算;再结合用户所在地、使用设备、发微博的时间、转发关系以及好友关系来描述用户相似度;并利用层次分析法分配各个属性的权重,构建相似度计算模型。通过实验系统性对比了基于文本属性的计算方法与改进前的算法:改进算法得出的用户相似度的 F 1 度量值提高了 34.3%,证明了该算法的优越性。
Abstract:
Traditional similarity calculation method ignores the subjective information of the users,which is an important element that reflects user’s interest point.In order to fully describe the user’s information,the user’s background information and their interactive content on the social platform should be considered.Therefore,we present a calculating method of Micro-blog user similarity,which is bound up with text similarity.The user similarity is mainly divided by the background similarity and interest similarity which is mainly determined by the text similarity.The cosine similarity should be calculated after the word segmentation and TF-IDF.User similarity is also described by user’s location,the device they use,the time they send Weibo,the text they re-post and the relationship between them.Finally,the method uses AHP to determine the weight of each attribute and build an integrated similarity calculation model.Through the experiment,systematically compared with the calculating method of user similarity combined with text similarity and the one before improving,the results show that the former increase the F 1 metric by 34.3%,which shows its superiority.

相似文献/References:

[1]舒琰,向阳,张骐,等.基于PageRank的微博排名MapReduce算法研究[J].计算机技术与发展,2013,(02):73.
 SHU Yan,XIANG Yang,ZHANG Qi,et al.Research on MapReduce Algorithm of Micro Blog Ranking Based on PageRank[J].,2013,(05):73.
[2]潘明慧,牛耘. 基于多线索混合词典的微博情绪识别[J].计算机技术与发展,2014,24(09):28.
 PAN Ming-hui,NIU Yun. Emotion Recognition of Micro-blogs Based on a Hybrid Lexicon[J].,2014,24(05):28.
[3]李桃陶,周斌,王忠振. 基于社交网络的图数据挖掘应用研究[J].计算机技术与发展,2014,24(10):6.
 LI Tao-tao,ZHOU Bin,WANG Zhong-zhen. Research on Graph Data Mining Application Based on Social Network[J].,2014,24(05):6.
[4]张付霞,蒋朝惠. 基于DSNPP算法的社交网络隐私保护方法[J].计算机技术与发展,2015,25(08):152.
 ZHANG Fu-xia,JIANG Chao-hui. Privacy-preserving Approach in Social Networks Based on DSNPP Algorithm[J].,2015,25(05):152.
[5]陈思憬,骆冰清,孙知信.基于混合好友路径信任度的社交好友推荐算法[J].计算机技术与发展,2018,28(02):74.[doi:10.3969/j.issn.1673-629X.2018.02.017]
 CHEN Si-jing,LUO Bing-qing,SUN Zhixin.Social Friend Recommendation Algorithm Based on Trust of Paths between Mixed Friends[J].,2018,28(05):74.[doi:10.3969/j.issn.1673-629X.2018.02.017]
[6]苏小英[][],孟环建[]. 基于神经网络的微博情感分析[J].计算机技术与发展,2015,25(12):161.
 SU Xiao-ying[][],MENG Huan-jian[]. Sentiment Analysis of Micro-blog Based on Neural Networks[J].,2015,25(05):161.
[7]余莎莎[],王友国[],朱亮[]. 基于SIR社交网络中商业谣言传播研究[J].计算机技术与发展,2016,26(11):195.
 YU Sha-sha[],WANG You-guo[],ZHU Liang[]. Research on Online Business Rumors Transmission Based on an Improved SIR Model[J].,2016,26(05):195.
[8]余莎莎[],王友国[],朱亮[]. 基于网络博弈论的谣言扩散建模研究[J].计算机技术与发展,2017,27(04):6.
 YU Sha-sha[],WANG You-guo[],ZHU Liang[]. Investigation on Rumor Diffusion Modeling with Network Game Theory[J].,2017,27(05):6.
[9]陈阳[],邵曦[],赵海博[]. 基于UR-LDA的微博主题挖掘[J].计算机技术与发展,2017,27(06):173.
 CHEN Yang[],SHAO Xi[],ZHAO Hai-bo[]. Microblog Topic Mining Based on UR-LDA[J].,2017,27(05):173.
[10]付明明,余莎莎,应志领. 在线社交网络的双谣言模型研究[J].计算机技术与发展,2017,27(09):53.
 FU Ming-ming,YU Sha-sha,YING Zhi-ling. Research on Double Rumor Model in Online Social Network[J].,2017,27(05):53.
[11]房旋[],陈升波[],宫婧[][],等. 基于社交影响力的推荐算法[J].计算机技术与发展,2016,26(06):31.
 FANG Xuan[],CHEN Sheng-bo[],GONG Jing[][],et al. A Recommendation Algorithm Based on Social Influence[J].,2016,26(05):31.

更新日期/Last Update: 2018-06-26