[1]吴旭东 成卫青 黄卫东.改进的主客观结合的词语语义相似度算法[J].计算机技术与发展,2012,(09):45-49.
 WU Xu-dong,CHENG Wei-qing,HUANG Wei-dong.An Improved Subjective and Objective Combination Method for Measuring Word Semantic Similarity[J].,2012,(09):45-49.
点击复制

改进的主客观结合的词语语义相似度算法()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2012年09期
页码:
45-49
栏目:
智能、算法、系统工程
出版日期:
1900-01-01

文章信息/Info

Title:
An Improved Subjective and Objective Combination Method for Measuring Word Semantic Similarity
文章编号:
1673-629X(2012)09-0045-05
作者:
吴旭东1 成卫青1 黄卫东2
[1]南京邮电大学计算机学院[2]南京邮电大学经济与管理学院
Author(s):
WU Xu-dong CHENG Wei-qing HUANG Wei-dong
[1]College of Computer, Nanjing University of Posts and Telecommunications[2]College of Economics & Management, Nanjing University of Posts and Telecommunications
关键词:
词语语义相似度知网客观相似度主观相似度
Keywords:
word semantic similarity howNet objective similarity subjective similarity
分类号:
TP301.6
文献标志码:
A
摘要:
鉴于词语表达形式与词语语义的多样性,词语语义相似度计算是自然语言处理、智能检索、文档聚类等领域的一个研究热点。文中根据词语表达方式的特点,在基于词语语义词典和基于大规模语料库这两种计算词语语义相似度方法的基础之上,提出一种改进的主观和客观相结合的词语相似度计算方法。从方法论的角度,本算法既融合了主观经验主义思想也融合了客观的理性主义思想,使得词语语义相似度的计算结果能够更加准确。实验结果表明采用文方法是有效的,能够显著提高词语语义相似度计算结果的准确性
Abstract:
In view of the diversity of word expression form and word semantics, the word semantic calculation is a hot research topic in the fields of natural language processing ,intelligent search, document clustering and so on. According to the features of word expression, based on the two methods which is based on word semantic dictionary and the other is based on large-scale corpus to calculate word semanteme, an improved method combining subjective and objective methods to calculate word semantic similarity is proposed. From the point of view of the methodology, the method has combined both subjective experience and objective rationality, making it possible to improve the accuracy of the word semantic similarity. Experimental results show that the proposed method is effective and can significantly improve the accuracy of the word semantic similarity

相似文献/References:

[1]张明宝 马静.一种基于知网的中文词义消歧算法[J].计算机技术与发展,2009,(02):9.
 ZHANG Ming-bao,MA Jing.An Approach to Chinese Word Sense Disambiguation Based on HowNet[J].,2009,(09):9.
[2]魏凯斌 冉延平 余牛.语义相似度的计算方法研究与分析[J].计算机技术与发展,2010,(07):102.
 WEI Kai-bin,RAN Yan-ping,YU Niu.The Research and Analysis of Computing Methods on Semantic Similarity[J].,2010,(09):102.
[3]闫蓉 张蕾.一种新的汉语词义消歧方法[J].计算机技术与发展,2006,(03):22.
 YAN Rong,ZHANG Lei.New Chinese Word Sense Disambiguation Method[J].,2006,(09):22.
[4]周永梅 陶红 陈姣姣 张再跃.自动问答系统中的句子相似度算法的研究[J].计算机技术与发展,2012,(05):75.
 ZHOU Yong-mei,TAO Hong,CHEN Jiao-jiao,et al.Study on Sentence Similarity Approach of Automatic Ask & Answer System[J].,2012,(09):75.
[5]张培颖[],房龙云[]. 多特征结合的词语相似度计算模型[J].计算机技术与发展,2014,24(12):37.
 ZHANG Pei-ying[],FANG Long-yun[]. Word Similarity Computation Model of Multi-features Combination[J].,2014,24(09):37.
[6]赵涛[],张太红[][],陈燕红[]. 中文农业网页去重及相似度判断研究[J].计算机技术与发展,2015,25(01):191.
 ZHAO Tao[],ZHANG Tai-hong[][],CHEN Yan-hong[]. Research on Duplicate Removal and Similarity Evaluation of Chinese Agricultural Web Pages[J].,2015,25(09):191.
[7]王小林,陆骆勇,邰伟鹏. 基于信息熵的新的词语相似度算法研究[J].计算机技术与发展,2015,25(09):119.
 WANG Xiao-lin,LU Luo-yong,TAI Wei-peng. Research of a New Algorithm of Words Similarity Based on Information Entropy[J].,2015,25(09):119.
[8]闫红[],李付学[],周云[]. 基于HowNet句子相似度的计算[J].计算机技术与发展,2015,25(11):53.
 YAN Hong[],LI Fu-xue[],ZHOU Yun[]. Calculation of Sentence Similarity Based on HowNet[J].,2015,25(09):53.
[9]建宇,周爱武,肖云,等. 基于特征空间的文本聚类[J].计算机技术与发展,2017,27(09):75.
 HUANG Jian-yu,ZHOU Ai-wu,XIAO Yun,et al. Text Clustering Based on Feature Space[J].,2017,27(09):75.
[10]殷 硕,王卫亚,柳有权.基于语义特征抽取的文本聚类研究[J].计算机技术与发展,2020,30(03):46.[doi:10. 3969 / j. issn. 1673-629X. 2020. 03. 009]
 YIN Shuo,WANG Wei-ya,LIU You-quan.Research on Text Clustering Based on Semantic Feature Extraction[J].,2020,30(09):46.[doi:10. 3969 / j. issn. 1673-629X. 2020. 03. 009]
[11]李蕾,杨丽花.基于知网的词语语义相似度改进算法[J].计算机技术与发展,2019,29(04):42.[doi:10. 3969 / j. issn. 1673-629X. 2019. 04. 009]
 LI Lei,YANG Li-hua.Improved Algorithm of Word Semantic Similarity Based on HowNet[J].,2019,29(09):42.[doi:10. 3969 / j. issn. 1673-629X. 2019. 04. 009]

备注/Memo

备注/Memo:
国家自然科学基金资助项目(61170322,71171117);软件开发环境国家重点实验室开放课题(SKLSDE-2011KF-OX);江苏省自然科学基金资助项目(BK2010524)吴旭东(1986-),男,硕士生,CCF会员,研究方向为文本聚类和智能检索;成卫青,副教授,博士,研究方向为网络测量
更新日期/Last Update: 1900-01-01