[1]闫红[],李付学[],周云[]. 基于HowNet句子相似度的计算[J].计算机技术与发展,2015,25(11):53-57.
 YAN Hong[],LI Fu-xue[],ZHOU Yun[]. Calculation of Sentence Similarity Based on HowNet[J].,2015,25(11):53-57.
点击复制

 基于HowNet句子相似度的计算()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
25
期数:
2015年11期
页码:
53-57
栏目:
智能、算法、系统工程
出版日期:
2015-11-10

文章信息/Info

Title:
 Calculation of Sentence Similarity Based on HowNet
文章编号:
1673-629X(2015)11-0053-05
作者:
 闫红[1] 李付学[1] 周云[2]
 1.营口理工学院 机电工程系;2.辽宁科技大学 软件学院
Author(s):
 YAN Hong[1] LI Fu-xue[1] ZHOU Yun[2]
关键词:
 知网词语相似度义原句子相似度
Keywords:
 HowNet word similarity sememe sentence similarity
分类号:
TP391.1
文献标志码:
A
摘要:
 汉语句子的相似度计算在自然语言处理领域中是一项基础而又重要的工作,它直接决定着相关领域的研究发展状况.在词语相似度计算的基础上,针对目前句子相似度计算方法的不足,文中提出一种基于HowNet的计算句子相似度的方法.在《知网》的词汇语义相似度计算基础上,加入了词语定义义原间的反义、对义关系、单义原的否定和符号义原、定义信息来计算词语的相似度.计算句子相似度前加入词语的消歧,在计算句子相似度时考虑了词语定义的关系义原与待比较的词定义的某个义原相等的情况,并加大了关系义原的权重.实验结果表明,在同等的测试条件下,所提出的句子相似度计算方法可以提高句子相似度的计算精度,更符合人的直观感觉.
Abstract:
 Chinese sentence similarity computation is a fundamental and important work in the natural language processing. It directly de-termines the status of research and development for certain related fields. Based on the word similarity computing,for the shortcoming of current sentence similarity computing methods,present a method to compute sentence similarity Based on HowNet. Based on the lexical semantic similarity calculation of HowNet,antonyms,negative single sememe,symbol sememe,and definition information are demonstra-ted to calculate word similarity. In this method,word disambiguation is completed before the calculation of sentence similarity. The situa-tion of the similarity between the relation sememe of a word definition and a certain sememe of the given word are considered,and the re-lation sememe weight is added. Under the same test conditions,the experimental results show that the proposed method can improve the computational accuracy of sentence similarity and it is much closer to the people’ s comprehension to the meanings of the sentences.

相似文献/References:

[1]张明宝 马静.一种基于知网的中文词义消歧算法[J].计算机技术与发展,2009,(02):9.
 ZHANG Ming-bao,MA Jing.An Approach to Chinese Word Sense Disambiguation Based on HowNet[J].,2009,(11):9.
[2]魏凯斌 冉延平 余牛.语义相似度的计算方法研究与分析[J].计算机技术与发展,2010,(07):102.
 WEI Kai-bin,RAN Yan-ping,YU Niu.The Research and Analysis of Computing Methods on Semantic Similarity[J].,2010,(11):102.
[3]闫蓉 张蕾.一种新的汉语词义消歧方法[J].计算机技术与发展,2006,(03):22.
 YAN Rong,ZHANG Lei.New Chinese Word Sense Disambiguation Method[J].,2006,(11):22.
[4]周永梅 陶红 陈姣姣 张再跃.自动问答系统中的句子相似度算法的研究[J].计算机技术与发展,2012,(05):75.
 ZHOU Yong-mei,TAO Hong,CHEN Jiao-jiao,et al.Study on Sentence Similarity Approach of Automatic Ask & Answer System[J].,2012,(11):75.
[5]吴旭东 成卫青 黄卫东.改进的主客观结合的词语语义相似度算法[J].计算机技术与发展,2012,(09):45.
 WU Xu-dong,CHENG Wei-qing,HUANG Wei-dong.An Improved Subjective and Objective Combination Method for Measuring Word Semantic Similarity[J].,2012,(11):45.
[6]张志宏,吴庆波,邵立松,等.基于飞腾平台TOE协议栈的设计与实现[J].计算机技术与发展,2014,24(07):1.
 ZHANG Zhi-hong,WU Qing-bo,SHAO Li-song,et al. Design and Implementation of TCP/IP Offload Engine Protocol Stack Based on FT Platform[J].,2014,24(11):1.
[7]梁文快,李毅. 改进的基因表达算法对航班优化排序问题研究[J].计算机技术与发展,2014,24(07):5.
 LIANG Wen-kuai,LI Yi. Research on Optimization of Flight Scheduling Problem Based on Improved Gene Expression Algorithm[J].,2014,24(11):5.
[8]黄静,王枫,谢志新,等. EAST文档管理系统的设计与实现[J].计算机技术与发展,2014,24(07):13.
 HUANG Jing,WANG Feng,XIE Zhi-xin,et al. Design and Implementation of EAST Document Management System[J].,2014,24(11):13.
[9]侯善江[],张代远[][][]. 基于样条权函数神经网络P2P流量识别方法[J].计算机技术与发展,2014,24(07):21.
 HOU Shan-jiang[],ZHANG Dai-yuan[][][]. P2P Traffic Identification Based on Spline Weight Function Neural Network[J].,2014,24(11):21.
[10]李璨,耿国华,李康,等. 一种基于三维模型的文物碎片线图生成方法[J].计算机技术与发展,2014,24(07):25.
 LI Can,GENG Guo-hua,LI Kang,et al. A Method of Obtaining Cultural Debris’ s Line Chart Based on Three-dimensional Model[J].,2014,24(11):25.
[11]张培颖[],房龙云[]. 多特征结合的词语相似度计算模型[J].计算机技术与发展,2014,24(12):37.
 ZHANG Pei-ying[],FANG Long-yun[]. Word Similarity Computation Model of Multi-features Combination[J].,2014,24(11):37.
[12]赵涛[],张太红[][],陈燕红[]. 中文农业网页去重及相似度判断研究[J].计算机技术与发展,2015,25(01):191.
 ZHAO Tao[],ZHANG Tai-hong[][],CHEN Yan-hong[]. Research on Duplicate Removal and Similarity Evaluation of Chinese Agricultural Web Pages[J].,2015,25(11):191.
[13]王小林,陆骆勇,邰伟鹏. 基于信息熵的新的词语相似度算法研究[J].计算机技术与发展,2015,25(09):119.
 WANG Xiao-lin,LU Luo-yong,TAI Wei-peng. Research of a New Algorithm of Words Similarity Based on Information Entropy[J].,2015,25(11):119.
[14]建宇,周爱武,肖云,等. 基于特征空间的文本聚类[J].计算机技术与发展,2017,27(09):75.
 HUANG Jian-yu,ZHOU Ai-wu,XIAO Yun,et al. Text Clustering Based on Feature Space[J].,2017,27(11):75.

更新日期/Last Update: 2015-12-25