[1]付鹏斌,陈帅帅,杨惠荣,等.结合依存关系与同义词词林的相似度计算[J].计算机技术与发展,2020,30(01):13-18.[doi:10. 3969 / j. issn. 1673-629X. 2020. 01. 003]
 FU Peng-bin,CHEN Shuai-shuai,YANG Hui-rong,et al.Similarity Calculation between Dependency Relation and Tongyici Cilin[J].Computer Technology and Development,2020,30(01):13-18.[doi:10. 3969 / j. issn. 1673-629X. 2020. 01. 003]
点击复制

结合依存关系与同义词词林的相似度计算()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
30
期数:
2020年01期
页码:
13-18
栏目:
智能、算法、系统工程
出版日期:
2020-01-10

文章信息/Info

Title:
Similarity Calculation between Dependency Relation and Tongyici Cilin
文章编号:
1673-629X(2020)01-0013-06
作者:
付鹏斌陈帅帅杨惠荣李建君
北京工业大学 信息学部,北京 100124
Author(s):
FU Peng-binCHEN Shuai-shuaiYANG Hui-rongLI Jian-jun
Faculty of Information Technology,Beijing University of Technology,Beijing 100124,China
关键词:
依存关系同义词词林语义相似度关系路径平均偏差率
Keywords:
dependency relationTongyici Cilinsemantic similarityrelationship pathaverage deviation rate
分类号:
TP181
DOI:
10. 3969 / j. issn. 1673-629X. 2020. 01. 003
摘要:
设计了一种基于依存关系与同义词词林相结合的语义相似度计算方法。 该方法通过依存关系分别提取两个文本的关系路径,同时基于同义词词林计算两个文本之间关系路径的语义相似度。 在计算两个文本之间的语义相似度时,使用语言技术平台(language technology platform,LTP)对文本进行中文分词以及获取文本的依存关系图,从中提取关系路径,从而可以结合关系路径和同义词词林计算两个文本之间的语义相似度。 通过实验,获得的平均偏差率为 13. 83%。 实验结果表明,结合依存关系与同义词词林的语义相似度方法在准确率上相比较基于同义词词林的语义相似度和基于依存关系的语义相似度有了一定的提高。
Abstract:
We present a method of calculating semantic similarity based on the combination of dependency relation and Tongyici Cilin. This method extracts the relationship paths of two texts by the dependency relation, and calculates the semantic similarity of the relationship paths between two texts based on Tongyici Cilin. When calculating the semantic similarity between two texts,we use language technology platform (LTP) to segment the Chinese text and obtain the dependency graph of the text,and extract the relationship path from it,so that we can calculate the semantic similarity between the two texts by combining the relationship path and Tongyici Cilin. The average deviation rate is 13. 83% in the experiment which shows that the accuracy of the semantic similarity method based on the dependency relation and Tongyici Cilin is better than that based on Tongyici Cilin and based on the dependency relation.

相似文献/References:

[1]陶新竹,赵鹏,刘涛.融合核心句与依存关系的评价搭配抽取[J].计算机技术与发展,2014,24(01):118.
 TAO Xin-zhu,ZHAO Peng,LIU Tao.Extraction of Evaluation Collection of Merging Kernel Sentence and Dependency Relation[J].Computer Technology and Development,2014,24(01):118.
[2]张培颖[],房龙云[]. 多特征结合的词语相似度计算模型[J].计算机技术与发展,2014,24(12):37.
 ZHANG Pei-ying[],FANG Long-yun[]. Word Similarity Computation Model of Multi-features Combination[J].Computer Technology and Development,2014,24(01):37.
[3]刘清松,张仰森. 基于多词典融合的词汇语义倾向判别[J].计算机技术与发展,2015,25(05):104.
 LIU Qing-song,ZHANG Yang-sen. Lexical Semantic Tendency Determination Based on Multi-dictionary Strategy[J].Computer Technology and Development,2015,25(01):104.
[4]段准,刘功申. 基于TextRank的用户模板构建方法[J].计算机技术与发展,2015,25(10):1.
 DUAN Zhun,LIU Gong-shen. Method of Building User Profile Based on TextRank[J].Computer Technology and Development,2015,25(01):1.
[5]杨 泉.基于遗传算法的词语语义相似度计算研究[J].计算机技术与发展,2021,31(02):8.[doi:10. 3969 / j. issn. 1673-629X. 2021. 02. 002]
 YANG Quan.Research on Word Semantic Similarity Calculation Based on Genetic Algorithm[J].Computer Technology and Development,2021,31(01):8.[doi:10. 3969 / j. issn. 1673-629X. 2021. 02. 002]
[6]付鹏斌,刘 曼,杨惠荣.结合学科情感分析与依存关系的相似度评分[J].计算机技术与发展,2022,32(02):32.[doi:10. 3969 / j. issn. 1673-629X. 2022. 02. 005]
 FU Peng-bin,LIU Man,YANG Hui-rong.Similarity Score Combining Subject Sentiment Analysis and Dependency Relationship[J].Computer Technology and Development,2022,32(01):32.[doi:10. 3969 / j. issn. 1673-629X. 2022. 02. 005]

更新日期/Last Update: 2020-01-10