[1]白菊[],何聚厚[]. 应用于问答系统的Lucene相似度检索算法改进[J].计算机技术与发展,2017,27(11):79-82.
 BAI Ju[],HE Ju-hou[]. Improvement of Lucene Similarity Search Algorithm Applied in Question Answering System[J].,2017,27(11):79-82.
点击复制

 应用于问答系统的Lucene相似度检索算法改进()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
27
期数:
2017年11期
页码:
79-82
栏目:
安全与防范
出版日期:
2017-11-10

文章信息/Info

Title:
 Improvement of Lucene Similarity Search Algorithm Applied in Question Answering System
文章编号:
1673-629X(2017)11-0079-04
作者:
 白菊[1]何聚厚[2]
1. 现代教学技术教育部重点实验室;2.陕西师范大学 计算机科学学院
Author(s):
 BAI Ju[1]HE Ju-hou[2]
关键词:
 Lucene 相似度问答系统语义
Keywords:
 Lucene similarityquestion answering systemsemantics
分类号:
TP301.6
文献标志码:
A
摘要:
 Lucene在文本检索和搜索领域有着广泛的应用,相似度评分算法是其搜索引擎的核心部分之一.而在问答系统中,也要用到检索功能,相似度评分算法也是其核心部分之一.那么能否对Lucene的相似度评分算法进行改进,使其在问答系统的领域也能得到很好的应用.针对上述提出的问题,结合问答系统中问句简短、包含信息量少的特点,引入外部词典对查找的关键词进行扩展,分析检索词项的语义相似度以及将词项位置关系的特征应用到Lucene中.在Lucene的基础上,对其语义相似度算法进行改进,提出了一种新的语义相似度评分算法.该算法考虑了词项位置关系和语义理解,能够更好地应用于问答系统.实验结果表面,提出的相似度算法能有效地提高自动问答系统的回答准确率.
Abstract:
 Lucene has a wide range of applications in the field of text retrieval and search,and the similarity score algorithm is one of the key parts of its search engine. And in the question answering system,the search function is also used,and the similarity score algorithm is also one of the key parts of its search engine. It is possible to improve the similarity score algorithm of the Lucene so that it can be widely used in the field of question answering system. In view of this problem,combined with the question answering system in the characteristic of brief question and small amount of information,the external dictionary is introduced to expand the searched key words,analysis and re-trieval of semantic similarity of words,application of lexical position relationship feature in Lucene. On the basis of Lucene,its semantic similarity algorithm is improved,and a new one is proposed which can be better applied in question answering system in consideration of lexical position relationship and semantic understanding. Experimental results show that the proposed algorithm can effectively improve the accuracy of the question answering system.

相似文献/References:

[1]张志宏,吴庆波,邵立松,等.基于飞腾平台TOE协议栈的设计与实现[J].计算机技术与发展,2014,24(07):1.
 ZHANG Zhi-hong,WU Qing-bo,SHAO Li-song,et al. Design and Implementation of TCP/IP Offload Engine Protocol Stack Based on FT Platform[J].,2014,24(11):1.
[2]梁文快,李毅. 改进的基因表达算法对航班优化排序问题研究[J].计算机技术与发展,2014,24(07):5.
 LIANG Wen-kuai,LI Yi. Research on Optimization of Flight Scheduling Problem Based on Improved Gene Expression Algorithm[J].,2014,24(11):5.
[3]黄静,王枫,谢志新,等. EAST文档管理系统的设计与实现[J].计算机技术与发展,2014,24(07):13.
 HUANG Jing,WANG Feng,XIE Zhi-xin,et al. Design and Implementation of EAST Document Management System[J].,2014,24(11):13.
[4]侯善江[],张代远[][][]. 基于样条权函数神经网络P2P流量识别方法[J].计算机技术与发展,2014,24(07):21.
 HOU Shan-jiang[],ZHANG Dai-yuan[][][]. P2P Traffic Identification Based on Spline Weight Function Neural Network[J].,2014,24(11):21.
[5]李璨,耿国华,李康,等. 一种基于三维模型的文物碎片线图生成方法[J].计算机技术与发展,2014,24(07):25.
 LI Can,GENG Guo-hua,LI Kang,et al. A Method of Obtaining Cultural Debris’ s Line Chart Based on Three-dimensional Model[J].,2014,24(11):25.
[6]翁鹤,皮德常. 混沌RBF神经网络异常检测算法[J].计算机技术与发展,2014,24(07):29.
 WENG He,PI De-chang. Chaotic RBF Neural Network Anomaly Detection Algorithm[J].,2014,24(11):29.
[7]刘茜[],荆晓远[],李文倩[],等. 基于流形学习的正交稀疏保留投影[J].计算机技术与发展,2014,24(07):34.
 LIU Qian[],JING Xiao-yuan[,LI Wen-qian[],et al. Orthogonal Sparsity Preserving Projections Based on Manifold Learning[J].,2014,24(11):34.
[8]尚福华,李想,巩淼. 基于模糊框架-产生式知识表示及推理研究[J].计算机技术与发展,2014,24(07):38.
 SHANG Fu-hua,LI Xiang,GONG Miao. Research on Knowledge Representation and Inference Based on Fuzzy Framework-production[J].,2014,24(11):38.
[9]叶偲,李良福,肖樟树. 一种去除运动目标重影的图像镶嵌方法研究[J].计算机技术与发展,2014,24(07):43.
 YE Si,LI Liang-fu,XIAO Zhang-shu. Research of an Image Mosaic Method for Removing Ghost of Moving Targets[J].,2014,24(11):43.
[10]余松平[][],蔡志平[],吴建进[],等. GSM-R信令监测选择录音系统设计与实现[J].计算机技术与发展,2014,24(07):47.
 YU Song-ping[][],CAI Zhi-ping[] WU Jian-jin[],GU Feng-zhi[]. Design and Implementation of an Optional Voice Recording System Based on GSM-R Signaling Monitoring[J].,2014,24(11):47.

更新日期/Last Update: 2017-12-26