[1] HUANG Wu-feng, HE Dong-lei, HUANG Ming-xuan. Vietnamese-English Cross Language Information Retrieval Model Based on Association Rule Consequent Expansion[J]. Computer Technology and Development, 2019, 29(04): 164-168. [doi:10.3969/j.issn.1673-629X.2019.04.033]

Vietnamese-English Cross Language Information Retrieval Based on Association Rule Consequent Expansion

Computer Technology and Development [ISSN:1006-6977/CN:61-1281/TN]

Volume:
29
Issue:
2019, No. 04
Pages:
164-168
Column:
Application Development Research
Publication date:
2019-04-10

Article Info

Title:
Vietnamese-English Cross Language Information Retrieval Model Based on Association Rule Consequent Expansion
Article ID:
1673-629X(2019)04-0164-05
Author(s):
HUANG Wu-feng, HE Dong-lei, HUANG Ming-xuan
School of Information and Statistics, Guangxi University of Finance and Economics, Nanning 530003, China
Keywords:
all-weighted pattern mining; query expansion; cross language information retrieval; information retrieval
Classification number:
TP301
DOI:
10.3969/j.issn.1673-629X.2019.04.033
Abstract:
To address query drift in cross language information retrieval, we propose a Vietnamese-English cross language information retrieval model based on all-weighted association rule consequent expansion. We present the structure of the model and its functional modules, and describe its key techniques and algorithms in detail. The model combines all-weighted pattern mining with user relevance feedback expansion: the Vietnamese query is translated into English by a machine translation system and used to retrieve English documents, and the top-ranked documents from this initial retrieval form the user relevance feedback document set. All-weighted association rule mining is then applied to this set to mine association rules related to the original query; the rule consequents are taken as expansion terms and combined with the original query into a new query, which is used to retrieve the English documents again and obtain the final result. Experimental results on the NTCIR-5 CLIR data set show that the proposed model reduces query drift in Vietnamese-English cross language retrieval and improves retrieval performance.
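The expansion step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not give the all-weighted support and confidence formulas, so the `weighted_support` definition, the thresholds, and all function names below are assumptions chosen for illustration. Machine translation and document retrieval are left out; the feedback set is assumed to be already available as one term-to-weight dictionary per top-ranked document.

```python
def weighted_support(itemset, docs):
    """Assumed stand-in for all-weighted support (not the paper's formula):
    for each document containing every item, take the mean weight of the
    items, then average that over all feedback documents."""
    if not docs:
        return 0.0
    total = 0.0
    for doc in docs:
        if all(term in doc for term in itemset):
            total += sum(doc[term] for term in itemset) / len(itemset)
    return total / len(docs)


def expansion_terms(query_terms, feedback_docs, min_support=0.1, min_confidence=0.5):
    """Mine rules {query term} -> {candidate term} and return the consequents,
    ranked by their best confidence score (ties broken alphabetically)."""
    query = set(query_terms)
    candidates = {term for doc in feedback_docs for term in doc} - query
    best = {}
    for q in query:
        sup_q = weighted_support((q,), feedback_docs)
        if sup_q == 0:
            continue  # query term never occurs in the feedback set
        for t in candidates:
            sup_qt = weighted_support((q, t), feedback_docs)
            if sup_qt < min_support:
                continue
            # Ratio of weighted supports; with averaged weights it is a
            # ranking score and is not guaranteed to stay below 1.
            confidence = sup_qt / sup_q
            if confidence >= min_confidence:
                best[t] = max(best.get(t, 0.0), confidence)
    return sorted(best, key=lambda t: (-best[t], t))


def expand_query(query_terms, feedback_docs, top_k=3, **thresholds):
    """Append the top_k rule consequents to the original query terms."""
    extra = expansion_terms(query_terms, feedback_docs, **thresholds)
    return list(query_terms) + extra[:top_k]


# Hypothetical feedback set: term weights for three top-ranked documents.
feedback_docs = [
    {"economy": 0.8, "trade": 0.7, "export": 0.6},
    {"economy": 0.5, "trade": 0.4, "weather": 0.2},
    {"export": 0.9, "weather": 0.3},
]
print(expand_query(["economy"], feedback_docs))
# → ['economy', 'trade', 'export']
```

In this toy data, "weather" co-occurs with the query term only weakly, so its rule falls below the confidence threshold and it is not added. That filtering is the mechanism the abstract relies on to limit query drift.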


Last Update: 2019-04-10