[1]吴代文 詹海生.西安市数字方志全文检索系统的设计与实现[J].计算机技术与发展,2011,(10):121-124.
 WU Dai-wen,ZHAN Hai-sheng.Design and Implementation of Full-Text Retrieval System for Xi'an Data Chorography[J].,2011,(10):121-124.
点击复制

西安市数字方志全文检索系统的设计与实现()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2011年10期
页码:
121-124
栏目:
智能、算法、系统工程
出版日期:
1900-01-01

文章信息/Info

Title:
Design and Implementation of Full-Text Retrieval System for Xi'an Data Chorography
文章编号:
1673-629X(2011)10-0121-04
作者:
吴代文1 詹海生2
[1]渭南师范学院传媒工程系[2]西安电子科技大学计算机学院
Author(s):
WU Dai-wen ZHAN Hai-sheng
[1]Department of Communication Engineering, Weinan Teachers University[2]College of Computer Science, Xidian University
关键词:
全文检索二次索引二次检索查全率查准率
Keywords:
full-text retrieval second Index second retrieval recall precision
分类号:
TP391.3
文献标志码:
A
摘要:
通过LuceneAPI实现对PDF文档的一次全文检索,为了更精确地定位搜索关键词,设计并实现了一种新的二次索引算法,该二次索引带有关键词的页码、坐标及其上下文等信息。利用该二次索引可将检索结果定位到PDF文档的具体页,然后在页面上标示出关键字的具体位置,使对PDF文档的二次检索达到了类似GoogleBook的图书检索效果。系统测试结果说明系统具有良好检索性能,有较高的查全率和查准率,能够满足用户快速检索的需求。系统作为西安市数字方志全文检索平台投入使用已有2年,取得了较好的应用成果
Abstract:
In the paper,it implements the fu'st index in PDF document by Lucene API. In order to locate the search keyword more accurately,this paper designs and implements a new algorithm for the second index. It contains the information about the keywords' page number, coordinates, context and so on. Which can be made used of locating the retrieval results in the specific page of the book and marking the specific positions of the keywords. Thus, the effect of the second retrieval in PDF document is as similar as Google Book. The test result proved that this system is provided with high retrieval performance, recall rate and precision rate. It can be satisfied with the requirement of quickly retrieving websites ' documents. This system has been using for 2 years as the full-text retrieval system for Xi ' an data chorography and it gets lots of application fruit

相似文献/References:

[1]郑榕增 林世平.基于Lucene的中文倒排索引技术的研究[J].计算机技术与发展,2010,(03):80.
 ZHENG Rong-zeng,LIN Shi-ping.Research of Chinese Full Texts Inverted Index Based on Lucene[J].,2010,(10):80.
[2]李永春 丁华福.Lucene的全文检索的研究与应用[J].计算机技术与发展,2010,(02):12.
 LI Yong-chun,DING Hua-fu.Research and Application of Full Text Search Based on Lucene[J].,2010,(10):12.
[3]林碧英 赵锐 陈良臣.基于Lucene的全文检索引擎研究与应用[J].计算机技术与发展,2007,(05):184.
 LIN Bi-ying,ZHAO Rui,CHEN Liang-chen.Research and Application of Full Text Search Engine Based on Lucene[J].,2007,(10):184.
[4]苏延君 张宏军 郝文宁.基于P2P的数据库全文检索系统的设计与实现[J].计算机技术与发展,2007,(09):28.
 SU Yan-jun,ZHANG Hong-jun,HAO Wen-ning.Design and Realization of DB Full Text Retrieval System Based on P2P[J].,2007,(10):28.
[5]蒙辉 陈燕.Oracle Text技术在复杂结构数据库中的应用[J].计算机技术与发展,2007,(04):38.
 MENG Hui,CHEN Yan.Application of Oracle Text in Complex - Structured Database[J].,2007,(10):38.
[6]韩升 刘广志.全文检索系统的数据预处理研究[J].计算机技术与发展,2006,(03):208.
 HAN Sheng,LIU Guang-zhi.Study of Data-Pretreatment for Full-Text Search System[J].,2006,(10):208.
[7]聂红梅 赵建民.Oracle数据库中Clob大字段的查询优化技术研究[J].计算机技术与发展,2006,(08):97.
 NIE Hong-mei,ZHAO Jian-min.Research of Optimum Query Technology on Clob Big Segment in Oracle Database[J].,2006,(10):97.
[8]周锦程 王丹 余泉 张维.基于Lucene的全文检索系统的研究与实现[J].计算机技术与发展,2011,(03):67.
 ZHOU Jin-cheng,WANG Dan,YU Quan,et al.Research and Implementation of Full-Text Retrieval Engine Based on Lucene[J].,2011,(10):67.

备注/Memo

备注/Memo:
教育部特色专业建设点(TS11772)吴代文(1979-),男,硕士,讲师,主要研究方向为远程教育、教育信息检索;詹海生,博士,副教授,主要研究领域为计算机图形学,数据与知识工程等
更新日期/Last Update: 1900-01-01