[1]唐华姣 何友全 徐小乐 徐澄.基于Lucene的分布式并行索引[J].计算机技术与发展,2011,(02):123-126.
 TANG Hua-jiao,HE You-quan,XU Xiao-le,et al.Distributed Parallel Index Based on Lucene[J].,2011,(02):123-126.
点击复制

基于Lucene的分布式并行索引()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2011年02期
页码:
123-126
栏目:
智能、算法、系统工程
出版日期:
1900-01-01

文章信息/Info

Title:
Distributed Parallel Index Based on Lucene
文章编号:
1673-629X(2011)02-0123-04
作者:
唐华姣1 何友全1 徐小乐1 徐澄2
[1]重庆交通大学信息科学与工程学院[2]重庆交通大学管理学院
Author(s):
TANG Hua-jiaoHE You-quanXU Xiao-leXU Cheng
[1]Information Science & Engineering Department,Chongqing Jiaotong University[2]Department of Management,Chongqing Jiaotong University
关键词:
索引技术Lucene搜索引擎分布式并行索引
Keywords:
index technology Lucene search engine distributed parallel index
分类号:
TP311
文献标志码:
A
摘要:
索引技术是搜索引擎的核心技术之一,索引技术的好坏直接影响到搜索引擎的查准率以及对用户的响应速度。Lucene是一个优秀的全文检索引擎架构,采用高度优化的倒排索引结构并支持增量索引。但在实际应用Lucene时存在一个值得关注的问题:随着被索引文件的增多,索引时间成线性增长,导致建索引的过程会影响搜索体验;在搜索引擎应用中,当索引文件量达到一定等级时,搜索引擎就遇到性能瓶颈。在深入分析和研究Lucene索引机制的基础上,采用以内存为缓冲区建索引文件的分布式并行索引技术形成了一个可扩展的搜索引擎解决方案,极大地缓解了建索引给搜索带来的瓶颈问题
Abstract:
Index technology is one of the core technologies of search engine.The quality of the index technology has a direct influence to the precision of search engine and the responding speed to users.Lucene,with highly optimized inverted index structure and incremental index support,is an excellent full-text search engine framework.However,when apply Lucene to practice,we can discover a problem deserves concern: with the linearly growth of indexing time which results from the increase of the number of indexed documents,the process of building index will inevitably affect the search experience.In the search engine application,when the number of indexed documents reaches a certain level,the search engine will experience performance bottleneck.Based on deeply analysis and research of the index mechanism of Lucene,proposes a scalable search engine solution by adopting memory-buffer distributed parallel index technique.This solution has greatly relieved the bottleneck problem of search engine due to index building

相似文献/References:

[1]张银玲,武彤.常用OLAP查询优化方法性能分析[J].计算机技术与发展,2014,24(01):39.
 ZHANG Yin-ling,WU Tong.Performance Analysis of Several OLAP Query Optimization Methods[J].,2014,24(02):39.
[2]李永春 丁华福.Lucene的全文检索的研究与应用[J].计算机技术与发展,2010,(02):12.
 LI Yong-chun,DING Hua-fu.Research and Application of Full Text Search Based on Lucene[J].,2010,(02):12.
[3]张春燕 刘发升.关于Lucene索引工具的性能优化研究[J].计算机技术与发展,2011,(05):121.
 ZHANG Chun-yan,LIU Fa-sheng.Lucene Indexing Tools Research Based on Optimization of Performance[J].,2011,(02):121.
[4]潘政.基于快速分词的语义Web服务搜索系统设计[J].计算机技术与发展,2013,(08):107.
 PAN Zheng.Design of Semantic Web Service Search System Based on Fast Word Segmentation[J].,2013,(02):107.
[5]樊同科,谢勇.一种混合搜索算法在智能Web中的应用[J].计算机技术与发展,2013,(08):220.
 FAN Tong-ke,XIE Yong.Application of a Hybrid Search Algorithm in Intelligent Web[J].,2013,(02):220.
[6]焦洋,王纯,韩静茹.基于Lucene 的科研查新系统构建[J].计算机技术与发展,2018,28(05):193.[doi:10.3969/ j. issn.1673-629X.2018.05.043]
 JIAO Yang,WANG Chun,HAN Jing-ru.Construction of Scientific Research Management System Based on Lucene[J].,2018,28(02):193.[doi:10.3969/ j. issn.1673-629X.2018.05.043]
[7]冯亚洲,岳东. 电力视频大数据分布式检索系统设计与实现[J].计算机技术与发展,2016,26(12):186.
 FENG Ya-zhou,YUE Dong. Design and Implementation of Distributed Retrieval System for Massive Power Video[J].,2016,26(02):186.
[8]张郁彬,张深深,孟旭东.城市路网上动态迁移的移动对象索引结构[J].计算机技术与发展,2018,28(03):47.[doi:10.3969/ j. issn.1673-629X.2018.03.010]
 ZHANG Yu-bin,ZHANG Shen-shen,MENG Xu-dong.A Moving Object Index Structure of Dynamic Migration on Urban Road Network[J].,2018,28(02):47.[doi:10.3969/ j. issn.1673-629X.2018.03.010]

备注/Memo

备注/Memo:
重庆市科技攻关项目(CSTC 2010AC6074); 重庆交通大学研究生教育创新基金资助项目;重庆交通大学实验教学改革与研究基金资助项目(SYJ200922)唐华姣(1986-),女,湖南衡阳人,硕士,研究方向为数据挖掘;何友全,博士,教授,研究方向为信息处理、数据挖掘
更新日期/Last Update: 1900-01-01