[1]黄宇达 魏霞 王迤冉[].一种轻量级中文搜索引擎模型的设计与实现[J].计算机技术与发展,2012,(09):201-204.
 HUANG Yu-da,WEI Xia,WANG Yi-ran.Design and Implementation of System Model of a Lightweight Chinese Search Engine[J].,2012,(09):201-204.
点击复制

一种轻量级中文搜索引擎模型的设计与实现()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2012年09期
页码:
201-204
栏目:
应用开发研究
出版日期:
1900-01-01

文章信息/Info

Title:
Design and Implementation of System Model of a Lightweight Chinese Search Engine
文章编号:
1673-629X(2012)09-0201-04
作者:
黄宇达12 魏霞2 王迤冉[3]
[1]西南科技大学计算机科学与技术学院[2]周口职业技术学院信息工程系[3]周口师范学院计算机科学与技术学院
Author(s):
HUANG Yu-da WEI Xia WANG Yi-ran
[1]College of Computer Science and Technology, Southwest University of Science and Technology[2]Information and Engineering Department,Zhoukou Vocational and Technical College[3]College of Computer Science and Technology,Zhoukou Normal University
关键词:
网络爬虫URL库中文分词倒排文件索引多线程
Keywords:
Web crawler URL library Chinese word segmentation inverted file index multi-threaded
分类号:
TP31
文献标志码:
A
摘要:
首先详细介绍了一种建构在PC Windows平台上的轻量级中文搜索引擎系统模型的总体设计,然后采用基于多线程技术的广度优先遍历法及最大匹配法和最小匹配法相结合的中文分词法等技术进行了各个主要功能模块的具体设计和实现,对模型进行了基于多线程的网络爬虫、用户接口等测试。测试实验结果表明:构建并实现的轻量级中文搜索引擎系统模型能较好地实现一个简单中文搜索引擎所具有的基本功能,系统界面简单实用,具有较高的资源检索率并能够保证检索结果的准确性
Abstract:
First described in detail the overall design of the lightweight Chinese search engine system model based on PC Windows platform, and then the major functional blocks were designed and realized by using breadth-first traversal method based on multi-threading technology and the Chinese sub-lexical method of the combination of the maximum matching method and the minimum matching method and other technology ,then carded out some tests based on multi-threaded Web crawler and user interface on the model. Experimental results show:the lightweight Chinese search engine system built and realized is able to achieve the basic functions of a simple Chinese search engine and good operating results, the system interface is simple and practical, with higher rates of resource retrieval and to ensure the accuracy of search results

相似文献/References:

[1]张林才 张燕 王红霞.节点对等WebSpider设计与实现[J].计算机技术与发展,2010,(03):195.
 ZHANG Lin-cai,ZHANG Yan,WANG Hong-xia.Design and Realization of Peer - to - Peer Web Spider[J].,2010,(09):195.
[2]张春元 康耀红 伍小芹.Web新闻自动采集发布系统的设计与实现[J].计算机技术与发展,2009,(09):250.
 ZHANG Chun-yuan,KANG Yao-hong,WU Xiao-qin.Design and Implementation of Web News Automatically Gathering and Publishing System[J].,2009,(09):250.
[3]周凤丽 林晓丽.基于Lucene的Web搜索引擎的研究和实现[J].计算机技术与发展,2012,(01):140.
 ZHOU Feng-li,LIN Xiao-li.Research and Implementation of Web Search Engine Based on Lucene[J].,2012,(09):140.
[4]张俊,李鲁群,周熔.基于Lucene的搜索引擎的研究与应用[J].计算机技术与发展,2013,(06):230.
 ZHANG Jun,LI Lu-qun,ZHOU Rong.Research and Application of Search Engine Based on Lucene[J].,2013,(09):230.
[5]孙青云,王俊峰,赵宗渠,等.一种基于模拟登录的微博数据采集方案[J].计算机技术与发展,2014,24(03):6.
 SUN Qing-yun[],WANG Jun-feng[],ZHAO Zong-qu[],et al.A Microblog Data Collection Method Based on Simulated Login Technology[J].,2014,24(09):6.
[6]杨洋[][],李晓风[][],赵赫[][],等. 基于网络爬虫的文献检索系统的研究和实现[J].计算机技术与发展,2014,24(11):35.
 YANG Yang[][],LI Xiao-feng[][],ZHAO He[][],et al. Research and Realization of Academic Search System Based on Network Crawler[J].,2014,24(09):35.
[7]付剑生[] .徐林龙[]。 林文斌[]. 分布式全网职位搜索引擎的研究与实现[J].计算机技术与发展,2015,25(05):6.
 FU Jian-sheng[],XU Lin-long[],LIN Wen-bin[]. Research and Implementation of Distributed Network-wide Job Search Engine[J].,2015,25(09):6.
[8]韩贝,马明栋,王得玉.基于Scrapy框架的爬虫和反爬虫研究[J].计算机技术与发展,2019,29(02):139.[doi:10.3969/j.issn.1673-629X.2019.02.029]
 HAN Bei,MA Mingdong,WANG Deyu.Research on Crawler and Anti-reptile Based on Scrapy Framework[J].,2019,29(09):139.[doi:10.3969/j.issn.1673-629X.2019.02.029]
[9]王荩梓,赖雯洁. 基于房产交易网站的数据获取与在线工具开发[J].计算机技术与发展,2017,27(05):154.
 WANG Jin-zi,LAI Wen-jie. Data Acquisition and Development of Online Analysis Tools Based on Real Estate Transaction Websites[J].,2017,27(09):154.
[10]陈春玲,张凡,余瀚.Web应用程序漏洞检测系统设计[J].计算机技术与发展,2017,27(09):101.
 CHEN Chun-ling,ZHANG Fan,YU Han. Design of Vulnerability Detection System for Web Application Program[J].,2017,27(09):101.

备注/Memo

备注/Memo:
河南省科技基础与前沿技术研究计划项目(112300410307)黄宇达(1975-),男,河南周口人,硕士,讲师,研究方向为知识工程、并行计算等
更新日期/Last Update: 1900-01-01