[1] SUN De-cai (孙德才), WANG Xiao-xia (王晓霞). Research on Filtering Algorithm of Approximate String Matching (近似串匹配过滤算法研究) [J]. Computer Technology and Development (计算机技术与发展), 2015, 25(04): 171-176.

 Research on Filtering Algorithm of Approximate String Matching (近似串匹配过滤算法研究)

Computer Technology and Development (《计算机技术与发展》) [ISSN: 1006-6977 / CN: 61-1281/TN]

Volume:
25
Issue:
2015, No. 04
Pages:
171-176
Column:
Application Development Research
Publication date:
2015-04-10

Article Info

Title:
 Research on Filtering Algorithm of Approximate String Matching
Article ID:
1673-629X(2015)04-0171-06
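Author(s):
 SUN De-cai (孙德才)[1], WANG Xiao-xia (王晓霞)[2]
 1. College of Information Science and Technology, Bohai University; 2. Teaching and Research Department of University Computer, Bohai University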
Keywords:
 string matching; approximate string matching; filter algorithm; q-gram filter
CLC number:
TP391.3
Document code:
A
Abstract:
 Approximate string matching is widely used in many fields, such as text retrieval and bioinformatics. This paper surveys filter-based approximate string matching algorithms for the off-line mode. First, the preliminaries of string matching and the application categories of approximate string matching techniques are introduced. Next, the index structures commonly used by off-line approximate string matching algorithms are described. Then, the current state of research on filtering algorithms for approximate string matching is reviewed in detail, and the filtering principles of several classical filter algorithms are explained. Finally, the performance of these classical filter algorithms is compared experimentally; the experimental data show that improving filtration efficiency and reducing filtration time are the two key issues in speeding up matching. The study indicates that filter algorithms based on gapped q-grams are a direction for future research on approximate string matching.

Last Update: 2015-06-05