[1]赵震,张龙昌. XML文档实体识别技术研究[J].计算机技术与发展,2014,24(10):84-87.
 ZHAO Zhen,ZHANG Long-chang. Research on Entity Identification Technology on XML Documents[J].,2014,24(10):84-87.
点击复制

 XML文档实体识别技术研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
24
期数:
2014年10期
页码:
84-87
栏目:
智能、算法、系统工程
出版日期:
2014-10-10

文章信息/Info

Title:
 Research on Entity Identification Technology on XML Documents
文章编号:
1673-629X(2014)10-0084-04
作者:
 赵震张龙昌
 渤海大学 信息科学与技术学院
Author(s):
 ZHAO ZhenZHANG Long-chang
关键词:
 XML文档实体识别数据质量
Keywords:
 XML documentsentity recognitionquality of data
分类号:
TP311
文献标志码:
A
摘要:
 随着XML文档的广泛应用,使用实体识别技术对XML文档数据质量进行管理变得非常重要。 XML中实体识别技术主要用于在XML文档中发现同一实体的不同描述,其在数据质量管理中可以用于错误检测、数据集成等。由于XML文档是半结构化的, XML文档上的实体识别与纯文本和关系数据上的实体识别有着很大不同。文中介绍了XML文档上实体识别的概念和应用,分别讨论了 XML文档上几种实体识别技术的概念和原理,给出了相应的树匹配算法,最后得出结论并展望了未来的研究方向。
Abstract:
 With the wide application of XML documents,it is important for applying entity recognition technology to the XML data quali-ty for management. Entity recognition is mainly applied to find different descriptions of the same entity in the XML document,which can be used for error detection,data integration in data quality management. Because XML documents is a semi-structured,entity identifica-tion is different from plain text and relation database in XML document. In this paper,introduce the concept and application of entity iden-tification of the XML document,and the concept and principle of several entity recognition technology are discussed,and the correspond-ing tree matching algorithm is given,finally discuss the prospect of future research directions.

相似文献/References:

[1]张志宏,吴庆波,邵立松,等.基于飞腾平台TOE协议栈的设计与实现[J].计算机技术与发展,2014,24(07):1.
 ZHANG Zhi-hong,WU Qing-bo,SHAO Li-song,et al. Design and Implementation of TCP/IP Offload Engine Protocol Stack Based on FT Platform[J].,2014,24(10):1.
[2]梁文快,李毅. 改进的基因表达算法对航班优化排序问题研究[J].计算机技术与发展,2014,24(07):5.
 LIANG Wen-kuai,LI Yi. Research on Optimization of Flight Scheduling Problem Based on Improved Gene Expression Algorithm[J].,2014,24(10):5.
[3]黄静,王枫,谢志新,等. EAST文档管理系统的设计与实现[J].计算机技术与发展,2014,24(07):13.
 HUANG Jing,WANG Feng,XIE Zhi-xin,et al. Design and Implementation of EAST Document Management System[J].,2014,24(10):13.
[4]侯善江[],张代远[][][]. 基于样条权函数神经网络P2P流量识别方法[J].计算机技术与发展,2014,24(07):21.
 HOU Shan-jiang[],ZHANG Dai-yuan[][][]. P2P Traffic Identification Based on Spline Weight Function Neural Network[J].,2014,24(10):21.
[5]李璨,耿国华,李康,等. 一种基于三维模型的文物碎片线图生成方法[J].计算机技术与发展,2014,24(07):25.
 LI Can,GENG Guo-hua,LI Kang,et al. A Method of Obtaining Cultural Debris’ s Line Chart Based on Three-dimensional Model[J].,2014,24(10):25.
[6]翁鹤,皮德常. 混沌RBF神经网络异常检测算法[J].计算机技术与发展,2014,24(07):29.
 WENG He,PI De-chang. Chaotic RBF Neural Network Anomaly Detection Algorithm[J].,2014,24(10):29.
[7]刘茜[],荆晓远[],李文倩[],等. 基于流形学习的正交稀疏保留投影[J].计算机技术与发展,2014,24(07):34.
 LIU Qian[],JING Xiao-yuan[,LI Wen-qian[],et al. Orthogonal Sparsity Preserving Projections Based on Manifold Learning[J].,2014,24(10):34.
[8]尚福华,李想,巩淼. 基于模糊框架-产生式知识表示及推理研究[J].计算机技术与发展,2014,24(07):38.
 SHANG Fu-hua,LI Xiang,GONG Miao. Research on Knowledge Representation and Inference Based on Fuzzy Framework-production[J].,2014,24(10):38.
[9]叶偲,李良福,肖樟树. 一种去除运动目标重影的图像镶嵌方法研究[J].计算机技术与发展,2014,24(07):43.
 YE Si,LI Liang-fu,XIAO Zhang-shu. Research of an Image Mosaic Method for Removing Ghost of Moving Targets[J].,2014,24(10):43.
[10]余松平[][],蔡志平[],吴建进[],等. GSM-R信令监测选择录音系统设计与实现[J].计算机技术与发展,2014,24(07):47.
 YU Song-ping[][],CAI Zhi-ping[] WU Jian-jin[],GU Feng-zhi[]. Design and Implementation of an Optional Voice Recording System Based on GSM-R Signaling Monitoring[J].,2014,24(10):47.
[11]张苗[],惠小强[]. 一种快速的XML文档验证算法[J].计算机技术与发展,2015,25(08):123.
 ZHANG Miao[],XI Xiao-qiang[]. A Fast Algorithm of XML Document Verification[J].,2015,25(10):123.
[12]赵震[][],任永昌[]. 大数据时代电子政务中XML文档相似性[J].计算机技术与发展,2017,27(01):186.
 ZHAO Zhen[][],REN Yong-chang[]. Similarity of XML Documents in E-government in Era of Big Data[J].,2017,27(10):186.

更新日期/Last Update: 2015-04-02