[1] ZHAO Zhen, REN Yong-chang. Similarity of XML Documents in E-government in Era of Big Data[J]. Computer Technology and Development, 2017, 27(01): 186-189.

 Similarity of XML Documents in E-government in Era of Big Data

Computer Technology and Development [ISSN: 1006-6977 / CN: 61-1281/TN]

Volume:
27
Issue:
2017, No. 01
Pages:
186-189
Section:
Application Development Research
Publication date:
2017-01-10

Article Info

Title:
 Similarity of XML Documents in E-government in Era of Big Data
Article ID:
1673-629X(2017)01-0186-04
Author(s):
 ZHAO Zhen[1][2], REN Yong-chang[1]
 1. College of Information Science and Technology, Bohai University; 2. School of Computer Science and Engineering, Northeastern University
Keywords:
 XML documents; similarity; feature extraction; fitting; data integration
CLC number:
TP393
Document code:
A
Abstract:
 XML has been widely studied as the data exchange standard in e-government applications. With the arrival of the big data era, managing XML data in e-government is becoming increasingly important. In XML data management, the similarity of XML documents is the key to XML data integration and XML data classification. To study XML document similarity, XML documents are transformed into trees and the corresponding features of the tree nodes are extracted. These features are used to compute per-feature node similarities, which are then fitted with the ELM (Extreme Learning Machine) algorithm to obtain the final node similarity. Based on node similarity, a similarity comparison algorithm for XML document trees is proposed, from which the similarity of XML documents is computed. Using specific evaluation metrics, the experiments report the precision, recall, F-measure, and corresponding running time of the proposed method on two different datasets, and the results verify its performance advantages.
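
The pipeline outlined in the abstract (tree transformation, node feature extraction, per-feature similarities, ELM fitting) can be illustrated with a minimal sketch. It is not the authors' implementation. The feature set (tag, text, depth), the use of `difflib.SequenceMatcher` for string similarity, the names `node_features`, `feature_similarities` and `ELMRegressor`, the hidden-layer size, and the placeholder training labels are all assumptions made for illustration; the abstract does not specify any of them.

```python
import xml.etree.ElementTree as ET
from difflib import SequenceMatcher

import numpy as np


def node_features(elem, depth=0):
    """Yield (tag, text, depth) for every node of an XML tree (assumed feature set)."""
    yield elem.tag, (elem.text or "").strip(), depth
    for child in elem:
        yield from node_features(child, depth + 1)


def feature_similarities(a, b):
    """Return one similarity in [0, 1] per feature for two node feature tuples."""
    tag_sim = SequenceMatcher(None, a[0], b[0]).ratio()
    text_sim = SequenceMatcher(None, a[1], b[1]).ratio()
    depth_sim = 1.0 / (1.0 + abs(a[2] - b[2]))
    return np.array([tag_sim, text_sim, depth_sim])


class ELMRegressor:
    """Single-hidden-layer ELM: random fixed hidden weights, least-squares output weights."""

    def __init__(self, n_hidden=20, seed=0):
        self.n_hidden = n_hidden
        self.rng = np.random.default_rng(seed)

    def _hidden(self, X):
        # Sigmoid activation of the randomly projected inputs.
        return 1.0 / (1.0 + np.exp(-(X @ self.W + self.b)))

    def fit(self, X, y):
        self.W = self.rng.normal(size=(X.shape[1], self.n_hidden))
        self.b = self.rng.normal(size=self.n_hidden)
        H = self._hidden(X)
        # Output weights from the Moore-Penrose pseudo-inverse of H.
        self.beta = np.linalg.pinv(H) @ y
        return self

    def predict(self, X):
        return self._hidden(X) @ self.beta


# Two toy documents (hypothetical data, not from the paper's datasets).
doc_a = ET.fromstring("<record><name>Li Wei</name><dept>Finance</dept></record>")
doc_b = ET.fromstring("<record><name>Li Wei</name><unit>Finance</unit></record>")
nodes_a = list(node_features(doc_a))
nodes_b = list(node_features(doc_b))

# One row of per-feature similarities for every node pair.
X = np.array([feature_similarities(a, b) for a in nodes_a for b in nodes_b])
# Placeholder labels only; real training would need annotated node pairs.
y = X.mean(axis=1)

model = ELMRegressor().fit(X, y)
node_sim = model.predict(X)
```

In this sketch the ELM only learns how to combine the per-feature similarities into a single node score. A document-level similarity would still have to aggregate the node scores over the two trees, which the abstract mentions but does not detail.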

Last Update: 2017-04-05