[1]李 杰,曹建军,王保卫,等.基于图常量条件函数依赖的图修复规则发现[J].计算机技术与发展,2024,34(04):7-15.[doi:10. 3969 / j. issn. 1673-629X. 2024. 04. 002]
 LI Jie,CAO Jian-jun,WANG Bao-wei,et al.Graph Repairing Rule Discovery Based on Graph Constant Conditional Functional Dependencies[J].,2024,34(04):7-15.[doi:10. 3969 / j. issn. 1673-629X. 2024. 04. 002]
点击复制

基于图常量条件函数依赖的图修复规则发现()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
34
期数:
2024年04期
页码:
7-15
栏目:
大数据与云计算
出版日期:
2024-04-10

文章信息/Info

Title:
Graph Repairing Rule Discovery Based on Graph Constant Conditional Functional Dependencies
文章编号:
1673-629X(2024)04-0007-09
作者:
李 杰123 曹建军23 王保卫1 庄 园123
1. 南京信息工程大学 计算机学院 网络空间安全学院,江苏 南京 210044;
2. 国防科技大学 第六十三研究所,江苏 南京 210007;
3. 国防科技大学 大数据与决策实验室,湖南 长沙 410073
Author(s):
LI Jie123 CAO Jian-jun23 WANG Bao-wei1 ZHUANG Yuan123
1. School of Computer & Software,Nanjing University of Information Science and Technology,Nanjing 210044,China;
2. The 63rd Research Institute,National University of Defense Technology,Nanjing 210007,China;
3. Laboratory for Big Data and Decision,National University of Defense Technology,Changsha 410073,China
关键词:
数据一致性数据质量图函数依赖图修复规则子图同构最大公共同构子图
Keywords:
data consistencydata qualitygraph functional dependencygraph repairing rulesubgraph isomorphismmaximum common isomorphism subgraph
分类号:
TP301
DOI:
10. 3969 / j. issn. 1673-629X. 2024. 04. 002
摘要:
数据一致性是数据质量管理的一个重要内容。 为了提升图数据一致性,大量关系型数据库中的数据依赖理论被引入到图数据库,包括图函数依赖、图关联规则等。 图修复规则是最新提出的一种针对图数据的数据依赖规则,具有强大的修复能力,但目前尚无有效的挖掘算法。 为了自动生成图修复规则并提高图数据修复的可靠性,提出一种将图常量条件函数依赖转化为图修复规则的方法( GenGRR) 。 通过图模式在图中匹配同构子图并映射成节点-属性二维表,从表中相应属性域中抽取错误模式把图常量条件函数依赖转化成图属性值修复规则;删去图模式中常量条件函数依赖 RHS 对应的节点与相连边生成图属性补充规则。 基于最大公共同构子图筛选并验证生成图修复规则的一致性。 在多个真实数据集上进行测试,验证相比图常量条件函数直接修复图数据,通过转化生成的图修复规则具有更好的修复效果。
Abstract:
Data consistency is an important part of data quality management. In order to improve graph data consistency,a lot of data dependency theories in relational database have?
been introduced into graph database, including graph functional dependencies, graphassociation rules and so on. Graph repairing rule is a newly proposed data dependency?
rule for graph with powerful repairing capability,but there is no effective mining algorithm yet. In order to automatically generate graph repairing rule and improve the reliability?
of graphdata repairing,a method called GenGRR is proposed to transform graph constant conditional functional dependencies into graph repairingrules. By using the graph pattern,the isomorphic subgraph is matched and mapped into a node-attribute two-dimensional table,and theerror pattern is extracted from the corresponding attribute field in the table to transform the constant condition function dependency intothe graph attribute value repair rule. The graph attribute supplement rules are generated by deleting the nodes and contiguous edges ofconstant condition function dependent on RHS in graph mode. Based on the maximum common isomorphic subgraph,the consistency ofthe repair rules of the generated graph is screened and verified. It is tested on multiple real data sets to verify that the graph repair rulegenerated by transformation has better repair effect than that of the graph constant condition function.

相似文献/References:

[1]向坤 刘晓洁 赵奎 李峰.基于数据容灾系统的服务漂移实现[J].计算机技术与发展,2010,(07):152.
 XIANG Kun,LIU Xiao-jie,ZHAO Kui,et al.Implementation of Service Migration Based on Data-Level Disaster Recovery System[J].,2010,(04):152.
[2]丁海龙 徐宏炳.数据质量分析及应用[J].计算机技术与发展,2007,(03):236.
 DING Hai-long,XU Hong-bing.Data Quality Analysis and Application[J].,2007,(04):236.
[3]景旭 黄东.基于HA结构的HLR系统容灾中心技术[J].计算机技术与发展,2006,(07):153.
 JING Xu,HUANG Dong.Disaster Recovery Center Technology of HLR System Based on HA Structure[J].,2006,(04):153.
[4]黄武锋 郑华.面向企业信息化的数据质量评估研究[J].计算机技术与发展,2011,(01):185.
 HUANG Wu-feng,ZHENG Hua.Study of Data Quality Assessment for Enterprise Informationization[J].,2011,(04):185.
[5]李章兵 车乌江.基于全局目录的分布式数据库数据一致性算法[J].计算机技术与发展,2011,(09):77.
 LI Zhang-bing,CHE Wu-jiang.Data Consistency Algorithm Based on Global Directory in Distributed Database[J].,2011,(04):77.
[6]石彦华 李蜀瑜.知识建模的清洗模型研究[J].计算机技术与发展,2011,(11):124.
 SHI Yan-hua,LI Shu-yu.Research of Data Cleaning Model Based on Knowledge[J].,2011,(04):124.
[7]刘益江 毛宁 陈庆新.一种评估数据仓库设计质量的方法[J].计算机技术与发展,2012,(09):161.
 LIU Yi-jiang,MAO Ning,CHEN Qing-xin.A Methodology for Data Warehouse Design Quality Assessment[J].,2012,(04):161.
[8]徐小龙,邹勤文,杨庚[].分布式存储系统中数据副本管理机制[J].计算机技术与发展,2013,(02):245.
 XU Xiao-long,ZOU Qin-wen,YANG Geng.Data Replication Management Mechanisms for DSS[J].,2013,(04):245.
[9]袁满,张雪.一种基于规则的数据质量评价模型[J].计算机技术与发展,2013,(03):81.
 YUAN Man,ZHANG Xue.A Data Quality Assessment Model Based on Rules[J].,2013,(04):81.
[10]赵震,张龙昌. XML文档实体识别技术研究[J].计算机技术与发展,2014,24(10):84.
 ZHAO Zhen,ZHANG Long-chang. Research on Entity Identification Technology on XML Documents[J].,2014,24(04):84.

更新日期/Last Update: 2024-04-10