[1] GAO Ke, DIAO Xing-chun, CAO Jian-jun. Design and Application of Data Quality Detection System Based on Simple Rules[J]. Computer Technology and Development, 2015, 25(06): 176-180.

 Design and Application of Data Quality Detection System Based on Simple Rules

Computer Technology and Development [ISSN:1006-6977/CN:61-1281/TN]

Volume:
25
Issue:
2015, No. 06
Pages:
176-180
Column:
Application Development Research
Publication date:
2015-06-10

Article Info

Title:
 Design and Application of Data Quality Detection System Based on Simple Rules
Article ID:
1673-629X(2015)06-0176-05
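Author(s):
 GAO Ke[1], DIAO Xing-chun[2], CAO Jian-jun[2]
 1. College of Command Information Systems, PLA University of Science and Technology; 2. The 63rd Research Institute of the PLA General Staff Headquarters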
Keywords:
 data quality; assessment index; rules; structured (relational) data
CLC number:
TP311
Document code:
A
Abstract:
 To check data for quality problems more comprehensively and to locate the problematic records, this paper analyzes the general indexes of data quality assessment. From the perspective of rule constraints, it classifies and describes the rules that apply to relational data: the format, syntax, length, and value range of individual fields, and the logical relationships and functional dependencies between fields. Corresponding data quality checking algorithms are designed and implemented in code, forming a complete data quality inspection tool. The tool is applied to the equipment and personnel information data of an organization, which is checked for completeness, conformity, consistency, and validity. The experimental results show that these rules can effectively detect the problems present in relational data.
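The rule categories named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names, the field names (`staff_id`, `dept_code`, `dept_name`, `age`), the sample rows, and the thresholds are all hypothetical, chosen only to show one simple rule per quality dimension. Each check returns the indices of the rows that violate the rule.

```python
import re
from collections import defaultdict


def check_required(rows, field):
    """Completeness: indices of rows where the field is missing or empty."""
    return [i for i, r in enumerate(rows) if r.get(field) in (None, "")]


def check_pattern(rows, field, pattern):
    """Format: indices of rows whose non-empty value does not fully match the regex."""
    rx = re.compile(pattern)
    return [i for i, r in enumerate(rows)
            if r.get(field) not in (None, "") and not rx.fullmatch(str(r[field]))]


def check_range(rows, field, low, high):
    """Validity: indices of rows whose numeric value lies outside [low, high]."""
    return [i for i, r in enumerate(rows)
            if isinstance(r.get(field), (int, float)) and not low <= r[field] <= high]


def check_functional_dependency(rows, lhs, rhs):
    """Consistency: indices of rows whose lhs value maps to more than one rhs value."""
    seen = defaultdict(set)
    for r in rows:
        seen[r.get(lhs)].add(r.get(rhs))
    return [i for i, r in enumerate(rows) if len(seen[r.get(lhs)]) > 1]


# Hypothetical personnel records, for illustration only.
rows = [
    {"staff_id": "S001", "dept_code": "D01", "dept_name": "Maintenance", "age": 34},
    {"staff_id": "S02",  "dept_code": "D01", "dept_name": "Logistics",   "age": 41},
    {"staff_id": "",     "dept_code": "D02", "dept_name": "Logistics",   "age": 130},
]

report = {
    "completeness": check_required(rows, "staff_id"),
    "format": check_pattern(rows, "staff_id", r"S\d{3}"),
    "validity": check_range(rows, "age", 18, 65),
    "consistency": check_functional_dependency(rows, "dept_code", "dept_name"),
}
print(report)
```

In this sketch the functional-dependency check flags every row that shares a conflicting `dept_code`, because a simple rule cannot tell which of the conflicting values is the correct one.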


Last Update: 2015-08-05