[1]徐辉,王宁章,雷琳琳. 一种海量中文地址转化与切割的方法研究[J].计算机技术与发展,2015,25(11):6-10.
 XU Hui,WANG Ning-zhang,LEI Lin-lin. Research on a Massive Chinese Address Conversion and Cutting Method[J].,2015,25(11):6-10.
点击复制

 一种海量中文地址转化与切割的方法研究()

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
25
期数:
2015年11期
页码:
6-10
栏目:
智能、算法、系统工程
出版日期:
2015-11-10

文章信息/Info

Title:
 Research on a Massive Chinese Address Conversion and Cutting Method
文章编号:
1673-629X(2015)11-0006-05
作者:
 徐辉王宁章雷琳琳
 广西大学 计算机与电子信息学院
Author(s):
 XU HuiWANG Ning-zhangLEI Lin-lin
关键词:
 中文地址PRBP-DI分区算法海量数据并行计算
Keywords:
 Chinese addressPRBP-DIpartitioning algorithmhuge amounts of dataparallel computing
分类号:
TP391.1
文献标志码:
A
摘要:
 针对在传统单节点计算模式下,处理海量中文地址数据时不能直接地进行复杂空间数学计算,并且容易受节点硬件条件限制而出现内存溢出和计算速度慢的问题,文中提出了一种中文地址信息通过第三方接口转成对应的经纬度坐标数据,再运用改进后的PRBP-DI分区算法,将海量数据切分成若干子分区分别计算的方法.减少PRBP算法中,对分区数据块列或行重复进行的扫描计算和累积求和计算.真实数据集上的实验结果表明,通过该方法能将海量中文地址数据转化并切分成分布均匀的若干子分区,且算法耗时并不一直随数据点个数增加而增大,提高了海量中文地址数据并行计算的能力和准确性.并根据两种分区算法各自的耗时变化,分析了算法耗时在数据量增大到300 000个数据点时反而减小的原因.
Abstract:
 In traditional single node calculation mode,the handling of the massive Chinese address data cannot be directly to the complex space mathematical calculations,and susceptible to node hardware conditions and problems of memory and computing speed is slow,put forward a kind of Chinese address information through third-party interface into the corresponding latitude and longitude coordinates da-ta,using the improved PRBP-DI partition algorithm,to cut the huge amounts of data into several sub partition calculation method respec-tively. Reduce scanning and cumulative sum calculation to partition data block columns or rows of repeated in PRBP algorithm. Real data sets on the experimental results show that by this method can convert massive Chinese address data and cut into uniform distribution of a number of partitions,and the algorithm is time-consuming does not always increase with increasing number of data points,improving the ability of the massive Chinese address data parallel computation and accuracy. And their respective time-consuming changes according to the two kinds of partition algorithm,analyze the cause that the algorithm’ s time is decrease when taking the data quantity increases to 300 000 data points.

相似文献/References:

[1]张志宏,吴庆波,邵立松,等.基于飞腾平台TOE协议栈的设计与实现[J].计算机技术与发展,2014,24(07):1.
 ZHANG Zhi-hong,WU Qing-bo,SHAO Li-song,et al. Design and Implementation of TCP/IP Offload Engine Protocol Stack Based on FT Platform[J].,2014,24(11):1.
[2]梁文快,李毅. 改进的基因表达算法对航班优化排序问题研究[J].计算机技术与发展,2014,24(07):5.
 LIANG Wen-kuai,LI Yi. Research on Optimization of Flight Scheduling Problem Based on Improved Gene Expression Algorithm[J].,2014,24(11):5.
[3]黄静,王枫,谢志新,等. EAST文档管理系统的设计与实现[J].计算机技术与发展,2014,24(07):13.
 HUANG Jing,WANG Feng,XIE Zhi-xin,et al. Design and Implementation of EAST Document Management System[J].,2014,24(11):13.
[4]侯善江[],张代远[][][]. 基于样条权函数神经网络P2P流量识别方法[J].计算机技术与发展,2014,24(07):21.
 HOU Shan-jiang[],ZHANG Dai-yuan[][][]. P2P Traffic Identification Based on Spline Weight Function Neural Network[J].,2014,24(11):21.
[5]李璨,耿国华,李康,等. 一种基于三维模型的文物碎片线图生成方法[J].计算机技术与发展,2014,24(07):25.
 LI Can,GENG Guo-hua,LI Kang,et al. A Method of Obtaining Cultural Debris’ s Line Chart Based on Three-dimensional Model[J].,2014,24(11):25.
[6]翁鹤,皮德常. 混沌RBF神经网络异常检测算法[J].计算机技术与发展,2014,24(07):29.
 WENG He,PI De-chang. Chaotic RBF Neural Network Anomaly Detection Algorithm[J].,2014,24(11):29.
[7]刘茜[],荆晓远[],李文倩[],等. 基于流形学习的正交稀疏保留投影[J].计算机技术与发展,2014,24(07):34.
 LIU Qian[],JING Xiao-yuan[,LI Wen-qian[],et al. Orthogonal Sparsity Preserving Projections Based on Manifold Learning[J].,2014,24(11):34.
[8]尚福华,李想,巩淼. 基于模糊框架-产生式知识表示及推理研究[J].计算机技术与发展,2014,24(07):38.
 SHANG Fu-hua,LI Xiang,GONG Miao. Research on Knowledge Representation and Inference Based on Fuzzy Framework-production[J].,2014,24(11):38.
[9]叶偲,李良福,肖樟树. 一种去除运动目标重影的图像镶嵌方法研究[J].计算机技术与发展,2014,24(07):43.
 YE Si,LI Liang-fu,XIAO Zhang-shu. Research of an Image Mosaic Method for Removing Ghost of Moving Targets[J].,2014,24(11):43.
[10]余松平[][],蔡志平[],吴建进[],等. GSM-R信令监测选择录音系统设计与实现[J].计算机技术与发展,2014,24(07):47.
 YU Song-ping[][],CAI Zhi-ping[] WU Jian-jin[],GU Feng-zhi[]. Design and Implementation of an Optional Voice Recording System Based on GSM-R Signaling Monitoring[J].,2014,24(11):47.

更新日期/Last Update: 2015-12-17