[1]郭海凤,李莉.基于CUDA平台的FIR滤波算法的设计与优化[J].计算机技术与发展,2014,24(03):102-105.
 GUO Hai-feng[],LI Li[].esign and Optimization of FIR Filtering Algorithm Based on CUDA Platform[J].,2014,24(03):102-105.
点击复制

基于CUDA平台的FIR滤波算法的设计与优化()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
24
期数:
2014年03期
页码:
102-105
栏目:
智能、算法、系统工程
出版日期:
2014-03-31

文章信息/Info

Title:
esign and Optimization of FIR Filtering Algorithm Based on CUDA Platform
文章编号:
1673-629X(2014)03-0102-04
作者:
郭海凤1李莉2
1.金陵科技学院 信息技术学院;2.江苏省信息分析工程实验室
Author(s):
GUO Hai-feng[1]LI Li[2]
关键词:
FIR滤波算法并行计算GPU计算CUDA平台矩阵乘法
Keywords:
FIR filtering algorithmparallel computingGPU computingCUDA platformmatrix multiplication
分类号:
TP391
文献标志码:
A
摘要:
针对目前基于普通DSP的FIR算法速度低、扩展性差的缺点,提出并实现基于CUDA平台实现的FIR滤波算法。由于在CUDA中程序可以直接操作数据而无需借助于图形系统的API,使开发者能够在GPU 强大计算能力的基础上建立起一种效率更高的密集数据计算解决方案。该算法将CUDA用于FIR滤波器输入输出关系计算,采用矩阵乘法的并行运算技术,在GPU上建立并行滤波模型,并对算法进行了优化。实验结果表明,在Tesla C1060平台上,和传统的基于DSP的FIR滤波算法计算速度相比,基于CUDA平台计算FIR滤波算法时,其加速比可接近30,解决了传统基于DSP计算FIR滤波算法速度较慢、扩展性差的问题。
Abstract:
It is well known that FIR algorithm based on normal DSP has low computing speed and extensive capabilities. In order to over-come these,present a new FIR filter algorithm based on CUDA platform. Since in CUDA program can directly manipulate data without graphics API of the system,enables developers on the basis of the powerful GPU computing power to set up a efficient dense data compu-ting solutions. The algorithm adopts CUDA for FIR filter calculation of input and output relationship,using the parallel computing tech-nology of matrix multiplication,on the GPU the parallel filtering model is established,and the algorithm is optimized. Experiment on Tes-la C1060 shows that,compared with traditional FIR filter algorithm's speed based on DSP,it can accelerate its computation speed up to 30 times,solving conventional FIR filter's defect based on DSP of low speed and bad extending capabilities.

相似文献/References:

[1]龚向坚 邹腊梅 隆重.基于分布对象的虚拟网络实验系统设计与实现[J].计算机技术与发展,2010,(01):111.
 GONG Xiang-jian,ZOU La-mei,LONG Zhong.Design and Realization of Virtual Network Laboratory System Based on Distributing Object[J].,2010,(03):111.
[2]职为梅 王芳 范明 杨勇.并行环境下的同步异步PSO算法[J].计算机技术与发展,2009,(03):123.
 ZHI Wei-mei,WANG Fang,FAN Ming,et al.Synchronous and Asynchronous PSO Algorithm of Parallel Circumstance[J].,2009,(03):123.
[3]陈小飞 徐宏炳.基于网格的并行FFT计算研究[J].计算机技术与发展,2008,(03):67.
 CHEN Xiao-fei,XU Hong-bing.Research of Parallel FFT Computing Based on Grid[J].,2008,(03):67.
[4]陈荣征 李代平 黄健 秦昭晖.EBE-PCG算法在有限元并行计算中的应用研究[J].计算机技术与发展,2008,(03):232.
 CHEN Rong-zheng,LI Dai-ping,HUANG Jian,et al.Research on Application of EBE- PCG Algorithm in Parallel Computing of FEM[J].,2008,(03):232.
[5]王勇超 张璟 王新卫 马静.基于MPICH2的高性能计算集群系统研究[J].计算机技术与发展,2008,(09):101.
 WANG Yong-chao,ZHANG Jing,WANG Xin-wei,et al.Research of High Performance Cluster System Based on MPICH2[J].,2008,(03):101.
[6]王延玲 祝永志 郭静.基于HPJava集群系统的环境搭建与性能分析[J].计算机技术与发展,2008,(11):94.
 WANG Yan-ling,ZHU Yong-zhi,GUO Jing.Constructing and Perfomance Analysis of Cluster System Based on HPJava[J].,2008,(03):94.
[7]冯冲 颜廷华.基于线积分卷积算法的并行实现方法[J].计算机技术与发展,2008,(12):22.
 FENG Chong,YAN Ting-hua.A Parallel Algorithm for Line Integral Convolution[J].,2008,(03):22.
[8]张晓奇 张翌维 郑新建.一种基于流水线结构的多级数字混沌编码方案[J].计算机技术与发展,2007,(05):152.
 ZHANG Xiao-qi,ZHANG Yi-wei,ZHENG Xin-jian.A Multilevel Digital Chaotic Encoding Scheme Based on Pipeline Structure[J].,2007,(03):152.
[9]牛志伟 黄红女.Windows平台下机群并行编译环境配置[J].计算机技术与发展,2007,(08):15.
 NIU Zhi-wei,HUANG Hong-nü.Configuration of Parallel Compile Environment of Cluster on Windows Platform[J].,2007,(03):15.
[10]蔡佳佳 李名世 郑锋.多核微机基于OpenMP的并行计算[J].计算机技术与发展,2007,(10):87.
 CAI Jia-jia,LI Ming-shi,ZHENG Feng.OpenMP- Based Parallel Computation on Multi- Core PC[J].,2007,(03):87.

更新日期/Last Update: 1900-01-01