[1] CUI Li-mei, LI Yan-ping, LYU Zhong-liang. Research on Voice Conversion Based on Self Organizing Clustering and Frequency Warping [J]. Computer Technology and Development, 2017, 27(06): 106-109.

 Research on Voice Conversion Based on the ISODATA Clustering Algorithm

Computer Technology and Development (《计算机技术与发展》) [ISSN: 1006-6977 / CN: 61-1281/TN]

Volume:
27
Issue:
2017, No. 06
Pages:
106-109
Section:
Intelligence, Algorithms, and Systems Engineering
Publication date:
2017-06-10

Article Information

Title:
 Research on Voice Conversion Based on Self Organizing Clustering and Frequency Warping
Article ID:
1673-629X(2017)06-0106-04
Author(s):
 CUI Li-mei (崔立梅), LI Yan-ping (李燕萍), LYU Zhong-liang (吕中良)
 School of Communication and Information Engineering, Nanjing University of Posts and Telecommunications
Keywords:
 iterative self-organizing clustering algorithm; bilinear frequency warping; voice conversion model; residual components; clustering characteristics
CLC number:
TP301.6
Document code:
A
Abstract:
 This paper proposes a bilinear frequency warping voice conversion model based on the Iterative Self-Organizing Data Analysis Technique Algorithm (ISODATA). To address the residual components that arise when speech feature parameters are insufficiently classified, ISODATA is introduced into the clustering process of the Gaussian mixture model (GMM). The within-class means obtained by clustering serve as the initial means for model training, which mitigates the failure of the EM algorithm to converge when its initial values are poorly chosen and therefore fits the feature parameters more accurately. Voice conversion is then carried out with the subsequent bilinear frequency warping (BLFW) model. Experimental results show that the proposed algorithm has good adaptive clustering characteristics: it classifies the feature parameters more reasonably, yields a more accurate conversion function, and brings the converted speech closer to the target speech. With suitably chosen initial parameters, the proposed algorithm was compared with the Gaussian mixture model and the bilinear frequency warping model: the average mel-cepstral distortion (MCD) values differ very little, while the average mean opinion score (MOS) improves. This indicates that reasonable and accurate clustering helps improve the performance of a voice conversion system.
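The initialization idea in the abstract, clustering the feature vectors with a self-organizing split-and-merge procedure and then using the within-class means as the starting means for EM training, can be illustrated with a small sketch. This is a simplified ISODATA-style routine written for illustration only, not the authors' implementation: the function name `isodata_means`, its parameters (`max_std`, `min_dist`, `min_size`), the split and merge rules, and the synthetic 2-D data are all assumptions, and the GMM/EM training and BLFW conversion stages are omitted.

```python
import math
import random


def dist(a, b):
    """Euclidean distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]


def assign(points, centers):
    """Group points by their nearest center."""
    clusters = [[] for _ in centers]
    for p in points:
        nearest = min(range(len(centers)), key=lambda i: dist(p, centers[i]))
        clusters[nearest].append(p)
    return clusters


def isodata_means(points, k_init=1, max_std=1.0, min_dist=1.5,
                  min_size=5, n_iter=10, seed=0):
    """Simplified ISODATA-style clustering (hypothetical parameter names).

    Returns the within-class means, intended as initial means for EM.
    """
    centers = random.Random(seed).sample(points, k_init)
    for _ in range(n_iter):
        # Assign points, then discard clusters that are too small.
        clusters = [c for c in assign(points, centers) if len(c) >= min_size]
        if not clusters:
            break
        # Split any cluster that is too spread out along its widest dimension.
        candidates = []
        for c in clusters:
            mu = mean(c)
            stds = [math.sqrt(sum((p[d] - mu[d]) ** 2 for p in c) / len(c))
                    for d in range(len(mu))]
            widest = max(range(len(mu)), key=lambda i: stds[i])
            if stds[widest] > max_std and len(c) >= 2 * min_size:
                lo, hi = list(mu), list(mu)
                lo[widest] -= 0.5 * stds[widest]
                hi[widest] += 0.5 * stds[widest]
                candidates += [lo, hi]
            else:
                candidates.append(mu)
        # Merge centers that lie closer together than min_dist
        # (unweighted average, a simplification).
        centers = []
        for mu in candidates:
            for i, kept in enumerate(centers):
                if dist(mu, kept) < min_dist:
                    centers[i] = mean([mu, kept])
                    break
            else:
                centers.append(mu)
    # Final within-class means after the last split/merge pass.
    return [mean(c) for c in assign(points, centers) if c]


# Synthetic stand-in for speech feature vectors: three 2-D blobs.
rng = random.Random(1)
true_centers = [(0.0, 0.0), (5.0, 5.0), (10.0, 0.0)]
data = [[rng.gauss(cx, 0.5), rng.gauss(cy, 0.5)]
        for cx, cy in true_centers for _ in range(50)]
init_means = isodata_means(data)
print(init_means)
```

Unlike k-means with a fixed number of classes, the number of clusters here is adjusted by the split and merge steps, which is the adaptive behaviour the abstract refers to. In practice the returned means would be handed to a GMM trainer as its initial means, for example through the `means_init` argument of scikit-learn's `GaussianMixture`.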


Last Update: 2017-07-26