[1]王小科,晏峻峰*.基于随机森林的帕金森疾病诊断模型构建研究[J].计算机技术与发展,2023,33(04):154-160.[doi:10. 3969 / j. issn. 1673-629X. 2023. 04. 023]
 WANG Xiao-ke,YAN Jun-feng.Research on Construction of Parkinson’s Disease Diagnosis Model Based on Random Forest[J].,2023,33(04):154-160.[doi:10. 3969 / j. issn. 1673-629X. 2023. 04. 023]
点击复制

基于随机森林的帕金森疾病诊断模型构建研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
33
期数:
2023年04期
页码:
154-160
栏目:
人工智能
出版日期:
2023-04-10

文章信息/Info

Title:
Research on Construction of Parkinson’s Disease Diagnosis Model Based on Random Forest
文章编号:
1673-629X(2023)04-0154-07
作者:
王小科晏峻峰*
湖南中医药大学 信息科学与工程学院,湖南 长沙 410208
Author(s):
WANG Xiao-keYAN Jun-feng
School of Information Science and Engineering,Hunan University of Chinese Medicine,Changsha 410208,China
关键词:
SVM SMOTE信息增益随机森林网格搜索交叉验证
Keywords:
SVM SMOTEinformation gainrandom forestgrid searchcross-validation
分类号:
TP301
DOI:
10. 3969 / j. issn. 1673-629X. 2023. 04. 023
摘要:
近些年来,根据帕金森疾病( PD) 患者的语音数据对该疾病做出诊断成为一种行之有效的疾病诊断方法。 首先,针对语音数据集中存在非均衡数据和噪声样本的问题使用 SVM SMOTE 过采样技术,利用支持向量机分类器寻找支持向量并在此基础上合成新的样本以达到均衡数据集的目的;为了减少数据维度,降低学习难度,运用信息增益特征选择对所有特征属性计算数值并划分数据集以此来获得信息增益,根据信息增益的大小排序选取得到八个特征作为最优特征组合;最后,构建随机森林帕金森疾病诊断模型,并采用网格搜索和交叉验证相结合的方式进行参数调优,进一步优化模型,实现诊断模型准确率的进一步提高。 实验结果表明,优化后模型的准确率、灵敏度和特异度均有提升,为别为 96. 59% 、94. 81% 和 95. 49% ,且准确率均高于支持向量机、最邻近节点算法、朴素贝叶斯和决策树等决策模型。
Abstract:
In recent years,the diagnosis of Parkinson’s disease based on the speech data of patients with Parkinson’s disease has becomean effective disease diagnosis method. Firstly,we use the SVM SMOTE oversampling technique to solve the problem of unbalanced dataand noise samples in the speech data set,and use the?
SVM classifier to find the support vector and synthesize new samples on this basis toachieve the purpose?
of the balanced data set. In order to reduce the data dimension and the difficulty of learning,we use the informationgain feature selection to calculate the value of all feature attributes and divide the data set to?
obtain the information gain,and select eightfeatures as the optimal feature combination according to the?
size of the information gain. Finally,a random forest Parkinson’s disease diagnosis model is constructed,
and the parameters are optimized by the combination of grid search and cross-validation to further optimize
the model,so as to further improve the accuracy of the diagnosis model. The experimental results show that?
the accuracy,sensitivity andspecificity of the optimized model are improved,which are 96. 59% ,94. 81% and?
95. 49% ,respectively,and the accuracy is higher thanthat of support vector machine,nearest neighbor algorithm,naive Bayes and decision trees.

相似文献/References:

[1]李广水 郑滔 孙梅.基于分形维的决策树构建及应用研究[J].计算机技术与发展,2009,(12):5.
 LI Guang-shui,ZHENG Tao,SUN Mei.Research of Decision Tree Design and Application Based on Fractal Dimension[J].,2009,(04):5.
[2]林伟 柳荣其 徐熙.邮件过滤中一种改进的特征选择方法研究[J].计算机技术与发展,2009,(01):84.
 LIN Wei,LIU Rong-qi,XU Xi.Improvement of Feature Selection Algorithm in Spam Filtering[J].,2009,(04):84.
[3]孟飞翔 帅立国 姜昌金.决策树在客户价值分析中的应用[J].计算机技术与发展,2007,(04):60.
 MENG Fei-xiang,SHUAI Li-guo,JIANG Chang-jin.An Application of Decision Tree to Analyze the Value of Customer[J].,2007,(04):60.
[4]房立 黄泽宇.竞争选择分裂属性的决策树分类模型[J].计算机技术与发展,2006,(08):106.
 FANG Li,HUANG Ze-yu.A Decision - Tree Classifier Model of Competition in Choosing Split Attribute[J].,2006,(04):106.
[5]王凡,陈健. 基于概念相似度计算的多策略本体映射研究[J].计算机技术与发展,2015,25(04):38.
 WANG Fan,CHEN Jian. Research on Multi-strategy Ontology Mapping Based on Conceptual Similarity Computing[J].,2015,25(04):38.
[6]强 晗,郭亚兰,田礼明.基于深度置信网络的恶意代码检测方法研究[J].计算机技术与发展,2019,29(07):93.[doi:10. 3969 / j. issn. 1673-629X. 2019. 07. 019]
 QIANG Han,GUO Ya-lan,TIAN Li-ming.Research on Malicious Code Detection Based on Deep Belief Networks[J].,2019,29(04):93.[doi:10. 3969 / j. issn. 1673-629X. 2019. 07. 019]
[7]孙 傲,赵礼峰.基于信息增益和基尼不纯度的 K 近邻算法[J].计算机技术与发展,2019,29(09):51.[doi:10. 3969 / j. issn. 1673-629X. 2019. 09. 010]
 SUN Ao,ZHAO Li-feng.K-Nearest Neighbor Algorithm Based on Information Gain and Gini Impurity[J].,2019,29(04):51.[doi:10. 3969 / j. issn. 1673-629X. 2019. 09. 010]
[8]陈 鹏,李勇志,余肖生.基于特征选择模型的钓鱼网站快速识别方法[J].计算机技术与发展,2021,31(04):40.[doi:10. 3969 / j. issn. 1673-629X. 2021. 04. 007]
 CHEN Peng,LI Yong-zhi,YU Xiao-sheng.Method for Quickly Identifying Phishing Websites Based onFeature Selection Model[J].,2021,31(04):40.[doi:10. 3969 / j. issn. 1673-629X. 2021. 04. 007]
[9]王励烨,丁威威.基于同步性脑网络的注意力识别研究[J].计算机技术与发展,2023,33(02):146.[doi:10. 3969 / j. issn. 1673-629X. 2023. 02. 022]
 WANG Li-ye,DING Wei-wei.Attention Recognition Based on Synchronous Brain Network[J].,2023,33(04):146.[doi:10. 3969 / j. issn. 1673-629X. 2023. 02. 022]

更新日期/Last Update: 2023-04-10