[1]姚永生,董育宁,邱晓晖.基于相似性度量的网络流分类模型融合[J].计算机技术与发展,2021,31(12):7-12.[doi:10. 3969 / j. issn. 1673-629X. 2021. 12. 002]
 YAO Yong-sheng,DONG Yu-ning,QIU Xiao-hui.Network Traffic Classification Model Fusion Based on Similarity Measurement[J].,2021,31(12):7-12.[doi:10. 3969 / j. issn. 1673-629X. 2021. 12. 002]
点击复制

基于相似性度量的网络流分类模型融合()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
31
期数:
2021年12期
页码:
7-12
栏目:
人工智能
出版日期:
2021-12-10

文章信息/Info

Title:
Network Traffic Classification Model Fusion Based on Similarity Measurement
文章编号:
1673-629X(2021)12-0007-06
作者:
姚永生董育宁邱晓晖
南京邮电大学 通信与信息工程学院,江苏 南京 210003
Author(s):
YAO Yong-shengDONG Yu-ningQIU Xiao-hui
School of Telecommunications & Information Engineering,Nanjing University of Posts and Telecommunications,Nanjing 210003,China
关键词:
Jensen-Shannon 距离迁移学习机器学习网络流分类概念漂移
Keywords:
Jensen-Shannon distancetransfer learningmachine learningnetwork traffic classificationconceptual drift
分类号:
TP301
DOI:
10. 3969 / j. issn. 1673-629X. 2021. 12. 002
摘要:
由于网络流特征会随时间和网络环境的变化而发生概念漂移,不同类别应用的流发生漂移情况不同,导致基于机器学习的流量分类方法精度明显降低。 同时,随着互联网网络技术的不断提高,使得过去采集并做好标签的大量视频流样本数据会发生很大变化,导致可用的训练集较少,需要实时采集和标注大量的新数据。 针对上述问题,提出一种结合Jensen-Shannon 距离、MultiTrAdaBoost 和 RandomForest 算法的分类方法。 该方法的核心思想是:度量新老视频数据流之间的相似性,根据度量结果判断采用何种模型进行分类,其中的迁移学习分类方法是从老数据集中选出有用信息的样本来辅助新数据集样本的识别与分类。 文中新老数据集样本特征属性分布是不一样的。 实验结果表明,与现有的方法比较,该方法可以更好地实现典型的网络视频流分类,表现出较好的分类性能和泛化能力( 即,模型的总体准确率标准差较小) 。
Abstract:
Because network flow characteristics will experience conceptual drift with time and network environment changes,the flow of different types? ?of applications drifts differently,resulting in a significant reduction in the accuracy of the traffic classification method based on machine learning. Meanwhile,with the continuous improvement of Internet network technology, the large number of video streams ample data collected and labeled in the past will change greatly,resulting in fewer training sets available,and a large amount of new data needs to be collected and labeled in real time. Regarding the problem above, a classification method combining Jensen-Shannon distance,MultiTrAdaBoost and Randon Forest algorithms is proposed. The core idea of this method is to measure the similarity between the new and old video data streams,and determine which model to use for classification based on the measurement results. The migration learning classification method is to select useful information samples from the old data set to assist the identification and classification of the new data set samples. In the article,the distribution of feature attributes of the new and old data sets is different. Experiment shows that compared with the existing methods,the proposed method can better implement typical network video stream classification,showing better classification performance and generalization ability ( that is,the overall accuracy of the model has a smaller standard deviation) .

相似文献/References:

[1]李 勇,刘战东,张海军.跨项目软件缺陷预测方法研究综述[J].计算机技术与发展,2020,30(03):98.[doi:10. 3969 / j. issn. 1673-629X. 2020. 03. 019]
 LI Yong,LIU Zhan-dong,ZHANG Hai-jun.Review on Cross-project Software Defects Prediction Methods[J].,2020,30(12):98.[doi:10. 3969 / j. issn. 1673-629X. 2020. 03. 019]
[2]武苏雯,赵慧杰,刘 鑫,等.基于迁移学习的图像分类在诗词中的应用研究[J].计算机技术与发展,2021,31(07):215.[doi:10. 3969 / j. issn. 1673-629X. 2021. 07. 036]
 WU Su-wen,ZHAO Hui-jie,LIU Xin,et al.Research on Application of Image Classification Based onTransfer Learning in Poetry[J].,2021,31(12):215.[doi:10. 3969 / j. issn. 1673-629X. 2021. 07. 036]
[3]娄丰鹏,吴迪,荆晓远,等.增加度量元的迁移学习跨项目软件缺陷预测[J].计算机技术与发展,2018,28(07):103.[doi:10.3969/ j. issn.1673-629X.2018.07.022]
 LOU Feng-peng,WU Di,JING Xiao-yuan,et al.Cross-project Software Defect Prediction Based on Transfer Learning with Metrics[J].,2018,28(12):103.[doi:10.3969/ j. issn.1673-629X.2018.07.022]
[4]刘宇廷,倪颖杰.融合知识迁移学习的微博社团检测模型构建[J].计算机技术与发展,2018,28(09):11.[doi:10.3969/j.issn.1673-629X.2018.09.003]
 LIU Yu-ting,NI Ying-jie.Construction of Weibo Community Detection Model with Knowledge Transfer Learning[J].,2018,28(12):11.[doi:10.3969/j.issn.1673-629X.2018.09.003]
[5]王泽泓,刘厚泉.基于迁移学习与自适应特征融合的建筑物识别[J].计算机技术与发展,2019,29(12):40.[doi:10. 3969 / j. issn. 1673-629X. 2019. 12. 007]
 WANG Ze-hong,LIU Hou-quan.Building Recognition Based on Transfer Learning and Adaptive Feature Fusion[J].,2019,29(12):40.[doi:10. 3969 / j. issn. 1673-629X. 2019. 12. 007]
[6]易 未,郑沫利,赵艳轲,等.基于小样本 SVR 的迁移学习及其应用[J].计算机技术与发展,2020,30(02):47.[doi:10. 3969 / j. issn. 1673-629X. 2020. 02. 010]
 YI Wei,ZHENG Mo-li,ZHAO Yan-ke,et al.Transfer Learning Based on Support Vector Regression Model for Small Sample Data and Its Applications[J].,2020,30(12):47.[doi:10. 3969 / j. issn. 1673-629X. 2020. 02. 010]
[7]王新美,丁爱玲,雷梦宁,等.基于 CNN 和 SVM 融合的交通标志识别[J].计算机技术与发展,2020,30(06):7.[doi:10. 3969 / j. issn. 1673-629X. 2020. 06. 002]
 WANG Xin-mei,DING Ai-ling,LEI Meng-ning,et al.Traffic Sign Recognition Based on Combination of CNN and SVM[J].,2020,30(12):7.[doi:10. 3969 / j. issn. 1673-629X. 2020. 06. 002]
[8]龚 安,井晓萌.多卷积神经网络模型融合的农作物病害图像识别[J].计算机技术与发展,2020,30(08):134.[doi:10. 3969 / j. issn. 1673-629X. 2020. 08. 023]
 GONG An,JING Xiao-meng.Image Recognition of Crop Diseases Based on Multi-convolution Neural Network Model Ensemble[J].,2020,30(12):134.[doi:10. 3969 / j. issn. 1673-629X. 2020. 08. 023]
[9]龚 安,郭文婷.基于卷积神经网络的皮肤癌识别方法[J].计算机技术与发展,2020,30(10):167.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 030]
 GONG An,GUO Wen-ting.Skin Cancer Image Classification Method Based on Convolutional Neural Network[J].,2020,30(12):167.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 030]
[10]张泽宇,郭 斌,张太红*.基于 DCNN 的马匹图像分割算法研究[J].计算机技术与发展,2020,30(10):210.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 037]
 ZHANG Ze-yu,GUO Bin,ZHANG Tai-hong.Research on Horse Image Segmentation Algorithm Based on DCNN[J].,2020,30(12):210.[doi:10. 3969 / j. issn. 1673-629X. 2020. 10. 037]

更新日期/Last Update: 2021-12-10