[1]吴克寿 曾志强.非平衡数据集分类研究[J].计算机技术与发展,2011,(09):39-42.
 WU Ke-shou,ZENG Zhi-qiang.Research on Imbalanced Dataset Learning Method[J].,2011,(09):39-42.
点击复制

非平衡数据集分类研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
期数:
2011年09期
页码:
39-42
栏目:
智能、算法、系统工程
出版日期:
1900-01-01

文章信息/Info

Title:
Research on Imbalanced Dataset Learning Method
文章编号:
1673-629X(2011)09-0039-04
作者:
吴克寿 曾志强
厦门理工学院计算机科学与技术系
Author(s):
WU Ke-shouZENG Zhi-qiang
Dept.of Computer Science and Technology,Xiamen University of Technology
关键词:
非平衡数据集上采样核学习
Keywords:
imbalanced dataset over-sample kernel learning
分类号:
TP18
文献标志码:
A
摘要:
现实世界中存在着非平衡数据集,即数据集中的一类样本数量远大于另一类。而少数类样本的识别通常是人们首要关心的,将少数类样本误分为多数类要比将多数类样本误分为少数类付出更高的代价。传统的机器学习算法可能会产生偏向多数类的结果,因而对于少数类而言,预测的效果会很差。在对目前国内外非平衡数据集研究现状深入分析的基础上,针对非平衡数据集数据复杂度研究和失衡解决方法研究两个方向相对孤立及缺乏系统性的缺陷,提出了一种非平衡数据集整体解决框架,以满足日益迫切的应用需求
Abstract:
A dataset is imbalanced if the classification categories are not approximately equally represented.Often real-world datasets are predominately composed of "normal" examples with only a small percentage of "abnormal" or "interesting" examples.It is also the case that the cost of misclassifying an abnormal(interesting) example as a normal example is often much higher than the cost of the reverse error.Traditional machine learning algorithms may be biased towards the majority class,thus producing poor predictive accuracy over the minority class.Based on the deep analysis on current research about rare cases classification,proposes a learning framework to address the problem of relative isolation of research between data complexity and solution of imbalanced data,and lack of systematic defects to meet the increasingly urgent applications

相似文献/References:

[1]姚芷馨,张太红,赵昀杰.基于卷积神经网络的多模型交通场景识别研究[J].计算机技术与发展,2022,32(07):93.[doi:10. 3969 / j. issn. 1673-629X. 2022. 07. 016]
 YAO Zhi-xin,ZHANG Tai-hong,ZHAO Yun-jie.Research on Multi-model Traffic Scene Recognition Based on Convolution Neural Network[J].,2022,32(09):93.[doi:10. 3969 / j. issn. 1673-629X. 2022. 07. 016]

备注/Memo

备注/Memo:
国家自然科学基金资助项目(60903203); 福建省教育厅A类科技计划项目(JA08222)吴克寿(1975-),男,湖南长沙人,剐教授,博士,研究方向为人工智能、语义网格;曾志强,讲师,博士,研究方向为机器学习、模式识别
更新日期/Last Update: 1900-01-01