[1]贾志强,李 涛*,乐金祥.基于集成学习算法的消费行为预测[J].计算机技术与发展,2022,32(05):141-146.[doi:10. 3969 / j. issn. 1673-629X. 2022. 05. 024]
 JIA Zhi-qiang,LI Tao*,YUE Jin-xiang.Consumer Behavior Prediction Based on Ensemble Learning Algorithm[J].,2022,32(05):141-146.[doi:10. 3969 / j. issn. 1673-629X. 2022. 05. 024]
点击复制

基于集成学习算法的消费行为预测()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
32
期数:
2022年05期
页码:
141-146
栏目:
应用前沿与综合
出版日期:
2022-05-10

文章信息/Info

Title:
Consumer Behavior Prediction Based on Ensemble Learning Algorithm
文章编号:
1673-629X(2022)05-0141-06
作者:
贾志强李 涛* 乐金祥
武汉科技大学 计算机科学与技术学院,湖北 武汉 430065
Author(s):
JIA Zhi-qiangLI Tao* YUE Jin-xiang
School of Computer Science and Technology,Wuhan University of Science and Technology,Wuhan 430065,China
关键词:
行为预测特征工程算法建模stacking 策略集成学习
Keywords:
behavior predictionfeature engineeringalgorithm modelingstacking strategyensemble learning
分类号:
TP391
DOI:
10. 3969 / j. issn. 1673-629X. 2022. 05. 024
摘要:
消费行为预测在营销活动中具有重要的价值,其预测效果主要取决于特征工程与算法建模。 通过特征提取与新特征发现,提出定长与变长滑动窗口相结合的特征提取方法和基于先验知识与矩阵分解的特征交叉方法。 特征提取方法考虑样本不平衡和用户消费习惯,提取更多的样本数据并给特征加上时间属性,而特征交叉方法考虑商品与用户之间隐含的关联关系,提取有关联的新特征。 对于单一模型预测效果较差的问题, 采用 stacking 策略构建集成学习模型, 以XGBoost、随机森林和梯度提升决策树作为初级学习器对特征进行变换,以逻辑回归作为元学习器对用户消费行为进行预测。 实验结果表明,该特征工程方法在多个模型算法中均能明显提高精准率,该集成学习模型预测效果要比单个模型更好。
Abstract:
The prediction of consumption behavior is of great value in marketing activities,and its prediction effect mainly depends on feature engineering and algorithm modeling. Through feature extraction and new feature discovery, the feature extraction method combining fixed length and variable length sliding window and feature intersection method based on prior knowledge and matrix decomposition are proposed. Feature extraction method takes sample imbalance and consumer habits into account,extracts more sample data and adds time attribute to features. Feature intersection method takes the implicit relationship between goods and users into account to extract new features with relevance. For the first? mock exam,the stacking model is used to build the ensemble learning model. The XGBoost,random forest and gradient decision tree are used as primary learning devices to transform the features,and logistic regression is used as a meta learning device to predict user consumption behavior. The experimental results show that the feature engineering method can improve the accuracy of the algorithm in many models,and the prediction effect of the integrated learning model is better than that of a single model.

相似文献/References:

[1]郭博 程家兴 张大强.非通讯多Agent协作在RoboCup中的应用[J].计算机技术与发展,2006,(04):90.
 GUO Bo,CHENG Jia-xing,ZHANG Da-qiang.Non- communicative Multi- Agent Collaboration in RoboCup[J].,2006,(05):90.
[2]陈春玲,陈红,余瀚.改进的BP算法对移动用户行为预测的研究[J].计算机技术与发展,2018,28(07):178.[doi:10.3969/ j. issn.1673-629X.2018.07.038]
 CHEN Chun-ling,CHEN Hong,YU Han.Research on Mobile User Behavior Prediction Based on Improved BP Algorithm[J].,2018,28(05):178.[doi:10.3969/ j. issn.1673-629X.2018.07.038]
[3]闫 坤,沈苏彬.一种基于智能家居的用户行为预测方法[J].计算机技术与发展,2020,30(01):19.[doi:10. 3969 / j. issn. 1673-629X. 2020. 01. 004]
 YAN Kun,SHEN Su-bin.A User Behavior Prediction Method Based on Smart Home[J].,2020,30(05):19.[doi:10. 3969 / j. issn. 1673-629X. 2020. 01. 004]
[4]张银杰,揣锦华,翟晓惠.基于集成学习算法的恶意软件感染二分类预测[J].计算机技术与发展,2021,31(05):15.[doi:10. 3969 / j. issn. 1673-629X. 2021. 05. 003]
 ,BinaryPredictionofMalwareInfectionBasedonIntegratedLearningAlgorithm[J].,2021,31(05):15.[doi:10. 3969 / j. issn. 1673-629X. 2021. 05. 003]
[5]曹茂俊,崔欣锋.基于一维卷积神经网络的地层智能识别方法[J].计算机技术与发展,2023,33(09):133.[doi:10. 3969 / j. issn. 1673-629X. 2023. 09. 020]
 CAO Mao-jun,CUI Xin-feng.Intelligent Stratigraphic Recognition Method Based on 1DCNN[J].,2023,33(05):133.[doi:10. 3969 / j. issn. 1673-629X. 2023. 09. 020]

更新日期/Last Update: 2022-05-10