[1]刘高军,王小宾.基于 CNN+LSTMAttention 的营销新闻文本分类[J].计算机技术与发展,2020,30(11):59-63.[doi:10. 3969 / j. issn. 1673-629X. 2020. 11. 011]
 LIU Gao-jun,WANG Xiao-bin.Marketing News Text Classification Incorporating CNN+LSTMAttention[J].,2020,30(11):59-63.[doi:10. 3969 / j. issn. 1673-629X. 2020. 11. 011]
点击复制

基于 CNN+LSTMAttention 的营销新闻文本分类()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
30
期数:
2020年11期
页码:
59-63
栏目:
智能、算法、系统工程
出版日期:
2020-11-10

文章信息/Info

Title:
Marketing News Text Classification Incorporating CNN+LSTMAttention
文章编号:
1673-629X(2020)11-0059-05
作者:
刘高军王小宾
北方工业大学 信息学院,北京 100144
Author(s):
LIU Gao-junWANG Xiao-bin
School of Information,North China University of Technology,Beijing 100144,China
关键词:
营销新闻文本分类卷积神经网络注意力机制长短期记忆神经网络
Keywords:
marketing newstext classificationconvolutional neural networkattention mechanismlong short-term memory neural network
分类号:
TP301
DOI:
10. 3969 / j. issn. 1673-629X. 2020. 11. 011
摘要:
针对营销新闻分类识别任务,传统方法采用的长短期记忆神经网络 LSTM 和卷积神经网络 CNN 存在分类识别率不高的问题,因此提出一种融合 CNN 和引入注意力机制的长短时记忆(LSTM Attention)来提高营销新闻识别分类能力。 首先通过 word2vec 获取营销新闻文本词向量形成的矩阵,分别输入到传统机器学习分类模型中,在此基础上使用模型融合技术融合单一模型中分类效果较好的模型,最后得到融合模型和单一模型的分类结果并进行对比。 实验结果显示,在基础模型 LSTM 引入了注意力机制之后准确率,召回率和 F1 值分别达到 67.01%,66.07%,0.680,而 CNN 和 LSTMAttention进行模型融合之后的准确率,召回率和 F1 值进一步达到了 68.29% ,71.27% ,0.692。 表明基于 CNN 和 LSTMAttention 融合之后的神经网络模型相较于单一模型,最终分类效果更好,可以达到提高营销新闻文本分类识别效果的目的。
Abstract:
For the classification and recognition task of marketing news,the long-term and short-term memory neural network LSTM and convolutional neural network CNN used by traditional methods have a low classification recognition rate. Therefore,we propose a long and short-term memory (LSTMAttention) that combines CNN and introduces attention mechanism to improve the ability to identify and classify marketing news. Firstly, the matrix formed by the word vector of the marketing news text is obtained by word2vec and input into the traditional machine learning classification model. Based on this,model fusion technology is used to fuse the model with better classification in a single model, and finally the classification results of fusion model and single model are obtained and compared. The experiment shows that after introduction of the attention mechanism in the basic model of LSTM,the accuracy,recall and F1 values reach 67.01%,66.07% and 0.680 respectively,and the accuracy, recall and F1 value after model fusion of CNN and LSTMAttention further reach 68.29%,71.27% and 0.692. It is shown that the neural network model based on the fusion of CNN and LSTMAttention has a better final classification effect than a single model,and can achieve the purpose of improving the classification and recognition effect of marketing news text.

相似文献/References:

[1]田昕辉 李成基.带有短语切分的中文文本分类方法[J].计算机技术与发展,2010,(01):5.
 TIAN Xin-hui,LEE Sung-kee.Phrase Segmentation for Chinese Text Classification[J].,2010,(11):5.
[2]姜鹤 陈丽亚.SVM文本分类中一种新的特征提取方法[J].计算机技术与发展,2010,(03):17.
 JIANG He,CHEN Li-ya.A New Feature Selection Method in SVM Text Categorization[J].,2010,(11):17.
[3]周瑛 张铃.有限混合模型在文本分类中的应用研究[J].计算机技术与发展,2010,(06):18.
 ZHOU Ying,ZHANG Ling.Study of Application of Finite Mixture Model in Text Classification[J].,2010,(11):18.
[4]陈锦禾 范新 沈闻 沈洁.基于情感词识别的BBS情感分类研究[J].计算机技术与发展,2009,(07):120.
 CHEN Jin-he,FAN Xin,SHEN Wen,et al.Research on Sentiment Classification of BBS Reviews Based on Identifying Words with Polarity[J].,2009,(11):120.
[5]刘锋 唐佳 仲红.一种基于RBF神经网络的XML文本分类方法[J].计算机技术与发展,2009,(08):34.
 LIU Feng,TANG Jia,ZHONG Hong.A Text Categorization Method Based on RBF Neural Network[J].,2009,(11):34.
[6]晋幼丽 周明全 王学松.SVM和K-means结合的文本分类方法研究[J].计算机技术与发展,2009,(11):35.
 JIN You-li,ZHOU Ming-quan,WANG Xue-song.Research on Text Classification Method of SVM and K - means[J].,2009,(11):35.
[7]张燕平 徐庆鹏 苏守宝 邢猛.一种基于贪婪覆盖的文本分类方法[J].计算机技术与发展,2009,(01):74.
 ZHANG Yan-ping,XU Qing-peng,SU Shou-bao,et al.A Text Categorization Method Based on Greedy Cover[J].,2009,(11):74.
[8]陈素萍 谢丽聪.一种文本特征选择方法的研究[J].计算机技术与发展,2009,(02):112.
 CHEN Su-ping,XIE Li-cong.Research on Document Feature Selection[J].,2009,(11):112.
[9]于水英 丁华福 付志超.基于遗传算法和模糊聚类的文本分类研究[J].计算机技术与发展,2009,(04):131.
 YU Shui-ying,DING Hua-fu,FU Zhi-chao.Study on Text Categorization Based on Genetic Algorithm and Fuzzy Clustering[J].,2009,(11):131.
[10]许幸 张启蕊.基于KNN算法的医药信息文本分类系统的研究[J].计算机技术与发展,2009,(04):206.
 XU Xing,ZHANG Qi-rui.Research of Medical Information Text Categorization Based on KNN Algorithm[J].,2009,(11):206.

更新日期/Last Update: 2020-11-10