[1]郑文静,李雷. 基于聚类核的半监督情感分类算法研究[J].计算机技术与发展,2016,26(12):87-91.
 ZHENG Wen-jing,LI Lei. Research on Semi-supervised Sentiment Classification Based on Cluster Kernel[J].,2016,26(12):87-91.
点击复制

 基于聚类核的半监督情感分类算法研究()
分享到:

《计算机技术与发展》[ISSN:1006-6977/CN:61-1281/TN]

卷:
26
期数:
2016年12期
页码:
87-91
栏目:
智能、算法、系统工程
出版日期:
2016-12-10

文章信息/Info

Title:
 Research on Semi-supervised Sentiment Classification Based on Cluster Kernel
文章编号:
1673-629X(2016)12-0087-05
作者:
 郑文静李雷
 南京邮电大学 理学院
Author(s):
 ZHENG Wen-jingLI Lei
关键词:
 半监督学习聚类核情感分类
Keywords:
 semi-supervised learningclustering kernelgraphsentiment classification
分类号:
TP301.6
文献标志码:
A
摘要:
 在互联网快速发展的今天,人类已经进入“大数据”时代,其中文本数据作为人类知识的载体,对于人类的进步与发展意义重大。如何运用大量未标记样本来提升文本情感分类的精度,也变得愈发重要。将半监督学习中的聚类核算法应用到情感分类问题中,给出基于聚类核的半监督情感分类算法。在标记样本和未标记样本上,建立加权无向图,求解聚类核,然后将该核函数用于SVM的情感分类器的训练上,完成情感分类工作。该方法直接将未标记样本所蕴含的信息融合到核中,不需要建立多个分类器,有效利用了未标记样本。实验结果表明,CKSVM算法在分类精度上明显优于基于Self-learning SVM和Co-training SVM的半监督情感分类算法,且在不同数据集上都有较好的适应性。
Abstract:
 Tn the rapid development of the Internet today, mankind has entered the era of big data. Text data as the carrier of human knowledge,is of great significance for human progress and development. So the usage of a large number of unlabeled samples to improve the accuracy of sentiment classification,has become more and more important. The kernel clustering method in semi supervised learning is applied to the emotion classification problem,and a semi supervised sentiment classification algorithm based on kernel clustering is pro-posed. A weighted undirected graph is built according to the labeled samples and unlabeled samples,solving the clustering kernel,and then the kernel function is used for the training of classifier SVM. This method directly uses the information contained by unlabeled samples in-to the kernel,no need to set up multiple classifiers,effective useage of the unlabeled samples. Experimental results show that the CKSVM is better than that based on Self-learning SVM and Co-training SVM in classification accuracy,with better adaptability on different data sets.

相似文献/References:

[1]陈锦禾 沈洁.基于信息熵的主动学习半监督分类研究[J].计算机技术与发展,2010,(02):110.
 CHEN Jin-he,SHEN Jie.Active Learning Based on Information Entropy for Semi- supervised Classification[J].,2010,(12):110.
[2]陈东泳 钟尚平.基于半监督学习的JPEG图像通用隐写检测方法[J].计算机技术与发展,2009,(02):169.
 CHEN Dong-yong,ZHONG Shang-ping.A Universal Steganalysis Method for JPEG Images Based on Semi - supervised Learning[J].,2009,(12):169.
[3]徐庆伶 汪西莉.一种基于支持向量机的半监督分类方法[J].计算机技术与发展,2010,(10):115.
 XU Qing-ling,WANG Xi-li.A Novel Semi-Supervised Classification Method Based on SVM[J].,2010,(12):115.
[4]戴林 姜梅.基于半监督学习的入侵检测系统[J].计算机技术与发展,2011,(01):162.
 DAI Lin JIANG Mei.Semi-Supervised Learning-Based Network Intrusion Detection System[J].,2011,(12):162.
[5]李亚娥,汪西莉.一种自适应的半监督图像分类算法[J].计算机技术与发展,2013,(02):112.
 LI Ya-e,WANG Xi-li.An Adaptive Semi-supervised Image Classification Algorithm[J].,2013,(12):112.
[6]张志宏,吴庆波,邵立松,等.基于飞腾平台TOE协议栈的设计与实现[J].计算机技术与发展,2014,24(07):1.
 ZHANG Zhi-hong,WU Qing-bo,SHAO Li-song,et al. Design and Implementation of TCP/IP Offload Engine Protocol Stack Based on FT Platform[J].,2014,24(12):1.
[7]梁文快,李毅. 改进的基因表达算法对航班优化排序问题研究[J].计算机技术与发展,2014,24(07):5.
 LIANG Wen-kuai,LI Yi. Research on Optimization of Flight Scheduling Problem Based on Improved Gene Expression Algorithm[J].,2014,24(12):5.
[8]黄静,王枫,谢志新,等. EAST文档管理系统的设计与实现[J].计算机技术与发展,2014,24(07):13.
 HUANG Jing,WANG Feng,XIE Zhi-xin,et al. Design and Implementation of EAST Document Management System[J].,2014,24(12):13.
[9]侯善江[],张代远[][][]. 基于样条权函数神经网络P2P流量识别方法[J].计算机技术与发展,2014,24(07):21.
 HOU Shan-jiang[],ZHANG Dai-yuan[][][]. P2P Traffic Identification Based on Spline Weight Function Neural Network[J].,2014,24(12):21.
[10]李璨,耿国华,李康,等. 一种基于三维模型的文物碎片线图生成方法[J].计算机技术与发展,2014,24(07):25.
 LI Can,GENG Guo-hua,LI Kang,et al. A Method of Obtaining Cultural Debris’ s Line Chart Based on Three-dimensional Model[J].,2014,24(12):25.
[11]朱乔亚,陈可佳,方彪. 采用位置信息的半监督链接预测方法[J].计算机技术与发展,2015,25(07):63.
 ZHU Qiao-ya,CHEN Ke-jia,FANG Biao. A Semi-supervised Link Prediction Method Using Place Features[J].,2015,25(12):63.
[12]张志武[],荆晓远[][],吴飞[]. 基于非负稀疏图的协同训练软件缺陷预测[J].计算机技术与发展,2017,27(07):38.
 ZHANG Zhi-wu[],JING Xiao-yuan[] [],WU Fei[]. Defect Prediction of Co-training Software with Non-negative Sparse Graph[J].,2017,27(12):38.

更新日期/Last Update: 2017-02-03