Sentiment analysis of Chinese Micro-blog based on semi-supervised learning

doi:10.6040/j.issn.1671-9352.3.2014.136

Abstract

Sentiment analysis of Chinese Micro-blogs usually refers to classifying Micro-blogs as positive, negative, or neutral. Micro-blogs are fragmented, and their sentiment classes are imbalanced. Building on the reserved self-training method we presented previously, we extracted text features suited to Micro-blog sentiment analysis and proposed a method for setting a training degree threshold that optimizes the iteration termination condition of reserved self-training. These methods retain the effective handling of imbalanced class distributions that reserved self-training provides, and they also prevent overtraining during the training process. The evaluation results in COAE2014 showed the effectiveness of these methods.

Key words: training degree threshold, sentiment analysis, reserved self-training

CLC Number: TP391

ZHU Xi, DONG Xi-shuang, GUAN Yi, LIU Zhi-guang. Sentiment analysis of Chinese Micro-blog based on semi-supervised learning[J]. JOURNAL OF SHANDONG UNIVERSITY (NATURAL SCIENCE), 2014, 49(11): 37-42.
