JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2014, Vol. 49 ›› Issue (11): 22-30.doi: 10.6040/j.issn.1671-9352.3.2014.074

Previous Articles     Next Articles

Micro-blog opinion analysis based on syntactic dependency and feature combination

XIA Meng-nan, DU Yong-ping, ZUO Ben-xin   

  1. College of Computer Science, Beijing University of Technology, Beijing 100124, China
  • Received:2014-08-28 Revised:2014-10-17 Online:2014-11-20 Published:2014-11-25

Abstract: Micro-blog opinion mining faces the difficulty because of the short text's conciseness. The technique of syntactic dependency relation analysis and CRFs(Conditional Random Fields) were combined to extract the candidate opinion objects. And then the dictionaries of the opinion analysis and all kinds of semantic features were used in the machine learning method to improve the performance of the opinion classification. The precision, recall and F1 values were used as the evaluation metric. The experimental results on the COAE(Chinese opinion analysis evaluation) data set verify both the validity of emotion factor extraction approach and the impact on opinion classification performance by different features. The macro and micro precisions for the opinion classification task are both 91.4%.

Key words: opinion mining, emotion factor extraction, feature selection

CLC Number: 

  • TP391
[1] 文坤梅,徐帅,李瑞轩,等. 微博及中文微博信息处理研究综述[J]. 中文信息学报, 2012,26(6):27-37. WEN Kunmei, XU Shuai, LI Ruixuan, et al. Survey of Micro-blog and Chinese Microblog information processing[J]. Journal of Chinese Information Processing, 2012, 26(6):27-37.
[2] 杜伟夫,谭松波,云晓春,等. 一种新的情感词汇语义倾向计算方法[J]. 计算机研究与发展,2009,26(10):1713-1720. DU Weifu, TAN Songbo, YUN Xiaochun, et al. A new method to compute semantic orientation[J]. Journal of Computer Research and Development, 2009, 26(10):1713-1720.
[3] 李寿山,李逸薇,黄居仁,等. 基于双语信息和标签传播算法的中文情感词典构建方法[J]. 中文信息学报,2013,27(06):75-81. LI Shoushan, LI Yiwei, HUANG Juren, et al. Construction of Chinese sentiment lexicon using bilingual information and label propagation algorithm[J]. Journal of Chinese Information Processing, 2013, 27(06):75-81.
[4] PANG Bo, LEE L, VAITHYANATHAN S. Thumbs up? Sentiment classification using machine learning techniques[C]// Proceedings of 2002 Conference on Empirical Methods in Natural Language Processing. Somerset: ACL, 2002: 79-86.
[5] PANG Bo, LEE Lilian. A sentimental education: sentiment analysis using subjectivity summarization based on minimum cuts[C]// Proceedings of the 42nd Meeting of the Association for Computational Linguistics. Philadelphia,PA,USA: Association for Computational Linguistics, 2004: 271-278.
[6] 孙艳,周学广,付伟. 基于主题情感混合模型的无监督文本情感分析[J]. 北京大学学报,2013,49(01):102-108. SUN Yan, ZHOU Xueguang, FU Wei. Unsupervised topic and sentiment unification model for sentiment analysis[J]. Acta Scientiarum Naturalium Universitatis Pekinensis, 2013, 49(01):102-108.
[7] 谢丽星,周明,孙茂松. 基于层次结构的多策略中文微博情感分析和特征抽取[J]. 中文信息学报, 2012,26(01):73-83. XIE Lixing, ZHOU Ming, SUN Maosong. Hierarchical structure based hybrid approach to sentiment analysis of Chinese Micro-blog and its feature extraction[J]. Journal of Chinese Information Processing, 2012, 26(01):73-83.
[8] 曹海涛. 基于PAD模型的中文微博情感分析研究[D]. 大连:大连理工大学计算机应用技术系,2013. CAO Haitao. Chinese Micro-blog sentiment analysis based on the PAD model[D]. Dalian: Dalian University of Technology, 2013.
[9] MEI Qiaozhu, LING Xu, MATTHEW W, et al. Topic sentiment mixture: modeling facets and opinions in weblogs[C]// Proceedings of the 16th International Conference on World Wide Web. Banff, Alberta, Canada, 2007: 171-180.
[10] 张想. 面向热点话题型微博的情感分析研究[D]. 哈尔滨:哈尔滨工业大学,2013. ZHANG Xiang. Research on sentiment analysis for hot topic Micro-blog[D]. Harbin: Harbin Institute of Technology, 2013.
[11] 张珊,于留宝,胡长军. 基于表情图片与情感词的中文微博情感分析[J]. 计算机科学, 2012,39(11):146-148. ZHANG Shan, YU Liubao, HU Changjun. Sentiment analysis of Chinese Micro-blogs based on emoticons and emotional words[J]. Computer Science, 2012, 39(11):146-148.
[12] QIU Guang, LIU Bing, BU Jianjun, et al. Expanding domain sentiment lexicon through double propagation[C]// Proceedings of the 21st Internation Joint Conference on Artifical Intelligence (IJCAI-09). Freiburg: IJCAI-INT, 2009: 1199-1204.
[13] LIU Zitao, YU Wenchao, CHEN Wei, et al. Short text feature selection for Micro-blog mining[C]// Proceedings of International Conference on Computational Intelligence and Software Engineering (CiSE 2010). Piscataway: IEEE, 2010: 1-4.
[1] HUANG Tian-yi, ZHU William. Cost-sensitive feature selection via manifold learning [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(3): 91-96.
[2] WAN Zhong-ying, WANG Ming-wen, ZUO Jia-li, WAN Jian-yi. Feature selection combined with the global and local information(GLFS) [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(5): 87-93.
[3] LI Zhao,SUN Zhan-,LI Xiao,LI Cheng,. Study on feature selection method based on information loss [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(11): 7-12.
[4] LUO Yi, LI Li, TAN Song-bo, CHENG Xue-qi. Sentiment analysis on Chinese Micro-blog corpus [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 1-7.
[5] ZHENG Yan, PANG Lin, BI Hui, LIU Wei, CHENG Gong. Feature selection algorithm based on sentiment topic model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 74-81.
[6] PAN Qing-qing, ZHOU Feng, YU Zheng-tao, GUO Jian-yi, XIAN Yan-tuan. Recognition method of Vietnamese named entity based on#br# conditional random fields [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(1): 76-79.
[7] YU Ran 1,2, LIU Chun-yang3*, JIN Xiao-long 1, WANG Yuan-zhuo 1, CHENG Xue-qi 1. Chinese spam microblog filtering based on the fusion of
multi-angle features
[J]. J4, 2013, 48(11): 53-58.
[8] YI Chao-qun, LI Jian-ping, ZHU Cheng-wen. A kind of feature selection based on classification accuracy of SVM [J]. J4, 2010, 45(7): 119-121.
[9] YANG Yu-Zhen, LIU Pei-Yu, SHU Zhen-Fang, QIU Ye. Research of an improved information gain methodusing distribution information of terms [J]. J4, 2009, 44(11): 48-51.
[10] YUAN Xiao-hang,DU Xiao-yong . iRIPPER: an improved rule-based text categorization algorithm [J]. J4, 2007, 42(11): 66-68 .
[11] YU Jun-ying,WANG Ming-wen,SHENG Jun . Class information feature selection method for text classification [J]. J4, 2006, 41(3): 144-148 .
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!