JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2015, Vol. 50 ›› Issue (07): 38-44.doi: 10.6040/j.issn.1671-9352.3.2014.106

Previous Articles     Next Articles

Opinion target extraction with active-learning and automatic annotation

ZHU Zhu, LI Shou-shan, DAI Min, ZHOU Guo-dong   

  1. Natural Language Processing Lab, Soochow University, Suzhou 215006, Jiangsu, China
  • Received:2015-03-03 Online:2015-07-20 Published:2015-07-31

Abstract: An opinion target extraction method combined active-learning and automatic annotation is introduced. Firstly, the results of automatically annotation with the confidence are obtained by using a few of labeled corpus to train the classifier to test the unlabeled samples: secondly, the samples of low confidence is annotated by calculating the confidence of every sample: finally, the words of low confidence in the selected samples is annotated manually, while the others are adopted the results of automatic annotation. The empirical results demonstrate that the proposed method effectively reduces the annotation cost and achieves good performance on opinion target extraction.

Key words: opinion target extraction, active-learning, automatic annotation, sentiment analysis

CLC Number: 

  • TP391
[1] PANG Bo, LEE L. Opinion mining and sentiment analysis[J]. Foundations and Trends in Information Retrieval, 2008, 2(1-2):1-135.
[2] PANG Bo, LEE L, VAITHYANATHAN S. Thumbs up? Sentiment classification using machine learning techniques[C]//Proceedings of EMNLP-02. Stroudsburg: Association for Computational Linguistics, 2002:79-86.
[3] 赵妍妍,秦兵,刘挺.文本情感分析[J]. 软件学报, 2010, 21(8):1834-1848. ZHAO Yanyan, QIN Bing, LIU Ting. Sentiment analysis[J]. Journal of Software, 2010, 21(8):1834-1848.
[4] LEWIS D, GALE W. Training text classifiers by uncertainty sampling[C]//Proceedings of SIGIR-94.London:Springer-verlag, 1994: 3-12.
[5] HU Minqing, LIU B. Mining opinion features in customer reviews[C]//Proceedings of AAAI-2004. California: AAAI Press, 2004: 755-760.
[6] LI Binyang, ZHOU L, FENG S, et al. A unified graph model for sentence-based opinion retrieval[C]// Proceedings of ACL.Stroudsburg:Association for Computational Linguistics, 2010:1367-1375.
[7] ZHUANG Li, JING F, ZHU X. Movie review mining and summarization[C]//Proceedings of CIKM-2006. New York: ACM, 2006: 43-50.
[8] JAKOB N. GUREVYCH I. Extracting opinion targets in a single and cross-domain setting with conditional random fields[C]//Proceedings of EMNLP-2010.Stroudsburg: Association for Computational Linguistics, 2010: 1035-1045.
[9] 王荣洋,鞠久鹏,李寿山,等. 基于CRFs的评价对象抽取特征研究[J]. 中文信息学报,2012,26(2):56-61. WANG Rongyang, JU Jiupeng, LI Shoushan, et al. Feature engineering for CRFs based opinion target extraction[J]. Journal of Chinese Information Processing, 2012, 26(2):56-61.
[10] 龙军, 殷建平, 祝恩, 等. 主动学习研究综述[J]. 计算机研究与发展, 2008, 45(S1):300-304. LONG Jun, YIN Jianping, ZHU En, et al. A survey of active learning[J]. Journal of Computer Research and Development, 2008, 45(Suppl):300-304.
[11] FREUND Y, SEUNG H S, SHAMIR E, et al. Selective sampling using the query by committee algorithm[J]. Machine learning, 1997, 28(2-3): 133-168.
[12] LEWIS D D, GALE W A. A sequential algorithm for training text classifiers[C]//Proceedings of the 17th ACM Int'l Conf on Research and Development in Information Retrieval. New York: Springer-verlag, 1994: 3-12.
[13] MUSLEA I, MINTON S, KNOBLOCK C. Active learning with multiple view[J]. Journal of Artificial Intelligence Research, 2006,27: 203-233
[14] MCCALLUM A, NIGAM K. Employing EM in pool-based active learning for text classification[C]//Proceedings of the 15th Int'l Conf on Machine Learning. New York: ACM, 1998: 500-512.
[15] MUSLEA I, MINTON S, KNOBLOCK C A. Active learning with multiple views[J]. Journal of Artificial Intelligence Research, 2006, 27(1):203-233.
[16] 宗成庆. 统计自然语言处理[M]. 北京:清华大学出版社,2008:1-475. ZONG Chengqing. Statistics natural language processing[M]. Beijing: Tsinghua University Press, 2008:1-475
[1] YU Chuan-ming, FENG Bo-lin, TIAN Xin, AN Lu. Deep representative learning based sentiment analysis in the cross-lingual environment [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 13-23.
[2] CHEN Xin, XUE Yun, LU Xin, LI Wan-li, ZHAO Hong-ya, HU Xiao-hui. Text feature extraction method for sentiment analysis based on order-preserving submatrix and frequent sequential pattern mining [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 36-45.
[3] HE Yan-xiang, LIU Jian-bo, SUN Song-tao, WEN Wei-dong. Product reviews sentiment classification in Micro-blog based on cascaded conditional random field [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(11): 67-73.
[4] ZHOU Wen, ZHANG Shu-qing, OUYANG Chun-ping, LIU Zhi-ming, YANG Xiao-hua. Topic sentiment analysis of Chinese news based on emotional dependency tuple [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(12): 1-6.
[5] YANG Jia-neng, YANG Ai-min, ZHOU Yong-mei. Sentiment classification method of Chinese Micro-blog based on semantic analysis [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 14-21.
[6] ZHU Xi, DONG Xi-shuang, GUAN Yi, LIU Zhi-guang. Sentiment analysis of Chinese Micro-blog based on semi-supervised learning [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 37-42.
[7] LIU Ming, ZAN Hong-ying, YUAN Hui-bin. Key sentiment sentence prediction using SVM and RNN [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 68-73.
[8] SUN Song-tao, HE Yan-xiang, CAI Rui, LI Fei, HE Fei-yan. Comparative study of methods for Micro-blog sentiment evaluation tasks [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 43-50.
[9] ZHANG Cheng-gong 1, 2, LIU Pei-yu1, 2*, ZHU Zhen-fang1,2, FANG Ming1,2. A sentiment analysis method based on a polarity lexicon [J]. J4, 2012, 47(3): 47-50.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!