JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2015, Vol. 50 ›› Issue (01): 20-25.doi: 10.6040/j.issn.1671-9352.3.2014.024

Previous Articles     Next Articles

Analysis on new word detection and sentiment orientation in Micro-blog

TANG Bo, CHEN Guang, WANG Xing-ya, WANG Fei, CHEN Xiao-hui   

  1. School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2014-09-19 Revised:2014-11-25 Online:2015-01-20 Published:2015-01-24

Abstract: Due to popularity and flexibility of social media, more increasingly created words were used to express people's feelings and attitudes. New word detection and sentiment orientation has become a hot issue in Micro-blog analysis. The methods and techniques used in Task 3 of COAE 2014 were introduced. Generalized suffix tree was employed in string extraction, which was determined as new words with metrics like left-right-flexibility of words etc. Then, with pattern-based and statistic-based methods combined with multiple lexicons, sentiment orientation of new words was decided. Search engine was also used to optimize result as a supplement from semantic perspective. Results have shown our methods effective in new word detection and sentiment orientation analysis.

Key words: Micro-blog, generalized suffix tree, sentiment orientation analysis, new word detection

CLC Number: 

  • TP391
[1] 黄轩, 李熔烽. 博客语料的新词发现方法[J]. 现代电子技术, 2013,36(2):144-149. HUANG Xuan, LI Rongfeng. Discovery method of new words in blog contents[J]. Modern Electronics Technique, 2013, 36(2):144-149.
[2] 郑家恒,李文花.基于构词法的网络新词自动识别初探[J].山西大学学报:自然科学版,2002,25(2):115-119. ZHENG Jiahuan, LI Wenhua. A study on automatic identification for internet new words according to word-building rule[J]. Journal of Shanxi University: Natural Science Edition, 2002, 25(2):115-119.
[3] LIU Tao, LIU Bingquan, XU Zhiming, et al. Automatic domain-specific term extraction and its application in text classification[J]. Acta Electronica Sinica, 2007, 35(2):328-332.
[4] 林自芳,蒋秀凤.基于词内部模式的新词识别[J].计算机与现代化,2010(11):56-58. LIN Zifang, JIANG Xiufeng. A new method for Chinese new word identification based on inner pattern of word[J]. Computer and Modernization, 2010(11):56-58.
[5] 苏其龙. 微博新词发现研究[D]. 哈尔滨:哈尔滨工业大学, 2013. SU Qilong. Research on new word detection from Microblog data[D]. Harbin:Harbin Institute of Technology, 2013.
[6] UKKONEN E. On-line construction of suffix trees[J]. Algorithmica, 1995, 14(3):249-260.
[7] 徐硕, 乔晓东, 朱礼军, 等. 广义后缀树及其在汉语科技词系统中的应用研究[J]. 数字图书馆论坛, 2013(004):37-41. XU Shuo, QIAO Xiaodong, ZHU Lijun, et al. Generalized suffix trees with its applications in Chinese scientific technical vocabulary system[J]. Digital Library Forum, 2013(004): 37-41.
[8] 赵妍妍, 秦兵, 刘挺. 文本情感分析[J]. 软件学报, 2010, 21(8):1834-1848. ZHAO Yanyan, QIN Bing, LIU Ting. Sentiment analysis[J]. Journal of Software, 2010, 21(8):1834-1848.
[9] RAO D, RAVICHANDRAN D. Semi-supervised polarity lexicon induction[C]// Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics. Stroudsburg: Association for Computational Linguistics, 2009: 675-682.
[10] 李钝, 乔保军, 曹元大, 等. 基于语义分析的词汇倾向识别研究[J]. 模式识别与人工智能, 2008, 21(4):482-487. LI Dun, QIAO Baojun, CAO Yuanda, et al. Word orientation recognition based on semantic analysis[J]. Pattern Recognition and Artificial Intelligence, 2008, 21(4):482-487.
[11] 田久乐, 赵蔚. 基于同义词词林的词语相似度计算方法[J]. 吉林大学学报: 信息科学版, 2010(006):602-608. TIAN Jiule, ZHAO Wei. Words similarity algorithm based on Tongyici Cilin in semantic web adaptive learning system[J]. Journal of Jilin University: Information Science Edition, 2010(006):602-608.
[12] TURNEY P D. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews[C]// Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Somerset: Association for Computational Linguistics, 2002: 417-424.
[13] 宋继华, 杨尔弘, 王强军. 中文信息处理教程[M]. 北京:高等教育出版社, 2011: 74-75. SONG Jihua, YANG Erhong, WANG Qiangjun. Chinese information processing tutorial[M]. Beijing: Higher Education Press, 2011: 74-75.
[14] 王立希, 王建东. 基于数据挖掘的新词发现[J].计算机应用研究, 2006,2(12):195-197. WANG Lixi, WANG Jiandong. Approach for lexicon updating based on data mining[J]. Application Research of Computers, 2006, 2(12):195-197.
[1] ZHANG Zhong-jun, ZHANG Wen-juan, YU Lai-hang, LI Run-chuan. A community division method based on network distance and content similarity in micro-blog social network [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(7): 97-103.
[2] HU Mo-zhi, YAO Tian-fang. Recognition of Chinese Micro-blog sentiment polarity and extraction of opinion target [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(7): 81-89.
[3] SUN He, LI Shu-qin, L(¨overU)Xue-qiang, LIU Ke-hui. Recognition of geographical entity in city complaints of Micro-blog [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(3): 77-85.
[4] HE Yan-xiang, LIU Jian-bo, SUN Song-tao, WEN Wei-dong. Product reviews sentiment classification in Micro-blog based on cascaded conditional random field [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(11): 67-73.
[5] WANG Li-ren, YU Zheng-tao, WANG Yan-bing, GAO Sheng-xiang, LI Xian-hui. Micro-blogging topic mining based on supervised LDA user interest model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(09): 36-41.
[6] ZAN Hong-ying, WU Yong-gang, JIA Yu-xiang, NIU Gui-ling. Chinese Micro-blog named entity linking based on multisource knowledge [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(07): 9-16.
[7] YANG Jia-neng, YANG Ai-min, ZHOU Yong-mei. Sentiment classification method of Chinese Micro-blog based on semantic analysis [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 14-21.
[8] SUN Song-tao, HE Yan-xiang, CAI Rui, LI Fei, HE Fei-yan. Comparative study of methods for Micro-blog sentiment evaluation tasks [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 43-50.
[9] LIU Pei-yu, ZHANG Yan-hui, ZHU Zhen-fang, XUN Jing. Micro-blog orientation analysis based on emotion symbol [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 8-13.
[10] KUANG Chong, LIU Zhi-yuan, SUN Mao-song. Personalized ranking of Micro-blogging forwarders [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(11): 31-36.
[11] TIAN Hai-long, ZHU Yan-hui, LIANG Tao, MA Jin, LIU Jing. Research on identificating Chinese micro-blog opinion sentence based on three-way decisions [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(08): 58-65.
[12] ZHENG Jian-xing, ZHANG Bo-feng*, YUE Xiao-dong, CHENG Ze-yu. Research on themes recommendation in microblogging
scenario based on neighbor-user profile
[J]. J4, 2013, 48(11): 59-65.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!