您的位置:山东大学 -> 科技期刊社 -> 《山东大学学报(理学版)》

山东大学学报(理学版) ›› 2014, Vol. 49 ›› Issue (08): 58-65.doi: 10.6040/j.issn.1671-9352.1.2014.199

• 论文 • 上一篇    下一篇

基于三支决策的中文微博观点句识别研究

田海龙, 朱艳辉, 梁韬, 马进, 刘璟   

  1. 湖南工业大学计算机与通信学院, 湖南 株洲 412008
  • 收稿日期:2014-04-27 修回日期:2014-07-08 发布日期:2014-09-24
  • 通讯作者: 朱艳辉(1968-),女,教授,研究方向为文本分类和信息检索.E-mail:swayhzhu@163.com E-mail:swayhzhu@163.com
  • 作者简介:田海龙(1990-),男,硕士研究生,研究方向为文本分类和信息检索.E-mail:tianhailongbmg@163.com
  • 基金资助:
    国家自然科学基金资助项目(61170102);国家社科基金资助项目(12BYY045)

Research on identificating Chinese micro-blog opinion sentence based on three-way decisions

TIAN Hai-long, ZHU Yan-hui, LIANG Tao, MA Jin, LIU Jing   

  1. School of Computer and Communication, Hunan University of Technology, Zhuzhou 412008, Hunan, China
  • Received:2014-04-27 Revised:2014-07-08 Published:2014-09-24

摘要: 微博观点句识别是对微博进行观点挖掘和舆情分析的基础,因此观点句识别的准确率对后续研究工作至关重要。提出了一种基于三支决策的中文微博观点句识别方法,采用支持向量机分类器,sigmoid函数计算某条微博属于观点句的概率,并将基于三支决策的中文微博观点句识别方法与传统支持向量机方法进行对比实验,实验结果表明,基于三支决策的中文微博观点句识别方法取得了很好的识别效果。

关键词: 观点句, 中文微博, 三支决策

Abstract: Micro-blog opinion sentence identification is the foundation for opinion mining and public opinion analysis.Therefore, the accuracy of identification of critical opinion sentences is very important for the follow-up research work. A Chinese micro-blog perspective sentence identification method based on three-way decisions.using support vector machine classifiers and sigmoid function to caculate the probability of a micro-blog belonging to perspective sentence,and conducting a contrast experiment between Chinese microblog perspective sentence identificaition method based on three-way decisions and traditional support vector machine,it shows that three-way decisions has accepted a good identification effect.

Key words: chinese micro-blog, three-way decisions, opinion sentence

中图分类号: 

  • TP18
[1] 中国互联网网络信息中心. 第33次中国互联网发展状况统计报告[EB/OL].2014. http://www.cnnic.net.cn/hlwfzyj/hlwxzbg/hlwtjbg/201301/P020140221376266085836.pdf. China Internet Network Information Center. The 33th China Internet Development StatisticsReport [EB/OL].2014.http://www.cnnic.net.cn/hlwfzyj/hlwxzbg/hlwtjbg/201301/P020140221376266085836.pdf.
[2] YAO Yiyu. An outline of a theory of three-way decisions[C]. Proceedings of the 8th International RSCTC Conference. Chengdu:Springer 2012 Lecture Notes in Computer Science, 2012.
[3] YAO Yiyu. Three-way decisions with probabilistic rough sets. Information Sciences, 2010, 180: 341-353.
[4] YAO Yiyu. The superiority of three-way decisions in probabilistic rough set models, Information Sciences, 2011,181:1080-1096.
[5] PAWLAK Z. Roughsets[J]. International Journal of Computer and Information Sciences, 1982, 11(5):341-356.
[6] PAWLAK Z. Roughset:theoreticalaspects of reasonsing about data[M].Dordrecht:Kluwer Academic Publishers, 1991.
[7] YAO Yiyu, Wong S K M, Lingras P.A decision -theoretic rough set model[C].The 5th International Symposium on Methodologies for Intelligent Systemd, 1990.
[8] YAO Yiyu, Wong S K M. A decision theoretic framework for approximating concepts[J].International Journal of Man-Machine studies,1992, 37:793-809.
[9] YAO Yiyu. Probabilistic approaches to rough sets[J].Expert System,2003,20:287-297.
[10] YAO Yiyu. Probabilistic rough set approximations[J].I nternational Journal of Approximate Reasoning, 2008, 49:255-271.
[11] 中国知网.《知网》情感分析用词语集:Beta 版[EB/OL].2014. http://www.keenage.com/html/c_ index.html. HowNet. HowNet.“HowNet”word set for sentiment analysis:Beta Version[EB/OL]. 2014.http://www.keenage.com/html/c_ index.html.
[12] 朱艳辉,徐叶强,王文华,等. 中文评论文本观点抽取方法研究[C]//第三届中文倾向性分析评测论文集. 济南:中国科学院计算技术研究所,2011:126-135. ZHU Yanhui, XU Yeqiang, WANG Wenhua, et al. Research on Opinion Extraction of Chinese Review[C]//The Third Chinese Opinion Analysis Evaluation Proleedings. Jinan: Institute of Computing Technology, Chinese Academy of Science, 2011: 126-135.
[13] 杜锐,朱艳辉,鲁琳,等. 基于SVM的中文微博观点句识别算法[J]. 湖南工业大学学报, 2013(2):89-93. DU Rui, ZHU Yanhui, LU Lin, et al. The SVM-based algorithm for Chinese micro-blog opinion sentence identification[J]. Journal of Hunan University of Technology, 2013(2):89-93.
[14] 张华平. ICTCLAS2014汉语分词系统.NLPIR下载[EB/OL].2014. http://ictclas.nlpir.org/ZHANG Huaping. ICTCLAS2014 Chinese Word Segmentation.DownloadNLPIR[EB/OL].2014. http://ictclas.nlpir.org/.
[15] 中国计算机学会. 中文微博情感分析评测--样例数据集[EB/OL].[2012-07-01]. http://tcci.ccf.org.cn/conference/2012/pages/page04_eva.html. China Computer Federation. The Chinese micro-blog emotional analysis and evaluation: sample data sets[EB/OL].[2012-07-01]. http://tcci.ccf.org.cn/conference/2012/pages/page04_eva.html.
[16] 贾修一,商琳,周献中,等. 三支决策理论与应用[M]. 南京:南京大学出版社,2012:61-79. JIA Xiuyi, SHANG Lin, ZHOU Xianzhong, et al. Threey-way decisions theory and its applications[M]. Nanjing: Nanjing University Press, 2012: 61-79.
[1] 刘国涛,张燕平,徐晨初. 一种优化覆盖中心的三支决策模型[J]. 山东大学学报(理学版), 2017, 52(3): 105-110.
[2] 胡默之,姚天昉. 中文微博观点句识别及评价对象抽取方法[J]. 山东大学学报(理学版), 2016, 51(7): 81-89.
[3] 昝红英, 吴泳钢, 贾玉祥, 牛桂玲. 基于多源知识的中文微博命名实体链接[J]. 山东大学学报(理学版), 2015, 50(07): 9-16.
[4] 杨佳能, 阳爱民, 周咏梅. 基于语义分析的中文微博情感分类方法[J]. 山东大学学报(理学版), 2014, 49(11): 14-21.
[5] 张里博, 李华雄, 周献中, 黄兵. 人脸识别中的多粒度代价敏感三支决策[J]. 山东大学学报(理学版), 2014, 49(08): 48-57.
[6] 杜丽娜, 徐久成, 刘洋洋, 孙林. 基于三支决策风险最小化的风险投资评估应用研究[J]. 山东大学学报(理学版), 2014, 49(08): 66-72.
[7] 张聪, 于洪. 一种三支决策软增量聚类算法[J]. 山东大学学报(理学版), 2014, 49(08): 40-47.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!