JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2021, Vol. 56 ›› Issue (5): 66-75.doi: 10.6040/j.issn.1671-9352.1.2020.029

Previous Articles     Next Articles

An extractive topic brief representation generation method to event

WANG Wei-yu1,2, SHI Cun-hui1,2*, YU Xiao-ming1, LIU Yue1, CHENG Xue-qi1   

  1. 1. CAS Key Laboratory of Network Data Science and Technology, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China;
    2.University of Chinese Academy of Sciences, Beijing 100190, China
  • Online:2021-05-20 Published:2021-05-13

Abstract: This paper takes advantage of the fact that event description contents are highly similar, and proposes an extractive topic brief representation generation method, which takes the titles in the event document set as the processing object, and extracts the common information retaining the original word order from different titles, further integrates these common information to generate a topic brief representation of the event. The experimental results on the event data from search engines show that this method can well generate the topic brief representation with concise and accurate form, clear and complete semantics and good readability.

Key words: topic brief representation generation, extractive, event

CLC Number: 

  • TP391
[1] 洪宇,张宇,刘挺,等. 话题检测与跟踪的评测及研究综述[J]. 中文信息学报,2007,21(6):71-87. HONG Yu, ZHANG Yu, LIU Ting, et al. Topic detection and tracking review[J]. Journal of Chinese Information Processing, 2007, 21(6):71-87.
[2] 鲁琳. 面向中文微博的舆情分析技术研究[D]. 株洲:湖南工业大学,2014. LU Lin. Research on Chinese microblog public opinion analysis[D]. Zhuzhou: Hunan University of Technology, 2014.
[3] YOU Y, HUANG G, CAO J, et al. GEAM:a general and event-related aspects model for twitter event detection[C] //International Conference on Web Information Systems Engineering. Berlin: Springer, 2013: 319-332.
[4] ZHENG L, JIN P, ZHAO J, et al. A fine-grained approach for extracting events on microblogs[M] //Database and Expert Systems Applications. Munich:Springer International Publishing, 2014: 275-283.
[5] 徐雷,潘珺. 事件表示方式及其语义表示模型研究[J].情报杂志,2019,38(6):159-167. XU Lei, PAN Jun. Research on the way of event representation and its semantic representation model[J]. Journal of Intelligence, 2019, 38(6):159-167.
[6] 仲兆满,李存华,刘宗田,等. 面向Web新闻的事件多要素检索方法[J]. 软件学报,2013,24(10):2366-2378. ZHONG Zhaoman, LI Cunhua, LIU Zongtian, et al. Web news oriented event multi-elements retrieval[J]. Journal of Software, 2013, 24(10):2366-2378.
[7] 张瑾,杨森,王孝宗,等. 话题检测与跟踪研究进展综述[J]. 信息技术快报,2010,8(4):52-60. ZHANG Jin, YANG Sen, WANG Xiaozong, et al. Review of research progress on topic detection and tracking[J]. Information Technology Letter, 2010, 8(4):52-60.
[8] 张仰森,段宇翔,黄改娟,等. 社交媒体话题检测与追踪技术研究综述[J]. 中文信息学报,2019,33(7):1-10. ZHANG Yangsen, DUAN Yuxiang, HUANG Gaijuan, et al. A survey on topic detection and tracking methods in social media[J]. Journal of Chinese Information Processing, 2019, 33(7):1-10.
[9] SALTON G, BUCKLEY C. Term-weighting approaches in automatic text retrieval[J]. Information Processing & Management, 1988, 24(5):513-523.
[10] 赵京胜,朱巧明,周国栋,等. 自动关键词抽取研究综述[J]. 软件学报,2017,28(9):2431-2449. ZHAO Jingsheng, ZHU Qiaoming, ZHOU Guodong, et al. Review of research in automatic keyword extraction[J]. Journal of Software, 2017, 28(9):2431-2449.
[11] PAGE L, BRIN S, MOTWANI R, et al. The PageRank citation ranking:bringing order to the Web[J]. Stanford Digital Libraries Working Paper, 1998, 9(1):1-14.
[12] MIHALCEA R, TARAU P. TextRank: bringing order into text[C] //Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. Barcelona: Association for Computational Linguistics, 2004: 404-411.
[13] 刘栋,张彩环. 基于短语的中文标签自动生成混合算法[J]. 计算机科学,2014,41(S1):87-90. LIU Dong, ZHANG Caihuan. Keyphrase-based Chinese tags generation hybrid algorithm[J]. Computer Science, 2014, 41(S1):87-90.
[14] 刘兴林,郑启伦,马千里. 一种基于主题词集的自动文摘方法[J]. 计算机应用研究,2011,28(4):1322-1324. LIU Xinglin, ZHENG Qilun, MA Qianli. Automatic summarization method based on thematic term set[J]. Application Research of Computers, 2011, 28(4):1322-1324.
[15] 李娜娜,刘培玉,刘文锋,等. 基于TextRank的自动摘要优化算法[J]. 计算机应用研究,2019,36(4):1045-1050. LI Nana, LIU Peiyu, LIU Wenfeng, et al. Automatic digest optimization algorithm based on TextRank[J]. Application Research of Computers, 2019, 36(4):1045-1050.
[16] 韩永峰,许旭阳,李弼程,等. 基于事件抽取的网络新闻多文档自动摘要[J]. 中文信息学报,2012,26(1):58-67. HAN Yongfeng, XU Xuyang, LI Bicheng, et al. Web news multi-document summarization based on event extraction[J]. Journal of Chinese Information Processing, 2012, 26(1):58-67.
[17] 王晓东. 计算机算法设计与分析[M]. 北京:电子工业出版社,2012:44-54. WANG Xiaodong. Computer algorithm design and analysis[M]. Beijing: Publishing House of Electronics Industry, 2012:44-54.
[18] Daniel S Hirschberg. Algorithms for the longest common subsequence problem[J]. Journal of the ACM(JACM), 1977, 24(4):664-675.
[19] BERGROTH L, HAKONEN H, RAITA T. A survey of longest common subsequence algorithms[C] //Proceedings Seventh International Symposium on String Processing and Information Retrieval. Curuna: IEEE, 2000: 39-48.
[20] BOUDIN F. A comparison of centrality measures for graph-based keyphrase extraction[C] //Proceedings of the Sixth International Joint Conference on Natural Language Processing. Nagoya: Asian Federation of Natural Language Processing, 2013: 834-838.
[21] 章志华,陆海良,郁钢. 基于TFIDF算法的关键词提取方法[J]. 信息技术与信息化,2015,188(8):164-166. ZHANG Zhihua, LU Hailiang, YU Gang. A keyword extracting technique based on TFIDF algorithm[J]. Information Technology and Informatization, 2015, 188(8):164-166.
[22] LIN C Y. ROUGE:a package for automatic evaluation of summaries[C] //Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics. Barcelona: ACL, 2004: 74-81.
[1] YANG Yang, WU Bao-wei, WANG Yue-e. Input-output finite time stability of asynchronous switched systems with event-triggered [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2020, 55(2): 118-126.
[2] WANG Xu-dong, SUN Yan, GONG Chun-mei. Structure of eventually C-L-weakly regular semigroups [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2020, 55(2): 63-67.
[3] FENG Na-na, WU Bao-wei. Input-output finite time stability for event-triggered control of switched singular systems [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2019, 54(3): 75-84.
[4] YE Xiao-ming, CHEN Xing-shu, YANG Li, WANG Wen-xian, ZHU Yi, SHAO Guo-lin, LIANG Gang. Anomaly detection model of host group based on graph-evolution events [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 1-11.
[5] LIN Li. News event extraction based on kernel dependency graph [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 121-126.
[6] LI Xi-peng, GUO Yan,ZHAO Ling, ZHANG Ru-qing, LIU Yue, YU Xiao-ming, CHENG Xue-qi. A news App popular comment prediction framework based on event detection [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(3): 91-97.
[7] DONG Ke, LÜ Wen-yuan, WANG Yong-zhi. Optimal preventive maintenance policy for second-hand equipment under lease considering residual value [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(12): 95-102.
[8] HE Xin-hua, HU Wen-fa, XIAO Min. Coordination optional contract mechanism of service supply chain for emergencies [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(11): 81-90.
[9] LI Feng-huan, ZHENG De-quan, ZHAO Tie-jun. Temporal recognition for topic event based on shallow semantic parsing [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(11): 74-80.
[10] XU Xia, LI Pei-feng, ZHENG Xin, ZHU Qiao-ming. Event inference for semi-supervised Chinese event extraction [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(12): 12-17.
[11] LI Ling1, CHENG Guo-qing1, TANG Ying-hui2. Optimal inspection and replacement policy for a shock model with preventive repair [J]. J4, 2011, 46(9): 122-126.
[12] DING Ran, LI Qi-Qiang, LIANG Tao. Short-term scheduling formulation with decomposition structurefor multi-purpose batch plants [J]. J4, 2010, 45(1): 73-79.
[13] JIAO Tie-Ke. The eventually regularity on a class subsemirings of a semiring [J]. J4, 2009, 44(8): 56-57.
[14] LIU Ming-hui,ZHU Cheng-guang . The Monte Carlo study of W boson polarization [J]. J4, 2008, 43(5): 14-18 .
[15] LI Tong-xing,HAN Zhen-lai,ZHANG Meng,CAO Feng-juan . Oscillation of second order nonlinear neutral delay difference equations with continuous arguments [J]. J4, 2008, 43(2): 70-71 .
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!