JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2016, Vol. 51 ›› Issue (11): 26-32.doi: 10.6040/j.issn.1671-9352.1.2015.E14

Previous Articles     Next Articles

Algorithm of knowledge base cumulative citation recommendation based on semantic features expansion

XU Ye, XU Wei-ran   

  1. School of Information and Communication and Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
  • Received:2015-09-18 Online:2016-11-20 Published:2016-11-22

Abstract: The task of knowledge base cumulative citation recommendation was mainly decomposed into three basic key problems: query expansion based on an entity name in knowledge base, feature extraction for documents and entities.We proposed a method that using the combination of the semantic dictionary(DBpedia)and the word vector(word embedding)for query expansion, and using LDA and ESA algorithms for feature extraction. Finally classify documents based on linear Logistic Regresion combined with unlinear random forest. The F1 value of this system operated on TREC KBA2014 promoted 14.7% compared to the baseline, which indicated that the method raised by the study is good at dealing with question of citation recommendation.

Key words: query expansion, feature extraction, knowledge base, classification

CLC Number: 

  • TP391
[1] ALLAN J. Topic detection and tracking: event-based information organization [M]. Norwell: Kluwer Academic Publishers, 2002:194-218.
[2] 史存会, 林鸿飞. 追踪事件微博报道:一种流的动态话题模型[J]. 山东大学学报(理学版), 2012, 47(5):78-79. SHI Cunhui, LIN Hongfei. Tracking event microblogs: a streaming dynamic topic model[J]. Journal of Shandong University(Natural Science), 2012, 47(5):78-79.
[3] HANANI U, SHAPIRA B, SHOVAL P. Information filtering: overview of issues, research and systems [J]. User Modeling and User-Adapted Interaction, 2001, 11(3):203-259.
[4] BODNER R C, SONG F. Knowledge-based approaches to query expansion in information retrieval[J]. Lecture Notes in Computer Science, 1996, 1081:146-158.
[5] 王瑞琴, 孔繁胜. 基于无导词义消歧的语义查询扩展[J]. 情报学报, 2011, 30(2):131-137. WANG Ruiqin, KONG Fansheng. Semantic query expansion based on unsupervised word sense disambiguation[J]. Journal of the China Society for Scientific and Technical Information, 2011, 30(2):131-137.
[6] 杨清琳, 李陶深, 农健. 基于领域本体知识库的语义查询扩展[J]. 计算机工程与设计, 2011, 32(11):3853-3856. YANG Qinglin, LI Taoshen, NONG Jian. Semantic query expansion based on domain ontology knowledge base[J]. Computer Engineering and Design, 2011, 32(11):3853-3856.
[7] 付剑锋, 刘宗田, 刘念祖. 基于多知识库和局部反馈的查询扩展研究[J]. 情报杂志, 2013,32(2):103-106. FU Jianfeng, LIU Zongtian, LIU Nianzu.Research on query expansion based on multi-knowledge base and local feedback[J].Journal of Intelligence, 2013, 32(2):103-106.
[8] 毛琪, 黄永峰. 基于网络知识库与通用搜索引擎的查询词扩展方法[J]. 计算机应用, 2012,32(S2):5-9. MAO Qi, HUANG Yongfeng. Query expansion based on Web knowledge base and search engine[J]. Journal of Computer Applications, 2012, 32(S2):5-9.
[9] 李卫疆, 赵铁军, 王宪刚. 基于上下文的查询扩展[J]. 计算机研究与发展, 2010, 47(2):300-304. LI Weijiang, ZHAO Tiejun, WANG Xiangang. Context-sensitive query expansion[J]. Journal of Computer Research and Development, 2010, 47(2):300-304.
[10] 邹扬. WAF改进算法在基于语义分析的查询扩展上的应用[D]. 北京:北京邮电大学, 2012. ZOU Yang. Topic detection and tracking based on semantic framework [D].Beijing: Beijing University of Posts and Telecommunications, 2012.
[11] 于东, 荀恩东. 基于Word Embedding语义相似度的字母缩略术语消歧[J]. 中文信息学报, 2014, 28(5):51-59. YU Dong, XUN Endong. Acronym term disambiguation based on semantic similarity calculated by word embedding[J].Journal of Chinese Information Processing, 2014, 28(5):51-59.
[12] 石松, 王明文, 涂伟,等. 基于Markov网络团的信息检索扩展模型[J]. 山东大学学报(理学版), 2011(5):54-57. SHI Song, WANG Mingwen, TU Wei, et al. Extended information retrieval model based on the Markov network cliques[J]. Journal of Shandong University(Natural Science), 2011(5):54-57.
[13] WANG J, SONG D, LIN C Y, et al. Bit and MSRA at TREC KBA CCR track 2013[C/OL]. Proceedings of the 22nd Text Retrieval Conference.[2015-03-02]. http://trec.nist.gov/pubs/trec22/papers/BIT-MSRA-kba.pdf.
[14] KJERSTEN B, MCNAMEE P. The HLTCOE approach to the TREC 2012 KBA track[C/OL]. Proceedings of the 22nd Text Retrieval Conference.[2015-03-02]. http://trec.nist.gov/pubs/trec21/papers/hltcoe.kba.final.pdf
[15] BALOG K, RAMAMPIARO H. Cumulative citation recommendation: classification vs. ranking[C] //Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval. New York: ACM, 2013:941-944.
[16] GUO J. An activation force-based affinity measure for analyzing complex networks[J]. Scientific Reports, 2011, 1(10):1-9.
[17] MIKOLOV T, SUTSKEVER I, CHEN K, et al. Distributed representations of words and phrases and their compositionality [J]. Advances in Neural Information Processing Systems, 2013, 26:3111-3119.
[18] BENGIO Y, SCHWENK H, SENÉCAL J S, et al. A neural probabilistic language model [J]. Journal of Machine Learning Research, 2003, 3(6):1137-1155.
[19] BLEI D M, NG A Y, JORDAN M I. Latent dirichlet allocation [J]. Journal of Machine Learning Research, 2003, 3:993-1022.
[20] GABRILOVICH E, MARKOVITCH S. Wikipedia-based semantic interpretation for natural language processing [J]. Journal of Artificial Intelligence Research, 2009, 34(4):443-498.
[1] . Reader emotion classification with news and comments [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 35-39.
[2] ZUO Zhi-cui, ZHANG Xian-yong, MO Zhi-wen, FENG Lin. Block discernibility matrix based on decision classification and its algorithm finding the core [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(8): 25-33.
[3] CHEN Xin, XUE Yun, LU Xin, LI Wan-li, ZHAO Hong-ya, HU Xiao-hui. Text feature extraction method for sentiment analysis based on order-preserving submatrix and frequent sequential pattern mining [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 36-45.
[4] LI Hui-hui, LIU Xi-qiang, XIN Xiang-peng. Differential invariants and exact solutions of variable coefficients Benjamin-Bona-Mahony-Burgers equation [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(10): 51-60.
[5] SUN Jian-dong, GU Xiu-sen, LI Yan, XU Wei-ran. Chinese entity relation extraction algorithms based on COAE2016 datasets [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 7-12.
[6] YANG Yan, XU Bing, YANG Mu-yun, ZHAO Jing-jing. An emotional classification method based on joint deep learning model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 19-25.
[7] ZHANG Peng, WANG Su-ge, LI De-yu, WANG Jie. A semi-supervised spam review classification method based on heuristic rules [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(7): 44-51.
[8] SHI Han-xiao, LI Xiao-jun, HAO Teng-da, LIU Hong, ZHU Liu-qing. Emotion analysis on Microblog short text [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(7): 80-90.
[9] DU Man, XU Xue-ke, DU Hui, WU Da-yong, LIU Yue, CHENG Xue-qi. Emotion-specific word embedding learning for emotion classification [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(7): 52-58.
[10] TANG Ming-wei, SU Xin-ning, JIANG Xun. The RESTful web services and knowledge base collaborative driven real-time tracking of emergency network opinion [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(6): 49-55.
[11] QIAO Hu-sheng, BAI Yong-fa. Characterization of monoids by inverse S-acts [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(2): 1-4.
[12] LUO Yong-gui. Maximal(regular)subsemigroups of the semigroup W(n,r) [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(10): 7-11.
[13] TANG Liang, ZHAO Xiao-feng, XI Yao-yi, YI Mian-zhu. The method of query expansion based on local co-occurrence and context similarity [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(1): 29-36.
[14] WAN Zhong-ying, WANG Ming-wen, ZUO Jia-li, WAN Jian-yi. Feature selection combined with the global and local information(GLFS) [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(5): 87-93.
[15] MA Fei-xiang, LIAO Xiang-wen, YU Zhi-yong, WU Yun-bing, CHEN Guo-long. A text opinion retrieval method based on knowledge graph [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(11): 33-40.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!