JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2017, Vol. 52 ›› Issue (9): 7-12.doi: 10.6040/j.issn.1671-9352.1.2016.PC7

Previous Articles     Next Articles

Chinese entity relation extraction algorithms based on COAE2016 datasets

SUN Jian-dong, GU Xiu-sen, LI Yan, XU Wei-ran*   

  1. Beijing University of Posts and Telecommunications, Lab of Pattern Recognition and Intelligent System, Beijing 100876, China
  • Received:2016-11-25 Online:2017-09-20 Published:2017-09-15

Abstract: Entity relation extraction is one of the important procedures of knowledge graph technology. Research on entity relation extraction in English is comparatively developed. By contrast, the development of Chinese entity relation extraction is not ideal, and it is mainly because the lack of corpus. In order to solve this problem, COAE2016 proposes a Chinese entity relation extraction task in task 3. In this paper, we use three algorithms to solve the problem: a pattern based algorithm, a SVM based algorithm and a CNN based algorithm respectively. Then, we analyze the advantages and the disadvantages of the three algorithms according to the effects of the dataset in COAE2016 Experiments show that the SVM based algorithm and the CNN based algorithm are useful to extract entity relation.

Key words: feature extraction, SVM, CNN, pattern match

CLC Number: 

  • TP391
[1] 徐健,张智雄,吴振新. 实体关系抽取的技术方法综述[J]. 现代图书情报技术, 2008(8): 18-23. XU Jian, ZHANG Zhixiong, WU Zhenxin. Review on techniques of entity relation extraction [J]. New Technology of Library and Information Service, 2008(8):18-23.
[2] 毛小丽, 何中市, 邢欣来, 等. 基于特征选择的实体关系抽取[J]. 计算机应用研究, 2012, 29(2):530-532. MAO Xiaoli, HE Zhongshi, XING Xinlai, et al. Entity relation extraction based on feature selection[J]. Application Research of Computers, 2012, 29(2):530-532.
[3] 车万翔, 刘挺, 李生. 实体关系自动抽取[J]. 中文信息学报, 2004, 19(2): 1-6. CHE Wanxiang, LIU Ting, LI Sheng. Automatic entity relation extraction [J]. Journal of Chinese Information Processing, 2004, 19(2):1-6.
[4] 刘建舟, 邵雄凯. 一种改进的中文实体关系抽取方法[J]. 软件导刊,2011,10(4):27-29. LIU Jianzhou, SHAO Xiongkai. An improved method of chinese entity relation extraction [J]. Software Guide, 2011, 10(4):27-29.
[5] 张素香, 文娟, 秦颖, 等. 实体关系的自动抽取研究[J]. 哈尔滨工程大学学报, 2006, 27(S1):370-373. ZHANG Suxiang, WEN Juan, QIN Ying, et al. Study about automatic entity relation extraction [J]. Journal of Harbin Engineering University, 2006, 27(S1):370-373.
[6] LECUN Yann, BENGIO Yoshua, HINTON Geoffrey. Deep learning[J]. Nature.2015, 521(7553): 436-444.
[7] KRIZHEVSKY Alex SUTSKEVER Ilya, HINTON Geoffrey. ImageNet classification with deep convolutionalneural networks[J]. International Conference on Neural Information Processing Systems, 2012, 25(2):1097-1105.
[8] ZHANG Shiliang, LIU Cong, JIANG Hui, et al. Feedforward sequential memory networks:a new structure to learn long-term dependency [J]. Computer Science, 2015, arXiv:1510.02693.
[9] BENGIO Y, DUCHARME R, VINCENT P, et al. A neural probabilistic language model[J]. Journal of Machine Learning Research, 2014(3):1137-1155.
[10] HASHIMOTO K, STENETORP P, MIWA M, et al. Task-oriented learning of word embeddings for semantic relation classification[J]. Computer Science,2015, arXiv: 1503. 00095.
[11] HENDRICKX I, KIM S N, KOZAREVA Z, et al. Semeval-2010 task 8: multi-way classification of semantic relations between pairs of nominal[C] //Proceedings of the NAACL HLT Workshop on Semantic Evaluations: Recent Achievements and Future Directions Boulder: Association for Computational Linguistics Stroudsburg, PA, USA, 2009: 94-99.
[12] MIKOLOV Tomas, SUTSKEVER Ilya, CHEN Kai, et al. Distributed representations of words and pharses and their coposi-tionality[J].Computer Science, 2013, arXiv:1310.4546.
[1] GONG Shuang-shuang, CHEN Yu-feng, XU Jin-an, ZHANG Yu-jie. Extraction of Chinese multiword expressions based on Web text [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(9): 40-48.
[2] CHEN Xin, XUE Yun, LU Xin, LI Wan-li, ZHAO Hong-ya, HU Xiao-hui. Text feature extraction method for sentiment analysis based on order-preserving submatrix and frequent sequential pattern mining [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2018, 53(3): 36-45.
[3] YANG Yan, XU Bing, YANG Mu-yun, ZHAO Jing-jing. An emotional classification method based on joint deep learning model [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(9): 19-25.
[4] SHI Han-xiao, LI Xiao-jun, HAO Teng-da, LIU Hong, ZHU Liu-qing. Emotion analysis on Microblog short text [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2017, 52(7): 80-90.
[5] PENG Qiu-fang, LIU Yang. Research of gender prediciton based on SVM with E-commerce data [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(7): 74-80.
[6] XU Ye, XU Wei-ran. Algorithm of knowledge base cumulative citation recommendation based on semantic features expansion [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(11): 26-32.
[7] WANG Hui, CHEN Guang. Feature extraction method based on Bootstrapping in English product comment [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(12): 23-29.
[8] LIU Biao1,2, CHEN Chun-ping3, FENG Hua-min1,3, LI Yang3. A SVM parameters selection algorithm based on Fisher criterion [J]. J4, 2012, 47(7): 50-54.
[9] XU Guang-zhu1, LIU Ming2, REN Dong1, MA Yi-de3, LIU Xiao-li1. Multi-region image segmentation based on pulse coupled neural network [J]. J4, 2010, 45(7): 86-93.
[10] XIAN Jian,MO Xuan-lang and XI Jiang-qing . A question answering system based on question pattern match [J]. J4, 2006, 41(3): 100-103 .
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!