JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE) ›› 2024, Vol. 59 ›› Issue (7): 53-63.doi: 10.6040/j.issn.1671-9352.1.2023.080

• Review • Previous Articles     Next Articles

A document-level event extraction method based on core arguments

Chengjie SUN(),Zongwei LI,Lili SHAN,Lei LIN   

  1. Faculty of Computing, Harbin Institute of Technology, Harbin 150001, Heilongjiang, China
  • Received:2023-10-18 Online:2024-07-20 Published:2024-07-15

Abstract:

A document-level event extraction method based on core arguments(CA-DocEE) is proposed, which defines criteria for selecting core arguments based on their distributions in document-level events, uses heterogeneous graph convolutional neural networks to augment document contextual information for encoding argument entities, and captures deep semantic information in sentences based on machine reading comprehension methods for classifying the role of arguments. On the document-level event extraction dataset, the method proposed in this paper achieves a micro-average F1 value of 80.1%, which is comparable with the state-of-the-art methods.

Key words: event extraction, document-level event extraction, machine reading comprehension, graph convolutional neural network

CLC Number: 

  • TP391.1

Fig.1

Architecture of discourse level event extraction model based on argument relationship graph"

Fig.2

Example of argument relationship diagram"

Fig.3

Diagram of argument classification model"

Table 1

Comparison of the results of argument mention 单位: %"

论元提及识别模型 micro-P micro-R micro-F1
BiLSTM+CRF 88.0 82.9 85.4
BERT+MCRF 91.0 91.2 91.1

Table 2

Comparison of overall experimental results 单位: %"

模型 S M All
DCFEE-O 72.4 52.4 63.2
GIT 86.8 72.3 79.9
PTPCG 88.2 69.1 79.4
PTPCG复现结果 86.2 68.6 78.5
CA-DocEE 88.0 70.3 80.1

Table 3

Results of ablation experiment 单位: %"

模型 micro-P micro-R micro-F1
使用Zhu[1]的伪触发词作为核心论元 82.7 76.8 79.6
-MRC 83.2 76.9 79.9
CA-DocEE 83.7 77.4 80.1
1 ZHU Tong, QU Xiaoye, CHEN Wenliang, et al. Efficient document-level event extraction via pseudo-trigger-aware pruned complete graph[EB/OL]. (2021-12-11)[2023-10-18]. http://arxiv.org/abs/2112.06013.
2 RILOFF E. Automatically constructing a dictionary for information extraction tasks[C]//AAAI'93: Proceedings of the Eleventh National Conference on Artificial Intelligence. Washington: AAAI Press, 1993: 811-816.
3 AHN D. The stages of event extraction[C]// Proceedings of the Workshop on Annotating and Reasoning about Time and Events. Stroudsburg: Association for Computational Linguistics, 2006: 1-8.
4 CHEN Zheng, JI Heng. Language specific issue and feature exploration in Chinese event extraction[C]//Proceedings of Human Language Technologies: the 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers on-NAACL'09. Morristown: Association for Computational Linguistics, 2009: 209-212.
5 LIAO S S, GRISHMAN R. Using document level cross-event inference to improve event extraction[C]//Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics. Uppsala: ACM, 2010: 789-797.
6 JI H, GRISHMAN R. Refining event extraction through cross-document inference[C]//Proceedings of ACL-08: Hlt. Columbus: Association for Computational Linguistics, 2008: 254-262.
7 CHEN Yubo, XU Liheng, LIU Kang, et al. Event extraction via dynamic multi-pooling convolutional neural networks[C]//Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing. Beijing: Association for Computational Linguistics, 2015: 167-176.
8 YANG Sen, FENG Dawei, QIAO Linbo, et al. Exploring pre-trained language models for event extraction and generation[C]//Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Florence: Association for Computational Linguistics, 2019: 5284-5294.
9 SHA L, LIU J, LIN C Y, et al. RBPB: regularization-based pattern balancing method for event extraction[C]//Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics. Berlin: Association for Computational Linguistics, 2016: 1224-1234.
10 DU X Y, CLAIRE C. Event extraction by answering (almost) natural questions[EB/OL]. (2020-04-28)[2023-10-18]. http://arxiv.org/abs/2004.13625.
11 CHEN C, NG V. Joint modeling for Chinese event extraction with rich linguistic features[C]// Proceedings of COLING. Mumbai: The COLING 2012 Organizing Committee, 2012: 529-544.
12 LI Qi, JI Heng, HUANG Liang, et al. Joint event extraction via structured prediction with global features[C]// Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics. Bulgaria: Association for Computational Linguistics, 2013: 73-82.
13 NGUYEN T H, CHO K, GRISHMAN R. Joint event extraction via recurrent neural networks[C]//Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. San Diego: Association for Computational Linguistics, 2016: 300-309.
14 CHEN Yubo, LIU Shulin, ZHANG Xiang, et al. Automatically labeled data generation for large scale event extraction[C]// Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Vancouver: Association for Computational Linguistics, 2017: 409-419.
15 LI Qian , PENG Hao , LI Jianxin , et al. Reinforcement learning-based dialogue guided event extraction to exploit argument relations[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, 30, 520- 533.
doi: 10.1109/TASLP.2021.3138670
16 YANG Hang, CHEN Yubo, LIU Kang, et al. DCFEE: a document-level Chinese financial event extraction system based on automatically labeled training data[C]// Proceedings of ACL 2018, System Demonstrations. Melbourne: Association for Computational Linguistics, 2018: 50-55.
17 WANG Haitao , ZHU Tong , WANG Mingtao , et al. A prior information enhanced extraction framework for document-level financial event extraction[J]. Data Intelligence, 2021, 3 (3): 460- 476.
doi: 10.1162/dint_a_00103
18 ZHANG Hongkuan, SONG Hui, WANG Shuyi, et al. A bert-based end-to-end model for Chinese document-level event extraction[C]// Proceedings of the 19th Chinese National Conference on Computational Linguistics. Haikou: Chinese Information Processing Society of China, 2020: 390-401.
19 YANG Hang, SUI Dianbo, CHEN Yubo, et al. Document-level event extraction via parallel prediction networks[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2021: 6298-6308.
20 WANG Peng , DENG Zhenkai , CUI Ruilong . TDJEE: a document-level joint model for financial event extraction[J]. Electronics, 2021, 10 (7): 824.
doi: 10.3390/electronics10070824
21 LIU Jian , LIANG Chen , XU Jinan . Document-level event argument extraction with self-augmentation and a cross-domain joint training mechanism[J]. Knowledge-based Systems, 2022, 257, 109904.
doi: 10.1016/j.knosys.2022.109904
22 ZHENG Shun, CAO Wei, XU Wei, et al. Doc2EDAG: an end-to-end document-level framework for Chinese financial event extraction[C]//Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Hong Kong: Association for Computational Linguistics, 2019: 337-346.
23 XU Runxin, LIU Tianyu, LI Lei, et al. Document-level event extraction via heterogeneous graph-based interaction model with a tracker[C]//Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing. Stroudsburg: Association for Computational Linguistics, 2021: 3533-3546.
24 WEI Tianwen, QI Jianwei, HE Shenghuan, et al. Masked conditional random fields for sequence labeling[C]//Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Stroudsburg: Association for Computational Linguistics, 2021: 2024-2035.
25 ZENG Shuang, XU Runxin, CHANG Baobao, et al. Double graph based reasoning for document-level relation extraction[C]// Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Stroudsburg: Association for Computational Linguistics, 2020: 1630-1640.
26 WANG Difeng, HU Wei, CAO Ermei, et al. Global-to-local neural networks for document-level relation extraction[C]// Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Stroudsburg: Association for Computational Linguistics, 2020: 3711-3721.
[1] BAO Liang, CHEN Zhi-hao, CHEN Wen-zhang, YE Kai, LIAO Xiang-wen. Dual co-matching network with multiway attention for opinion reading comprehension [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2021, 56(3): 44-53.
[2] YIN Ai-ying, LIN Jian-zhou, WU Yun-bing, LIAO Xiang-wen. Sentiment classification combining graph convolution neural network [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2021, 56(11): 15-23.
[3] LIN Li. News event extraction based on kernel dependency graph [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2016, 51(9): 121-126.
[4] LI Feng-huan, ZHENG De-quan, ZHAO Tie-jun. Temporal recognition for topic event based on shallow semantic parsing [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2015, 50(11): 74-80.
[5] XU Xia, LI Pei-feng, ZHENG Xin, ZHU Qiao-ming. Event inference for semi-supervised Chinese event extraction [J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(12): 12-17.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] LI Hui-juan,YIN Hai-ying,ZHANG Xue-cheng and YANG Ai-fang* . Enhancement of drought resistance in transgenic tobacco expressing sucrose: Sucrose 1-fructosyltransferase gene from Lactuca sativa[J]. J4, 2007, 42(1): 89 -94 .
[2] HU Chuan-Ke, CHEN Ru-Hui, DIAO E-Ou. Prediction of proteinprotein interaction based on improved pseudo amino acid composition[J]. J4, 2009, 44(9): 17 -21 .
[3] WANG Hong-mei,XIAO Min*,LI Zheng-yi,LI Yu-mei,QIAN Xin-min, . Screening and identification of βgalactosidaseproducing microorganism and enzymatic synthesis of galactooligosaccharides using its transgalactosylation[J]. J4, 2006, 41(1): 133 -139 .
[4] ZHANG Jia-li, MIAO Lian-ying, SONG Wen-yao. Edge colorings of 1-planar graphs for maximum degree eight #br# without adjacent 4-cycles[J]. JOURNAL OF SHANDONG UNIVERSITY(NATURAL SCIENCE), 2014, 49(04): 18 -23 .
[5] SUN Xiao-ting1, JIN Lan2*. Application of DOSY in oligosaccharide mixture analysis[J]. J4, 2013, 48(1): 43 -45 .
[6] REN Min1,2, ZHANG Guang-hui1. Absorbing probabilities of random walks in an independent random  environment convergence in distribution on the half-line[J]. J4, 2013, 48(1): 93 -99 .
[7] XU Jun-feng. On the growth of the meromorphic solutions of complex algebraic differential equations[J]. J4, 2010, 45(6): 91 -93 .
[8] ZHANG Li,XU Yu-ming . [J]. J4, 2006, 41(5): 30 -32 .
[9] WANG Gang, XU Xin-shun*. A new Multi-instance learning method for scene classification[J]. J4, 2010, 45(7): 108 -113 .
[10] LU Wei-jie,ZHU Chen-fu,SONG Cui and YANG Yan-li . Determination of inorganic cations in the Chinese traditional drug Yujin by capillary electrophoresis[J]. J4, 2007, 42(7): 13 -18 .