Journal of Shandong University (Natural Science), 2024, Vol. 59, Issue (7): 53-63. doi: 10.6040/j.issn.1671-9352.1.2023.080

A document-level event extraction method based on core arguments

Chengjie SUN, Zongwei LI, Lili SHAN, Lei LIN

  1. Faculty of Computing, Harbin Institute of Technology, Harbin 150001, Heilongjiang, China
  • Received: 2023-10-18 Online: 2024-07-20 Published: 2024-07-15
  • About the author: Chengjie SUN (born 1980), male, associate professor, Ph.D. Research interests: natural language processing, information extraction, and dialogue systems. E-mail: sunchengjie@hit.edu.cn


This paper proposes CA-DocEE (core arguments-based document-level event extraction), a document-level event extraction method built on core arguments. The method defines selection criteria for core arguments according to how arguments are distributed in document-level events, uses a heterogeneous graph convolutional neural network to inject document-level context into the encoding of argument entities, and classifies argument roles with a machine reading comprehension approach that captures deep semantic information in sentences. On a public document-level event extraction dataset, the proposed method achieves a micro-averaged F1 of 80.1%, comparable to the best methods known to date.

Key words: event extraction, document-level event extraction, machine reading comprehension, graph convolutional neural network


Argument mention recognition model   micro-P   micro-R   micro-F1
BiLSTM+CRF                           88.0      82.9      85.4
BERT+MCRF                            91.0      91.2      91.1
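
The page does not state how the micro-averaged scores are computed. A minimal sketch, assuming the standard definitions (true positives, false positives, and false negatives are pooled over all classes before computing precision and recall, and F1 is their harmonic mean), is given below. The function names `f1_from_pr` and `micro_prf` are illustrative and do not come from the paper.

```python
from typing import Iterable, Tuple


def f1_from_pr(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (same scale as the inputs)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def micro_prf(counts: Iterable[Tuple[int, int, int]]) -> Tuple[float, float, float]:
    """Micro-averaged P, R, F1 from per-class (tp, fp, fn) counts.

    Assumption: counts are pooled across classes first, which is the
    usual meaning of "micro" averaging.
    """
    tp = fp = fn = 0
    for c_tp, c_fp, c_fn in counts:
        tp += c_tp
        fp += c_fp
        fn += c_fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall, f1_from_pr(precision, recall)


# Recompute the micro-F1 column of the table above from its P and R columns.
print(round(f1_from_pr(88.0, 82.9), 1))  # BiLSTM+CRF row -> 85.4
print(round(f1_from_pr(91.0, 91.2), 1))  # BERT+MCRF row  -> 91.1
```

Under this assumption, both rows of the argument mention recognition table reproduce their reported micro-F1 values. Scores recomputed from rounded precision and recall can differ slightly from scores computed on the raw counts.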



Model                S      M      All
DCFEE-O              72.4   52.4   63.2
GIT                  86.8   72.3   79.9
PTPCG                88.2   69.1   79.4
PTPCG (reproduced)   86.2   68.6   78.5
CA-DocEE             88.0   70.3   80.1



Model                                                   micro-P   micro-R   micro-F1
Using the pseudo-triggers of Zhu[1] as core arguments   82.7      76.8      79.6
-MRC                                                    83.2      76.9      79.9
CA-DocEE                                                83.7      77.4      80.1
