Eider: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage Fusion
Abstract
Document-level relation extraction (DocRE) aims to extract semantic relations among entity pairs in a document. Typical DocRE methods blindly take the full document as input, while a subset of the sentences in the document, noted as the evidence, is often sufficient for humans to predict the relation of an entity pair. In this paper, we propose an evidence-enhanced framework, EIDER, that empowers DocRE by efficiently extracting evidence and effectively fusing the extracted evidence in inference. We first jointly train an RE model with a lightweight evidence extraction model, which is efficient in both memory and runtime. Empirically, even training the evidence model on silver labels constructed by our heuristic rules can lead to better RE performance. We further design a simple yet effective inference process that makes RE predictions on both the extracted evidence and the full document, then fuses the predictions through a blending layer. This allows EIDER to focus on important sentences while still having access to the complete information in the document. Extensive experiments show that EIDER outperforms state-of-the-art methods on three benchmark datasets (e.g., by 1.37/1.26 Ign F1/F1 on DocRED).

Example:
Head: Hero of the Day
Tail: the United States
Rel: [country of origin]
GT evidence sentences: [1,10]
Extracted evidence: [1,10]

Original document as input:
[1] Load is the sixth studio album by the American heavy metal band Metallica, released on June 4, 1996 by Elektra Records in the United States
[9] It was certified 5× platinum for shipping five million copies in the United States.
[10] Four singles - "Hero of the Day", "Until It Sleeps", "Mama Said", and "King Nothing" - were released as part of the marketing campaign for the album.
Prediction scores: NA: 17.63, country of origin: 14.79

Extracted evidence as input:
[1] Load is the sixth studio album released in the United States
[10] Four singles - "Hero of the Day", were released for the album.
Prediction scores: country of origin: 18.31, NA: 13.45

Final prediction of our model: country of origin
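The inference-stage fusion described in the abstract can be illustrated with a minimal sketch. The abstract does not give the form of the blending layer, so this sketch assumes the simplest possible blend: the per-label scores from the full-document pass and the evidence-only pass are summed, and a relation is predicted when its fused score exceeds the fused NA score. The function names `fuse_scores` and `predict` are hypothetical and are not taken from the paper or its code.

```python
# Hypothetical sketch of inference-stage fusion. ASSUMPTION: the blend is a
# plain sum of per-label scores; the actual blending layer may differ.

def fuse_scores(full_doc_scores, evidence_scores):
    """Sum the scores each label received from the two inference passes."""
    labels = set(full_doc_scores) | set(evidence_scores)
    return {
        label: full_doc_scores.get(label, 0.0) + evidence_scores.get(label, 0.0)
        for label in labels
    }


def predict(fused_scores, na_label="NA"):
    """Return every relation scoring above the NA label, or NA if none does."""
    threshold = fused_scores[na_label]
    relations = [
        label
        for label, score in fused_scores.items()
        if label != na_label and score > threshold
    ]
    return sorted(relations) or [na_label]


# Scores from the example: full document vs. extracted evidence as input.
full_doc = {"NA": 17.63, "country of origin": 14.79}
evidence = {"NA": 13.45, "country of origin": 18.31}

fused = fuse_scores(full_doc, evidence)
print(predict(full_doc))  # prediction from the full document alone
print(predict(fused))     # prediction after fusing both passes
```

With the example scores, the full document alone ranks NA above country of origin (17.63 vs. 14.79), while the summed scores rank country of origin above NA (33.10 vs. 31.08), which matches the final prediction shown in the example.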