HIT: Nested Named Entity Recognition via Head-Tail Pair and Token Interaction
Citations Over TimeTop 10% of 2020 papers
Abstract
Named Entity Recognition (NER) is a fundamental task in natural language processing. In order to identify entities with nested structure, many sophisticated methods have been recently developed based on either the traditional sequence labeling approaches or directed hypergraph structures. Despite being successful, these methods often fall short in striking a good balance between the expression power for nested structure and the model complexity. To address this issue, we present a novel nested NER model named HIT. Our proposed HIT model leverages two key properties pertaining to the (nested) named entity, including (1) explicit boundary tokens and (2) tight internal connection between tokens within the boundary. Specifically, we design (1) Head-Tail Detector based on the multi-head selfattention mechanism and bi-affine classifier to detect boundary tokens, and (2) Token Interaction Tagger based on traditional sequence labeling approaches to characterize the internal token connection within the boundary. Experiments on three public NER datasets demonstrate that the proposed HIT achieves state-ofthe-art performance.
Related Papers
- → Beyond Tokens: Fair Evaluation of French Large Language Models for Clinical Named Entity Recognition(2024)1 cited
- NERU: Named Entity Recognition for German(2014)
- → Improving named entity recognition and disambiguation in news headlines(2019)1 cited
- → Analysis of named entity recognition & entity linking in historical text(2016)1 cited