Reducing class imbalance during active learning for named entity annotation
Citations Over TimeTop 10% of 2009 papers
Abstract
In lots of natural language processing tasks, the classes to be dealt with often occur heavily imbalanced in the underlying data set and classifiers trained on such skewed data tend to exhibit poor performance for low-frequency classes. We introduce and compare different approaches to reduce class imbalance by design within the context of active learning (AL). Our goal is to compile more balanced data sets up front during annotation time when AL is used as a strategy to acquire training material. We situate our approach in the context of named entity recognition. Our experiments reveal that we can indeed reduce class imbalance and increase the performance of classifiers on minority classes while preserving a good overall performance in terms of macro F-score.
Related Papers
- → WordPerfect 5.0 Macro Capabilities and an Accessions List Macro(1988)
- 다중 사용자 환경에서 Annotation 인터페이스의 설계 및 구현(2002)
- Social Filtering 환경에서 사용자 관심사를 고려한 Annotation 디스플레이 설계 및 구현(2002)
- On the Important Content Characters about Annotation of Xiaojing by Tang Xuan_zong(2005)
- Annotation of Li Shan WenXuan——One Annotation Phenomenon Which is Poles Apart with China Classics Annotation(2006)