Rapid development of Hindi named entity recognition using conditional random fields and feature induction
ACM Transactions on Asian Language Information Processing2003Vol. 2(3), pp. 290–294
Citations Over TimeTop 11% of 2003 papers
Abstract
This paper describes our application of conditional random fields with feature induction to a Hindi named entity recognition task. With only five days development time and little knowledge of this language, we automatically discover relevant features by providing a large array of lexical tests and using feature induction to automatically construct the features that most increase conditional likelihood. In an effort to reduce overfitting, we use a combination of a Gaussian prior and early stopping based on the results of 10-fold cross validation.
Related Papers
- → Negation-based transfer learning for improving biomedical Named Entity Recognition and Relation Extraction(2023)26 cited
- → Named entity recognition in Assamese using CRFS and rules(2014)12 cited
- → Linguistic Features for Named Entity Recognition Using CRFs(2010)5 cited
- → Named Entity Recognition in Hindi Using Conditional Random Fields(2016)3 cited
- → Research on Chinese Named Entity Recognition Based on Ontology(2012)