Improving English-to-Indian Language Neural Machine Translation Systems
Abstract
Most Indian languages lack sufficient parallel data for training Machine Translation (MT) systems. In this study, we build English-to-Indian language Neural Machine Translation (NMT) systems using the state-of-the-art Transformer architecture. We also investigate the utility of back-translation and its effect on system performance. Our experimental evaluation shows that back-translation improves the BLEU scores of both the English-to-Hindi and English-to-Bengali NMT systems, and that it is more useful for improving weaker baseline MT systems. In addition, we perform a manual evaluation of the translation outputs and observe that the BLEU metric cannot always assess MT quality as well as human evaluators can. Our analysis shows that the MT outputs for the English–Bengali pair are actually better than their BLEU scores suggest.