Bianet: A Parallel News Corpus in Turkish, Kurdish and English
arXiv (Cornell University), 2018
Abstract
We present a new open-source parallel corpus consisting of news articles collected from the Bianet magazine, an online newspaper that publishes Turkish news, often along with their translations in English and Kurdish. In this paper, we describe the collection process of the corpus and its statistical properties. We validate the benefit of using the Bianet corpus by evaluating bilingual and multilingual neural machine translation models in English-Turkish and English-Kurdish directions.
Related Papers
- Improving Low-Resource Neural Machine Translation with Filtered Pseudo-Parallel Corpus (2017)
- Parallel Corpora Preparation for English-Amharic Machine Translation (2021), 9 cited
- Dutch Parallel Corpus: a multifunctional and multilingual corpus (2006)
- … and never the twain shall meet? (2002), 10 cited
- Central Kurdish machine translation: First large scale parallel corpus and experiments (2021), 2 cited