0 citations0 references

An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering

2019pp. 220–227

Citations Over TimeTop 10% of 2019 papers

Shayne Longpre, Yi Lu, Zhucheng Tu, Chris DuBois

Abstract

To produce a domain-agnostic question answering model for the Machine Reading Question Answering (MRQA) 2019 Shared Task, we investigate the relative benefits of large pretrained language models, various data sampling strategies, as well as query and context paraphrases generated by back-translation. We find a simple negative sampling technique to be particularly effective, even though it is typically used for datasets that include unanswerable questions, such as SQuAD 2.0. When applied in conjunction with per-domain sampling, our XLNet (Yang et al., 2019)-based submission achieved the second best Exact Match and F1 in the MRQA leaderboard competition.

Related Papers

Students’ Problems in Learning Conjunction(2016)
Mengajar kata hubung(2006)
THE USE OF CONJUNCTIONS IN KOMPAS SELECTED SHORT STORY BY SENO GUMIRA AJIDARMA(2020)