0 citations0 references

HybridQA: A Dataset of Multi-Hop Question Answering over Tabular and Textual Data

2020pp. 1026–1036

Citations Over TimeTop 1% of 2020 papers

Wenhu Chen, Hanwen Zha, Zhiyu Chen, Wenhan Xiong, Hong Wang, William Yang Wang

Abstract

Existing question answering datasets focus on dealing with homogeneous information, based either only on text or KB/Table information alone. However, as human knowledge is distributed over heterogeneous forms, using homogeneous information alone might lead to severe coverage problems. To fill in the gap, we present HybridQA 1 , a new large-scale question-answering dataset that requires reasoning on heterogeneous information. Each question is aligned with a Wikipedia table and multiple free-form corpora linked with the entities in the table. The questions are designed to aggregate both tabular information and text information, i.e., lack of either form would render the question unanswerable. We test with three different models: 1) a table-only model.

Related Papers

Overview of Question-Answering(2002)
A Survey on Question and Answering Systems(2012)
Theoretical Analysis of the Benchmark for Choosing Manipulative Instruments of Monetary Policies(2009)
→ Exploring disk performance benchmarks(2017)
→ Support Structure Performance Benchmark(2023)