NAPS: Natural Program Synthesis Dataset
arXiv (Cornell University)2018
Citations Over Time
Abstract
We present a program synthesis-oriented dataset consisting of human written problem statements and solutions for these problems. The problem statements were collected via crowdsourcing and the program solutions were extracted from human-written solutions in programming competitions, accompanied by input/output examples. We propose using this dataset for the program synthesis tasks aimed for working with real user-generated data. As a baseline we present few models, with the best model achieving 8.8% accuracy, showcasing both the complexity of the dataset and large room for future research.
Related Papers
- A Search Task Dataset for German Textual Entailment(2013)
- → Statistical Machine Transliteration Baselines for NEWS 2018(2018)7 cited
- → A Survey of Recent Abstract Summarization Techniques(2021)7 cited
- → GEM: A General Evaluation Benchmark for Multimodal Tasks(2021)7 cited
- → A Large-Scale Chinese Short-Text Conversation Dataset(2020)6 cited