GEM: A General Evaluation Benchmark for Multimodal Tasks
2021
Citations Over Time
Lin Su, Nan Duan, Edward Cui, Lei Ji, Chenfei Wu, Huaishao Luo, Yongfei Liu, Ming Zhong, Taroon Bharti, Arun Sacheti
Abstract
In this paper, we present GEM 1 as a General Evaluation benchmark for Multimodal tasks. Different from existing datasets such as GLUE Comparing with existing multimodal datasets such as MSCOCO (Chen et al., 2015) and Flicker30K We also provide two baseline models for this benchmark. We will release the dataset, code and baseline models, aiming to advance the development of multilingual multimodal research.
Related Papers
- Method for Improvement of Baseline Resolving Quality in GPS Measurement(2007)
- Analysis of the Baseline Decorrelation and Critical Baseline of Interferometric SAR(2003)
- Theoretical Analysis of the Benchmark for Choosing Manipulative Instruments of Monetary Policies(2009)
- Research of Baseline Implement(2010)
- Study and application of telecom operators system security state baseline(2012)