Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning
2023pp. 11844–11857
Citations Over TimeTop 10% of 2023 papers
Zhuolin Yang, Ping Wei, Zihan Liu, Vijay Anand Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Mohammad Shoeybi, Ming-Yu Liu, Yuke Zhu, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar
Abstract
Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Mohammad Shoeybi, Ming-Yu Liu, Yuke Zhu, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar. Findings of the Association for Computational Linguistics: EMNLP 2023. 2023.
Related Papers
- → OSCAR and ActivityNet: an Image Captioning model can effectively learn a Video Captioning dataset(2021)1 cited
- → Video Captioning via Hierarchical Reinforcement Learning(2017)22 cited
- → Boosted Attention: Leveraging Human Attention for Image Captioning(2019)1 cited
- → Image Captioning Methodologies Using Deep Learning: A Review(2020)
- → Image Captioning using Neural Networks(2022)