Dhruv Batra
Georgia Institute of Technology(US)Menlo School(US)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Domain Adaptation and Few-Shot Learning, Advanced Image and Video Retrieval Techniques, Human Pose and Action Recognition, Topic Modeling
Most-Cited Works
- → Grad-CAM: Visual Explanations from Deep Networks via Gradient-Based Localization(2017)20,436 cited
- → VQA: Visual Question Answering(2015)4,206 cited
- → Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering(2017)2,114 cited
- → ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks(2019)1,673 cited
- → Hierarchical Question-Image Co-Attention for Visual Question Answering(2016)1,216 cited
- → Graph R-CNN for Scene Graph Generation(2018)879 cited
- → Joint Unsupervised Learning of Deep Representations and Image Clusters(2016)782 cited
- → A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories(2016)613 cited