Reuben Tan
Microsoft Research (United Kingdom)
Research Areas
Multimodal Machine Learning Applications, Human Pose and Action Recognition, Domain Adaptation and Few-Shot Learning, Video Analysis and Summarization, Advanced Image and Video Retrieval Techniques
Most-Cited Works
- Learning Similarity Conditions Without Explicit Supervision (2019), 82 citations
- LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval (2021), 75 citations
- wMAN: Weakly-supervised Moment Alignment Network for Text-based Video Segment Retrieval (2019)
- Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos (2021), 13 citations
- Magma: A Foundation Model for Multimodal AI Agents (2025), 12 citations
- Koala: Key Frame-Conditioned Long Video-LLM (2024), 10 citations
- Language-Guided Audio-Visual Source Separation via Trimodal Consistency (2023), 10 citations
- Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News (2020), 9 citations
- Language Features Matter: Effective Language Representations for Vision-Language Tasks (2019), 9 citations