Ronghang Hu
Research Areas
Multimodal Machine Learning Applications, Domain Adaptation and Few-Shot Learning, Topic Modeling, Natural Language Processing Techniques, Advanced Image and Video Retrieval Techniques
Most-Cited Works
- ConvNeXt V2: Co-designing and Scaling ConvNets with Masked Autoencoders (2023), 1,179 citations
- Natural Language Object Retrieval (2016), 574 citations
- Learning to Reason: End-to-End Module Networks for Visual Question Answering (2017), 497 citations
- FLAVA: A Foundational Language And Vision Alignment Model (2022), 470 citations
- Segmentation from Natural Language Expressions (2016), 409 citations
- Modeling Relationships in Referential Expressions with Compositional Modular Networks (2017), 404 citations
- Learning to Segment Every Thing (2018), 314 citations
- UniT: Multimodal Multitask Learning with a Unified Transformer (2021), 270 citations
- Speaker-Follower Models for Vision-and-Language Navigation (2018), 244 citations
- SAM 2: Segment Anything in Images and Videos (2024), 212 citations