Xiao Wang
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Advanced Image and Video Retrieval Techniques, Domain Adaptation and Few-Shot Learning, Advanced Neural Network Applications, Natural Language Processing Techniques
Most-Cited Works
- → LiT: Zero-Shot Transfer with Locked-image text Tuning(2022)341 cited
- → Simple Open-Vocabulary Object Detection(2022)250 cited
- → PaLI: A Jointly-Scaled Multilingual Language-Image Model(2022)194 cited
- → Scaling Vision Transformers to 22 Billion Parameters(2023)118 cited
- → Simple Open-Vocabulary Object Detection with Vision Transformers(2022)63 cited
- → PaLI-X: On Scaling up a Multilingual Vision and Language Model(2023)38 cited
- → On Scaling Up a Multilingual Vision and Language Model(2024)34 cited
- → PaLI-3 Vision Language Models: Smaller, Faster, Stronger(2023)26 cited
- → SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features(2025)7 cited
- → Three Towers: Flexible Contrastive Learning with Pretrained Image Models(2023)4 cited