Filip Pavetic
Google (United States)
Research Areas
Multimodal Machine Learning Applications, Natural Language Processing Techniques, Advanced Image and Video Retrieval Techniques, Domain Adaptation and Few-Shot Learning, Advanced Neural Network Applications
Most-Cited Works
- FlexiViT: One Model for All Patch Sizes (2023), cited 77 times
- PaLI-X: On Scaling up a Multilingual Vision and Language Model (2023), cited 38 times
- On Scaling Up a Multilingual Vision and Language Model (2024), cited 34 times
- PaLI-3 Vision Language Models: Smaller, Faster, Stronger (2023), cited 26 times
- A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision (2023), cited 3 times
- LocCa: Visual Pretraining with Location-aware Captioners (2024), cited 1 time