Daniel Salz
Research Areas
Multimodal Machine Learning Applications, Natural Language Processing Techniques, Topic Modeling, Advanced Image and Video Retrieval Techniques, Domain Adaptation and Few-Shot Learning
Most-Cited Works
- PaLI: A Jointly-Scaled Multilingual Language-Image Model (2022), 194 citations
- PaLI-X: On Scaling up a Multilingual Vision and Language Model (2023), 38 citations
- On Scaling Up a Multilingual Vision and Language Model (2024), 34 citations
- PaLI-3 Vision Language Models: Smaller, Faster, Stronger (2023), 26 citations
- PaliGemma: A versatile 3B VLM for transfer (2024), 11 citations
- Scaling Pre-training to One Hundred Billion Data for Vision Language Models (2025), 2 citations
- Gemini Embedding: Generalizable Embeddings from Gemini (2025), 2 citations
- Improve Supervised Representation Learning with Masked Image Modeling (2023), 1 citation
- TIPS: Text-Image Pretraining with Spatial awareness (2024), 1 citation
- EmbeddingGemma: Powerful and Lightweight Text Representations (2025)