Yonatan Bitton
Google (United States)(US)Google (Israel)(IL)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Topic Modeling, Natural Language Processing Techniques, Domain Adaptation and Few-Shot Learning, Video Analysis and Summarization
Most-Cited Works
- → DataComp: In search of the next generation of multimodal datasets(2023)73 cited
- → OpenFlamingo: An Open-Source Framework for Training Large Autoregressive Vision-Language Models(2023)70 cited
- → Breaking Common Sense: WHOOPS! A Vision-and-Language Benchmark of Synthetic and Compositional Images(2023)26 cited
- → Data Efficient Masked Language Modeling for Vision and Language(2021)17 cited
- → DOCCI: Descriptions of Connected and Contrasting Images(2024)14 cited
- → What You See is What You Read? Improving Text-Image Alignment Evaluation(2023)14 cited
- → Cross-lingual Unified Medical Language System entity linking in online health communities(2020)13 cited
- → VisIT-Bench: A Benchmark for Vision-Language Instruction Following Inspired by Real-World Use(2023)11 cited
- → A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains(2024)8 cited
- → ImageInWords: Unlocking Hyper-Detailed Image Descriptions(2024)8 cited