Owais Khan Mohammed
Microsoft (United States) (US), Microsoft (Finland) (FI)
Research Areas
Multimodal Machine Learning Applications, Natural Language Processing Techniques, Advanced Image and Video Retrieval Techniques, Domain Adaptation and Few-Shot Learning, Topic Modeling
Most-Cited Works
- Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks (2023), 444 cited
- Language Is Not All You Need: Aligning Perception with Language Models (2023), 164 cited
- Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks (2022), 150 cited
- DUBLIN: Visual Document Understanding By Language-Image Network (2023), 3 cited
- ArK: Augmented Reality with Knowledge Interactive Emergent Ability (2023), 2 cited
- VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts (2022)