Jing Yu Koh
Carnegie Mellon University(US)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Generative Adversarial Networks and Image Synthesis, Domain Adaptation and Few-Shot Learning, Topic Modeling, Advanced Neural Network Applications
Most-Cited Works
- → Scaling Autoregressive Models for Content-Rich Text-to-Image Generation(2022)340 cited
- → Cross-Modal Contrastive Learning for Text-to-Image Generation(2021)306 cited
- → Vector-quantized Image Modeling with Improved VQGAN(2021)92 cited
- → Text-to-Image Generation Grounded by Fine-Grained User Attention(2021)54 cited
- → Generating Images with Multimodal Language Models(2023)39 cited
- → A New Path: Scaling Vision-and-Language Navigation with Synthetic Instructions and Imitation Learning(2023)28 cited
- → Grounding Language Models to Images for Multimodal Inputs and Outputs(2023)25 cited
- → Improving Customer Satisfaction in Bike Sharing Systems through Dynamic Repositioning(2019)24 cited
- → VisualWebArena: Evaluating Multimodal Agents on Realistic Visual Web Tasks(2024)19 cited