Pengchuan Zhang
Greentech (France)(FR)Green Technology(US)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Domain Adaptation and Few-Shot Learning, Advanced Neural Network Applications, Advanced Image and Video Retrieval Techniques, Human Pose and Action Recognition
Most-Cited Works
- → AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks(2018)1,864 cited
- → Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks(2020)1,493 cited
- → VinVL: Revisiting Visual Representations in Vision-Language Models(2021)846 cited
- → RegionCLIP: Region-based Language-Image Pretraining(2022)466 cited
- → Dynamic DETR: End-to-End Object Detection with Dynamic Attention(2021)385 cited
- → Florence: A New Foundation Model for Computer Vision(2021)340 cited
- → Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding(2021)316 cited
- → Object-Driven Text-To-Image Synthesis via Adversarial Training(2019)314 cited
- → An Empirical Study of Training End-to-End Vision-and-Language Transformers(2022)308 cited
- → Focal Self-attention for Local-Global Interactions in Vision Transformers(2021)267 cited