Xizhou Zhu
Tsinghua University(CN)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Advanced Neural Network Applications, Domain Adaptation and Few-Shot Learning, Advanced Image and Video Retrieval Techniques, Topic Modeling
Most-Cited Works
- → Deformable ConvNets V2: More Deformable, Better Results(2019)2,574 cited
- → Deformable DETR: Deformable Transformers for End-to-End Object Detection(2020)1,866 cited
- → InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions(2023)836 cited
- → VL-BERT: Pre-training of Generic Visual-Linguistic Representations(2019)782 cited
- → Deep Feature Flow for Video Recognition(2017)668 cited
- → Flow-Guided Feature Aggregation for Video Object Detection(2017)666 cited
- → Planning-oriented Autonomous Driving(2023)644 cited
- → An Empirical Study of Spatial Attention Mechanisms in Deep Networks(2019)522 cited
- → Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks(2024)314 cited