Botian Shi
Beijing Academy of Artificial Intelligence (CN), Shanghai Artificial Intelligence Laboratory
Research Areas
Advanced Neural Network Applications, Multimodal Machine Learning Applications, Topic Modeling, Natural Language Processing Techniques, Autonomous Vehicle Technology and Safety
Most-Cited Works
- How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites (2024), 175 cited
- UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation (2020), 169 cited
- LoGoNet: Towards Accurate 3D Object Detection with Local-to-Global Cross-Modal Fusion (2023), 151 cited
- Multi-Sensor Fusion and Cooperative Perception for Autonomous Driving: A Review (2023), 134 cited
- Drive Like a Human: Rethinking Autonomous Driving with Large Language Models (2024), 123 cited
- UniViLM: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation (2020)
- Knowledge Aware Semantic Concept Expansion for Image-Text Matching (2019), 74 cited
- Dense Procedure Captioning in Narrated Instructional Videos (2019), 71 cited
- Homogeneous Multi-modal Feature Fusion and Interaction for 3D Object Detection (2022), 50 cited
- Microsoft Concept Graph: Mining Semantic Concepts for Short Text Understanding (2019), 41 cited