Dongxu Li
Xiamen University(CN)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Human Pose and Action Recognition, Topic Modeling, Hand Gesture Recognition Systems, Natural Language Processing Techniques
Most-Cited Works
- → BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models(2023)906 cited
- → BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation(2022)862 cited
- → Word-level Deep Sign Language Recognition from Video: A New Large-scale Dataset and Methods Comparison(2020)545 cited
- → InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning(2023)401 cited
- → Transferring Cross-Domain Knowledge for Video Sign Language Recognition(2020)144 cited
- → Enhanced Spatio-Temporal Interaction Learning for Video Deraining: Faster and Better(2022)141 cited
- → From Images to Textual Prompts: Zero-shot Visual Question Answering with Frozen Large Language Models(2023)125 cited
- → ARVo: Learning All-Range Volumetric Correspondence for Video Deblurring(2021)75 cited
- → TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation(2020)73 cited
- → cosFormer: Rethinking Softmax in Attention(2022)65 cited