0 works0 citations0 h-index

Anwen Hu

Publications by Year

Research Areas

Multimodal Machine Learning Applications, Topic Modeling, Natural Language Processing Techniques, Advanced Image and Video Retrieval Techniques, Video Analysis and Summarization

Most-Cited Works

→ mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality(2023)167 cited
→ mPLUG-OwI2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration(2024)131 cited
→ WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training(2021)85 cited
→ UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model(2023)49 cited
→ mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding(2024)29 cited
→ Leveraging Multi-Token Entities in Document-Level Named Entity Recognition(2020)29 cited
→ ICECAP: Information Concentrated Entity-aware Image Captioning(2020)19 cited
→ mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding(2023)17 cited
→ TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging(2024)13 cited
→ mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding(2025)11 cited