Anwen Hu
Research Areas
Multimodal Machine Learning Applications, Topic Modeling, Natural Language Processing Techniques, Advanced Image and Video Retrieval Techniques, Video Analysis and Summarization
Most-Cited Works
- mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality (2023), 167 citations
- mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration (2024), 131 citations
- WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training (2021), 85 citations
- UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model (2023), 49 citations
- mPLUG-DocOwl 1.5: Unified Structure Learning for OCR-free Document Understanding (2024), 29 citations
- Leveraging Multi-Token Entities in Document-Level Named Entity Recognition (2020), 29 citations
- ICECAP: Information Concentrated Entity-aware Image Captioning (2020), 19 citations
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding (2023), 17 citations
- TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging (2024), 13 citations
- mPLUG-DocOwl2: High-resolution Compressing for OCR-free Multi-page Document Understanding (2025), 11 citations