Ruihua Song
Renmin University of China(CN)
Publications by Year
Research Areas
Multimodal Machine Learning Applications, Video Analysis and Summarization, Topic Modeling, Speech and Audio Processing, Domain Adaptation and Few-Shot Learning
Most-Cited Works
- → Pre-trained models: Past, present and future(2021)909 cited
- → Towards artificial general intelligence via a multimodal foundation model(2022)252 cited
- → WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training(2021)85 cited
- → Evaluating the Effectiveness of Personalized Web Search(2008)65 cited
- → AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios(2022)41 cited
- → Intelligent Agents with LLM-based Process Automation(2024)40 cited
- → Class-Aware Sounding Objects Localization via Audiovisual Correspondence(2021)37 cited
- → Neural Storyboard Artist(2019)31 cited
- → Image Inspired Poetry Generation in XiaoIce(2018)31 cited
- → What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?(2024)28 cited