Ruihua Song | doi.page

Ruihua Song

Renmin University of China (CN)

Research Areas

Multimodal Machine Learning Applications, Video Analysis and Summarization, Topic Modeling, Speech and Audio Processing, Domain Adaptation and Few-Shot Learning

Most-Cited Works

→ Pre-trained models: Past, present and future (2021), 909 cited
→ Towards artificial general intelligence via a multimodal foundation model (2022), 252 cited
→ WenLan: Bridging Vision and Language by Large-Scale Multi-Modal Pre-Training (2021), 85 cited
→ Evaluating the Effectiveness of Personalized Web Search (2008), 65 cited
→ AdaSpeech 4: Adaptive Text to Speech in Zero-Shot Scenarios (2022), 41 cited
→ Intelligent Agents with LLM-based Process Automation (2024), 40 cited
→ Class-Aware Sounding Objects Localization via Audiovisual Correspondence (2021), 37 cited
→ Neural Storyboard Artist (2019), 31 cited
→ Image Inspired Poetry Generation in XiaoIce (2018), 31 cited
→ What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? (2024), 28 cited