Zhehuai Chen | doi.page


Nvidia (United States)

Research Areas

Speech Recognition and Synthesis, Natural Language Processing Techniques, Topic Modeling, Speech and Audio Processing, Music and Audio Processing

Most-Cited Works

→ Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages (2023), 112 citations
→ Knowledge Distillation for Sequence Model (2018), 71 citations
→ MAESTRO: Matched Speech Text Representations through Modality Matching (2022), 68 citations
→ Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR (2019), 51 citations
→ Improving Speech Recognition Using Consistent Predictions on Synthesized Speech (2020), 50 citations
→ End-to-end Contextual Speech Recognition Using Class Language Models and a Token Passing Decoder (2019), 49 citations
→ On Modular Training of Neural Acoustics-to-Word Model for LVCSR (2018), 35 citations
→ Phone Synchronous Speech Recognition With CTC Lattices (2016), 34 citations
→ Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection (2020), 34 citations
→ Injecting Text in Self-Supervised Speech Pretraining (2021), 25 citations