Zhehuai Chen
Nvidia (United States)
Research Areas
Speech Recognition and Synthesis, Natural Language Processing Techniques, Topic Modeling, Speech and Audio Processing, Music and Audio Processing
Most-Cited Works
- Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages (2023), 112 citations
- Knowledge Distillation for Sequence Model (2018), 71 citations
- MAESTRO: Matched Speech Text Representations through Modality Matching (2022), 68 citations
- Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR (2019), 51 citations
- Improving Speech Recognition Using Consistent Predictions on Synthesized Speech (2020), 50 citations
- End-to-end Contextual Speech Recognition Using Class Language Models and a Token Passing Decoder (2019), 49 citations
- On Modular Training of Neural Acoustics-to-Word Model for LVCSR (2018), 35 citations
- Phone Synchronous Speech Recognition With CTC Lattices (2016), 34 citations
- Improving Speech Recognition Using GAN-Based Speech Synthesis and Contrastive Unspoken Text Selection (2020), 34 citations
- Injecting Text in Self-Supervised Speech Pretraining (2021), 25 citations