James Qin
Publications by Year
Research Areas
Speech Recognition and Synthesis, Speech and Audio Processing, Music and Audio Processing, Natural Language Processing Techniques, Topic Modeling
Most-Cited Works
- → Conformer: Convolution-augmented Transformer for Speech Recognition(2020)2,580 cited
- → LaMDA: Language Models for Dialog Applications(2022)705 cited
- → ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context(2020)250 cited
- → w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training(2021)250 cited
- → Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition(2020)200 cited
- → Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling(2019)183 cited
- → BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition(2022)153 cited
- → Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages(2023)112 cited
- → Vector-quantized Image Modeling with Improved VQGAN(2021)92 cited