a public good project by the
Synthesis
Company
of California

© 2026

James Qin | doi.page

0 works0 citations0 h-index

Google Scholar OpenAlex

James Qin

Publications by Year

Research Areas

Speech Recognition and Synthesis, Speech and Audio Processing, Music and Audio Processing, Natural Language Processing Techniques, Topic Modeling

Most-Cited Works

→ Conformer: Convolution-augmented Transformer for Speech Recognition(2020)2,580 cited
→ LaMDA: Language Models for Dialog Applications(2022)705 cited
→ ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context(2020)250 cited
→ w2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training(2021)250 cited
→ Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition(2020)200 cited
→ Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling(2019)183 cited
→ BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition(2022)153 cited
→ Google USM: Scaling Automatic Speech Recognition Beyond 100 Languages(2023)112 cited
→ Vector-quantized Image Modeling with Improved VQGAN(2021)92 cited