Shiyin Kang
Group Sense (China)(CN)
Publications by Year
Research Areas
Speech Recognition and Synthesis, Music and Audio Processing, Speech and Audio Processing, Natural Language Processing Techniques, Topic Modeling
Most-Cited Works
- → Phonetic posteriorgrams for many-to-one voice conversion without parallel data training(2016)314 cited
- → Voice conversion using deep Bidirectional Long Short-Term Memory based Recurrent Neural Networks(2015)273 cited
- → Deep Learning for Acoustic Modeling in Parametric Speech Generation: A systematic review of existing techniques and future trends(2015)240 cited
- → Multi-distribution deep belief network for speech synthesis(2013)105 cited
- → FullSubNet+: Channel Attention Fullsubnet with Complex Spectrograms for Speech Enhancement(2022)104 cited
- → DurIAN: Duration Informed Attention Network For Multimodal Synthesis(2019)94 cited
- → Audio-Visual Recognition of Overlapped Speech for the LRS2 Dataset(2020)84 cited
- → DurIAN: Duration Informed Attention Network for Speech Synthesis(2020)77 cited
- → Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams(2016)68 cited
- → A deep recurrent approach for acoustic-to-articulatory inversion(2015)61 cited