Nabarun Goswami
The University of Tokyo(JP)
Publications by Year
Research Areas
Speech and Audio Processing, Speech Recognition and Synthesis, Music and Audio Processing, Generative Adversarial Networks and Image Synthesis, Face recognition and analysis
Most-Cited Works
- → Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation(2018)166 cited
- → Recursive Speech Separation for Unknown Number of Speakers(2019)87 cited
- → PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation(2018)74 cited
- → The Sound Demixing Challenge 2023 – Music Demixing Track(2024)21 cited
- → SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate(2022)2 cited
- → Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation(2024)2 cited
- → ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model(2025)2 cited