0 works0 citations0 h-index

Nabarun Goswami

The University of Tokyo(JP)

Publications by Year

Research Areas

Speech and Audio Processing, Speech Recognition and Synthesis, Music and Audio Processing, Generative Adversarial Networks and Image Synthesis, Face recognition and analysis

Most-Cited Works

→ Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation(2018)166 cited
→ Recursive Speech Separation for Unknown Number of Speakers(2019)87 cited
→ PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation(2018)74 cited
→ The Sound Demixing Challenge 2023 – Music Demixing Track(2024)21 cited
→ SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate(2022)2 cited
→ Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation(2024)2 cited
→ ARTalk: Speech-Driven 3D Head Animation via Autoregressive Model(2025)2 cited