Xiquan Li
Shanghai Jiao Tong University(CN)
Publications by Year
Research Areas
Music and Audio Processing, Speech Recognition and Synthesis, Generative Adversarial Networks and Image Synthesis, Music Technology and Sound Studies, Speech and Audio Processing
Most-Cited Works
- → EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark(2024)22 cited
- → DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning(2025)3 cited
- → SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs(2025)3 cited
- → SLAM-Omni: Timbre-Controllable Voice Interaction System with Single-Stage Training(2025)2 cited
- → Variation in Urban Forest Regulation of Air Particulate Matter Concentration(2022)2 cited
- → URO-Bench: Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models(2025)
- Audio ControlNet for Fine-Grained Audio Generation and Editing(2026)
- → SemanticAudio: Audio Generation and Editing in Semantic Space(2026)
- → Towards Reliable Large Audio Language Model(2025)
- → SAC: Neural Speech Codec with Semantic-Acoustic Dual-Stream Quantization(2025)