We are looking for a postdoctoral researcher in speech and audio processing, with a possible start in the Fall 2026 semester. If you are interested in working with us, please apply through the following form: https://t.co/ssjinOn6LW
WAVLab @ #ICASSP2026
We will present 8 papers at ICASSP in Barcelona. If you are attending, please stop by the talks/posters and chat with the authors.
arXiv links and presentation info below.
1/5
Congrats to Brian @brianyan918 on finishing his PhD defense today! It was great to see so many people show up for this big event and celebrate such an important milestone. Wishing you all the best in what comes next!
Congratulations to Siddhant @Sid_Arora_18 on a successful PhD defense today! It was wonderful to celebrate this big milestone together. Wishing him all the best for the exciting journey ahead.
Pu Wang, Shinji Watanabe, Hugo Van hamme, "SSVD-O: Parameter-Efficient Fine-Tuning with Structured SVD for Speech Recognition," https://t.co/vM5sgTuMAL
Shih-Heng Wang, Jiatong Shi, Jinchuan Tian, Haibin Wu, Shinji Watanabe, "Do Neural Codecs Generalize? A Controlled Study Across Unseen Languages and Non-Speech Tasks," https://t.co/hCQOKgxCga
Heading to NeurIPS 2025 in San Diego!
I’ll present our spotlight poster, ARECHO, focusing on speech multi-metric estimation.
📍 Exhibit Hall C,D,E #2000
🗓️ Thu Dec 4, 11 a.m.–2 p.m. PST
If you’re around, let’s say hi or grab a coffee!
This is exactly why we worked on ESPnet-Codec, but it is really hard to keep track, as people move so fast nowadays.
A similar issue arises in most speech tasks, from ASR and TTS to general speech LLMs. It's a bit of a sad time for driving scientific findings 🥲
Speech isn’t just sound -> it’s how we turn thought into expression.
Our new work, Speech-DRAME, measures how well speech AI can act, aligning evaluation with human perception.
Paper: https://t.co/QzDuj2eJ6Z
Code: https://t.co/dOMqq8rgFJ