``Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information,'' Nicholas Sanders, Yuanchao Li, Korin Richmond, Simon King, https://t.co/u23yq5ZVqf
@ZhengjunYue @ieeeICASSP @dorienherremans That someone is me, haha. I am getting a refund for the hotel I booked, but the flight is non-refundable, and now I have no funding to attend the satellite event in Suzhou. And it's not only Chinese students; African students (and perhaps students from some other regions as well) are facing the same problem.
#Interspeech2025 Please be sure to select “14.07 Responsible Speech Foundation Models (Special Session)” as your paper subject area when making a submission in the CMT system
📢📢CFP & Spread the word!!
Again, "Responsible Speech Foundation Models" will appear as a special session @ISCAInterspeech #Interspeech2025
Last time, we got 8 great papers from CMU, CUHK, NTU, Apple, and more. Come join the party and win a paper award!!
https://t.co/mS2AnXmeG7
@pengyf21 I sometimes ask speech researchers if they use ASR or SDS in their daily lives. The answers are pretty much the same: "No" with an awkward laugh😂
@jiatongshi My pleasure, Jiatong! As I am working on acoustic similarity across audio, speech, and music, as well as their real and synthetic forms, I find VERSA to be very timely and nice work. Bravo!
For those whose excellent work on speech foundation models was unfortunately not accepted by #ICASSP2025, you may consider resubmitting to our session at #Interspeech2025 😋😋
👏👏Check out the winner @enshi_zhang of our Generative Error Correction Challenge for LLM-based emotion recognition. They achieved 75.2% accuracy on ASR transcriptions of IEMOCAP using text refinement and GPT-4.0.
``Improving Speech-based Emotion Recognition with Contextual Utterance Analysis and LLMs,'' Enshi Zhang, Christian Poellabauer, https://t.co/UrKzYZIUPX
``Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition,'' Chao-Han Huck Yang, Taejin Park, Yuan Gong, Yuanchao Li, Zhehuai Chen, Yen-Ting Lin, Chen Chen, Yuchen… https://t.co/PdOoz4Yy5k
@Peppermint_2525 It’s too bad that Honda ended the ASIMO project just 3 months after I joined the Innovation Lab. Now the Innovation Lab has closed as well. Honda and several other Japanese companies had the hardware resources but were just a few steps short of becoming Tesla. Perhaps the organizational structure is to blame?
👏👏Check out the 2nd best-performing system in our Generative Error Correction Challenge for LLM-based emotion recognition (yet unreasonably rejected by SLT2024). They achieved 75.1% accuracy on ASR transcriptions of IEMOCAP using GPT-4o.
``Context and System Fusion in Post-ASR Emotion Recognition with Large Language Models,'' Pavel Stepachev, Pinzhen Chen, Barry Haddow, https://t.co/FYKdxu55T2
SIGDIAL was successfully hosted by the Speech and Audio Processing Lab @KyotoU_News, where I did my master's studies. Check out the keynote talk given by my senpai, Koji: https://t.co/tlFPWVA0MS
Successfully finished hosting SIGDIAL and YRRSDS this week, and also gave my own keynote talk! Great to meet new and old friends in Kyoto! Thanks for attending!
Dear speech researchers,
ICASSP'25 is looking for a reviewer in the speech and language areas.
Please nominate yourself via https://t.co/KjAd5EAbUy!
Note that this form is only for the speech and language areas.