Interested in ill-posed learning tasks, uncertainty prediction, conditional density estimation or multi-head deep neural networks ?
In our new paper, accepted at #ICML24, we tackle these challenges by exploring the Winner-Takes-All (WTA) training scheme.
[1/n]
Our paper entitled "Neural Blind Source Separation and Diarization for Distant Speech Recognition" is accepted to Interspeech 2024!
Our neural FCASA is a method to jointly separate and diarize speech mixtures without supervision by isolated signals.
https://t.co/ruPMGTX3M9
INTERSPEECH paper for @tp_adasp:
# 5 - RIR-in-a-Box: Estimating Room Acoustics from 3D Mesh Data through Shoebox Approximation, Liam Kelley, Diego Di Carlo, Mathieu Fontaine,
Aditya Arie Nugraha, Yoshiaki Bando
INTERSPEECH paper for @tp_adasp:
# 4 - Speech dereverberation constrained on room impulse response characteristics, Louis Bahrman, Mathieu Fontaine, Jonathan Le Roux, Gaël Richard
EUSIPCO papers for @tp_adasp:
#2 - A. Emelchenkov, M. Fontaine, Y. Grenier, H. Mahé, F. Roueff; Multifrequency highly oscillating aperiodic amplitude estimation for nonlinear chirp signal
Resilient Multiple Choice Learning: A learned scoring scheme with application to audio scene analysis
by @VLetzelter@MathieuFontai19@Mickael_Chen P. Pérez, S. Essid, G. Richard
tl;dr: extend MCL for cond. distrib. estimation in regression #neurips2023
https://t.co/g4S1eVFAM0
Today, we announce Ego-Exo4D, the largest and most diverse multi-view dataset, showing human experts around the world performing a core set of skilled activities, w/ unprecedented multi-modality, novel new video-language resources, and rich annotations