The Moving Drone: Negotiating Agency Between the Voice and the Virtual
Nithya Shikarpur, Victor Arul, Anna Huang
https://t.co/zCx7nLpise [ππ.ππ³]
π¬Published in NIME music track 2026
Generative Modeling of Bach-Style Symbolic Music: A Comparative Study of Autoregressive, Latent-Variable, and Adversarial Approaches
Kyuil Lee, Dezhi Yu, Yongkang Huang
https://t.co/tnFpUaSixv [ππ.ππ³ ππ.π»πΆ]
Fast-SDE: Efficient Single-Microphone Sound Source Distance Estimation in Reverberant Environments
Jiang Wang, Runwu Shi, β¦
https://t.co/FWmCwihYea [ππ.ππ³ ππ.ππΎ]
π¬To appear in the 35th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)
Real-Time Language Model Jamming: A Case Study for Live Music Accompaniment Generation
Bowen Zheng, Andrew H. Yang, Jiaqi Ruan, Jia He, Xinyue Li, Yuan-Hsin Chen, Ziyu Wang, Xiaosong Ma
https://t.co/SxF5nhyS2k [ππ.ππ³ ππ.πΎπ]
π¬Accepted to RTAS 2026
SpAArSIST: Sparsified AASIST for Efficient and Reliable Anti-Spoofing
Anton Firc, VojtΔch StanΔk, ZbynΔk LiΔka, Kamil Malinka, Martin PereΕ‘Γni
https://t.co/mjsnPcRCbB [ππ.ππ³ ππ.π»πΆ]
π¬Accepted at Interspeech 2026
The Hidden Cost of Pairwise Verification in Synthetic Speech Source Tracing
Anton Firc, ZbynΔk LiΔka, VojtΔch StanΔk, Kamil Malinka
https://t.co/AyLVjEGbZM [ππ.ππ³]
π¬Accepted at Interspeech 2026
CS-YODAS: A Mined Dataset of In-the-Wild Code-Switched Speech
Brian Yan, Qingzheng Wang, Matthew Wiesner, Anuj Diwan, Olga Iakovenko, Alexander Polok, Injy Hamed, Shuichiro Shimizu, Iris Emerman Thomas Hain, David R. Mortensen, β¦
https://t.co/b2RJWFqcoe [ππ.ππ³]
Steering Where to Listen: Instruction-Based Activation Steering Redirects Temporal Attention in Large Audio-Language Models
Tsung-En Lin, Hung-Yi Lee
https://t.co/XzmgcfoIIO [ππ.ππ³ ππ.π°πΈ ππππ.π°π]