Can we transform offline audio diffusion into real-time streaming interactive instruments?
Yes!
Presenting Live Music Diffusion Models: a new paradigm for taking your favorite open models into live performance, right on your own laptop! 🎵🎵
🧵
Thrilled to announce “MIDI-LLM: Adapting LLMs for Text-to-MIDI Music Generation” w/ @huangcza and Yoon Kim!
🎸 Live Demo https://t.co/6N6AqyrZuW
💻 https://t.co/Z1v82uool5
🤗 https://t.co/k9deMY82Vk
From a text prompt, it generates MIDIs you can edit directly in a DAW 🧵
How do we design real-time audio jamming models, and do we have the right data? Our new arXiv preprint studies the design and its tradeoffs.
Paper: https://t.co/Nqzj2lI5PE
Audio samples & code: https://t.co/9PfT4XpyPU
This is happening tomorrow from 9 am to 12 pm and 1 pm to 4 pm in East Ballroom C! You can come and interact with our model too!! 🎤 Hope to see you there :)
#NeurIPS2024
We studied some musicians interacting with GaMaDHaNi, a generative model for Hindustani vocal music 🎤! I am so excited to present this work at @NeurIPSConf @ML4CDworkshop this week!
📝https://t.co/4UAZxcA4Uo
🎧https://t.co/VpHooj9f3B
An example interaction with the model:
While the model presents exciting directions for creative explorations and human-AI partnerships, this work notes the experiences of musicians to inform the model's future development!
More information on the generative model: https://t.co/pBjSd7LYVJ
We built a hierarchical generative model to sing Hindustani vocal melodies 🎤! We will be presenting this work at ISMIR 2024! @ISMIRConf
📝Paper: https://t.co/SMZ1DK9mZ7
💻Code: https://t.co/yrE7S6fNtN
👩🏽‍💻Demo: https://t.co/4luZu4vT9d
🎧Samples: https://t.co/5eKhMJkNCp
@0xhexhex Hi, we used two open source datasets collected by folks at @mtg_upf: Saraga and Hindustani Raga Recognition Dataset. Let me know how the tinkering goes!
Intending to use this model for interactive human-AI generation, we present two possible use cases with our model: (1) primed generation and (2) coarse pitch conditioning. Learn more and play around with these interactions in our demo (https://t.co/4luZu4vT9d)!
This looks so cool! Excited to play around with this soon 🥳
Just fyi, seems like this video on the website doesn't have audio enabled @pika_labs @demi_guo_
Introducing Pika 1.0, the idea-to-video platform that brings your creativity to life.
Create and edit your videos with AI.
Rolling out to new users on web and discord, starting today. Sign up at https://t.co/JHRrinsIwx
🚨Understanding In-Context Learning:
1. Pretrained LLMs can implement learning algorithms to learn from data in-context.
2. Transformers can encode multiple algorithms for the same task and use one based on context at inference time.
3. Attention-free models also exhibit ICL.
@the_smg97 @arvshank Yeah that makes sense! So are you saying more discrete changes in speed vs. the current continuous change or just increasing the wavelength of the sine wave to make the change in speed slower and thus less obvious? Or maybe both?
Today @snpranav and I got into an argument over how frustrating it would be for the speed of a song to constantly change as a function of time. To settle it, I decided to use Bespoke to control the speed of 'Spain' by Chick Corea based on a modified sine wave :)
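For anyone curious what "speed as a function of time" could look like in code, here is a minimal sketch. It is not the Bespoke patch from the tweet, and the exact "modified sine wave" is not specified, so a plain sine around a base speed is assumed. The names `playback_speed` and `stepped_speed` and the parameters `base`, `depth`, `period`, and `step` are all hypothetical. The second function illustrates the "more discrete changes" idea from the reply, and a larger `period` corresponds to the "longer wavelength" idea.

```python
import math


def playback_speed(t, base=1.0, depth=0.25, period=8.0):
    """Speed multiplier at time t (seconds).

    Assumed form: a plain sine wave oscillating around `base`.
    A larger `period` makes the speed drift more slowly.
    """
    return base + depth * math.sin(2.0 * math.pi * t / period)


def stepped_speed(t, step=2.0, **kwargs):
    """Discrete variant: hold the speed constant within each `step`-second window."""
    window_start = math.floor(t / step) * step
    return playback_speed(window_start, **kwargs)


if __name__ == "__main__":
    # Print the continuous and stepped speed at a few time points.
    for t in [0.0, 1.0, 2.0, 3.0, 4.0]:
        print(t, round(playback_speed(t), 3), round(stepped_speed(t), 3))
```

With these assumed defaults the speed stays between 0.75x and 1.25x. Raising `period` slows the drift, while `stepped_speed` trades the smooth glide for occasional jumps.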