Jaekwon Im @osalooloo - Twitter Profile

about 1 year ago

https://t.co/YP8kYnwCiR 카이스트에서 진행한 유하영 재즈워크샵 영상이 올라왔습니다. 아주 좋은 질문으로 가득한 좋은 시간이었네요. 진행 @osalooloo 주최 @juhan_nam 기획 @keunwoochoi

0

2

1

646

Jaekwon Im @osalooloo

about 1 year ago

I'm attending #ICASSP2025 and presenting FlashSR. I’d love to connect with anyone interested in audio super-resolution, audio generation, diffusion, flow matching, or related topics. Please feel free to stop by the session or message me for a coffee chat.

osalooloo's tweet photo. I'm attending #ICASSP2025 and presenting FlashSR. I’d love to connect with anyone interested in audio super-resolution, audio generation, diffusion, flow matching, or related topics. Please feel free to stop by the session or message me for a coffee chat. https://t.co/pIaErgzP8f

Jaekwon Im @osalooloo

over 1 year ago

🌟 Excited to announce the release of the code and model weights for FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation, accepted at ICASSP 2025! 🎉 🔗 Check out the demo, code, and paper here: https://t.co/I2JfTmWsHl

osalooloo's tweet photo. 🌟 Excited to announce the release of the code and model weights for FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation, accepted at ICASSP 2025! 🎉

🔗 Check out the demo, code, and paper here: https://t.co/I2JfTmWsHl https://t.co/5cAsjLXKUE

3

26

3

15

3K

0

4

0

522

Jaekwon Im @osalooloo

over 1 year ago

⚡ Achieves performance approximately 14 times faster than real-time on a single A6000 GPU. 🔬 Applies diffusion distillation to the audio super-resolution task, and introduces the SR Vocoder, specifically designed for SR models operating on mel-spectrograms.

0

2

0

298

Jaekwon Im @osalooloo

over 1 year ago

🌟 Excited to announce the release of the code and model weights for FlashSR: One-step Versatile Audio Super-resolution via Diffusion Distillation, accepted at ICASSP 2025! 🎉 🔗 Check out the demo, code, and paper here: https://t.co/I2JfTmWsHl

3

26

3

15

3K

Who to follow

Alkis Koudounas

@AlkisKoudounas

Research @Sony | ex- @AmazonScience | PhD @PoliTOnews || Post-training SpeechLLMs | Trustworthy and Responsible AI

Yuan Gong

@YGongND

Prev. Grok Voice Modeling Lead @xAI, MIT CSAIL, Notre Dame

Gaudio Lab, Inc.

@gaudiolab

Open a new realm of your world with Gaudio Lab’s AI powered audio technologies. Generative Sound | Source Separation | Spatial Audio

Jaekwon Im @osalooloo

over 1 year ago

🚀 A one-step diffusion model for audio super-resolution that can upsample various types of audio—music, speech, and sound effects—from any sampling rate between 4kHz and 32kHz to 48kHz.

0

2

0

293

Jaekwon Im @osalooloo

over 1 year ago

Key highlights of FlashSR:

0

2

0

388

osalooloo retweeted

Keunwoo Choi @keunwoochoi

about 2 years ago

hi music people, i wrote a tutorial on large language models and music information retrieval. of course it's called.. LLMs <3 MIR 🥁 have fun! https://t.co/g7VXyW0lgd

4

200

26

133

26K

osalooloo retweeted

이준원 Junwon Lee @jnwnlee

about 2 years ago

Are you wondering how to make a Synchronized Foley Sound for Sora-made videos? Here’s our work 🔊T-Foley. (ICASSP 2024) Demo https://t.co/0dm6WUXymQ Code https://t.co/48OyVRFzaZ (It was a pleasure to work with @yo__j__)

1

22

3

4

2K

Jaekwon Im @osalooloo

about 2 years ago

1

4

1

0

238

Jaekwon Im @osalooloo

about 2 years ago

Super excited to share my work "DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech" at #ICASSP2024 on April 17th. Looking forward to connecting with many researchers :). I'm always up for a coffee chat too! Demo: https://t.co/PjgAQMHWzT

osalooloo's tweet photo. Super excited to share my work "DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech" at #ICASSP2024 on April 17th.
Looking forward to connecting with many researchers :).
I'm always up for a coffee chat too!

Demo: https://t.co/PjgAQMHWzT https://t.co/IuVqWfacY6

1

17

0

775

osalooloo retweeted

Christian Steinmetz

@csteinmetz1

over 2 years ago

DIFFRENT: A Diffusion Model for Recording Environment Transfer of Speech Proposes a diffusion model to transfer the environment from a reference sound to a new recording taking into account the microphone and placement, room acoustics, and ambient noise. https://t.co/nZ5vALZdOH

6

69

10

19

3K

Jaekwon Im @osalooloo

over 2 years ago

@csteinmetz1 Thank you so much for sharing my work!!😀😀

0

6

0

170

osalooloo retweeted

Keunwoo Choi @keunwoochoi

almost 3 years ago

https://t.co/z533CBcKuR Our paper about DCASE Challenge T7 - Foley Sound Synthesis was accepted to the DCASE Workshop 🥳 I can't make it to Finland🇫🇮, but some of the authors will be there to tell you what we went through while organizing the first generative challenge at DCASE.

0

36

6

4

3K

osalooloo retweeted

AuditoryLab @AuditoryLabInfo

almost 3 years ago

Worked hard, learned a lot, and met a great team during the first-ever competition for generative everyday sounds at @DCASE_Challenge, Task 7 Foley sound! @keunwoochoi @osalooloo @KeisukeImoto @forthshinji @YukiOkamoto19 Mathieu Lagrange, Brian McFee.

1

4

1

645

osalooloo retweeted

arXiv Sound @ArxivSound

about 3 years ago

``Foley Sound Synthesis at the DCASE 2023 Challenge. (arXiv:2304.12521v1 [https://t.co/mPAjntoGrG]),'' Keunwoo Choi, Jaekwon Im, Laurie Heller, Brian McFee, Keisuke Imoto, Yuki Okamoto, Mathieu Lagrange, Shinosuke Takamichi, https://t.co/pOr7wZp1R8

0

23

8

5

2K

osalooloo retweeted

AuditoryLab @AuditoryLabInfo

over 3 years ago

Organizers: Keunwoo Choi, Jaekwon Im, Gaudio Labs Keisuke Imoto, Doshisha U Mathieu Lagrange, CNRS/Ecole Central de Nantes Laurie Heller, CMU Brian McFee, NYU Yuki Okamoto, Ritsumeikan U Shinnosuke Takamichi, The U of Tokyo @KeisukeImoto @keunwoochoi @forthshinji @osalooloo

0

1

0

913

osalooloo retweeted

AuditoryLab @AuditoryLabInfo

over 3 years ago

Just launched a generative audio challenge for environmental sounds! Synthesis by ML or any method. Deadline May 15. Task 7 of @DCASE_Challenge 2023. https://t.co/rk9axDvztW @ieee_AASP @acousticsorg @AESorg @kaggle #foleysynthesischallenge #audiosynthesis #soundclassification

2

5

1

2

309

Jaekwon Im

@osalooloo

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users