Scott Wisdom @ScottTWisdom - Twitter Profile

about 1 year ago

Veo 3 is here, and in addition to better visuals, it makes noises and speaks! This was a massive effort made possible by incredible passion from the whole Veo team and the many other team enabling it to launch today. Looking forward to seeing what others do with it! #veo3

12

232

30

37

20K

ScottTWisdom retweeted

Sundar Pichai

@sundarpichai

about 1 year ago

Veo 3, our SOTA video generation model, has native audio generation and is absolutely mindblowing. For filmmakers + creatives, we’re combining the best of Veo, Imagen and Gemini into a new filmmaking tool called Flow. Ready today for Google AI Pro and Ultra plan subscribers.

9

842

59

96

93K

ScottTWisdom retweeted

Google DeepMind @GoogleDeepMind

almost 2 years ago

We're sharing progress on our video-to-audio (V2A) generative technology. 🎥 It can add sound to silent clips that match the acoustics of the scene, accompany on-screen action, and more. Here are 4 examples - turn your sound on. 🧵🔊 https://t.co/VHpJ2cBr24

90

1K

349

360

529K

ScottTWisdom retweeted

Vivek Kumar @vivek_kumar

over 2 years ago

It's so awesome to see the impact of the computational audio capabilities we developed featured in @madebygoogle 🎉 🎉 🎉 Congrats to John Hershey, @ScottTWisdom, @PGetreuer & everyone who contributed for pioneering new computational audio capabilities in Pixel8 #MadeByGoogle

4

60

16

4

20K

Who to follow

Qiuqiang Kong

@QiuqiangK

Assistant Professor at @CUHKofficial, previously at @ByteDanceTalk, Ph.D. at @UniOfSurrey

Anurag Kumar

@AcouIntel

Research Scientist, @GoogleDeepMind | Prev: @AIatMeta | CMU @SCSatCMU | @IITKanpur | Audio/Speech, Multimodal AI

Xubo Liu

@LiuXub

Research Scientist, Meta Superintelligence Labs

ScottTWisdom retweeted

Jonathan Le Roux @JonathanLeRoux

about 3 years ago

Sorry it took forever (I did the editing this year...): videos of all #SANE2022 talks by @TweetRupal @mhnt1580 @ScottTWisdom @tnsainath @shinjiw_at_cmu @anoopcherian @gan_chuang are finally available! Here's the essential binge-watching YouTube playlist👇 https://t.co/kLnv3XxV3p

0

40

14

5

4K

ScottTWisdom retweeted

Jonathan Le Roux @JonathanLeRoux

over 3 years ago

Strong showing at #SANE2022 to learn about the latest and greatest in speech and audio research from a stellar lineup!

JonathanLeRoux's tweet photo. Strong showing at #SANE2022 to learn about the latest and greatest in speech and audio research from a stellar lineup! https://t.co/BCfEStIonc

2

70

3

1

0

ScottTWisdom retweeted

Efthymios Tzinis @ETzinis

over 3 years ago

Here is a short presentation of AudioScopeV2!📢 @ScottTWisdom and I are looking forward to discussing further about open-domain on-screen sound separation and meeting you in #ECCV2022! webpage:https://t.co/tnFj8h2vSU... arxiv:https://t.co/Xl8MyGN44c video:https://t.co/lRb19rp34N

1

26

2

1

0

ScottTWisdom retweeted

Jonathan Le Roux @JonathanLeRoux

over 3 years ago

Full list of speakers and talk details for #SANE2022 (Thursday 10/6, Cambridge, MA) now available! @anoopcherian @gan_chuang @mhnt1580 @TweetRupal @tnsainath @shinjiw_at_cmu @ScottTWisdom Poster & demo submissions due 9/21. Registration/Details: https://t.co/gKea2KEc05

1

35

8

0

ScottTWisdom retweeted

Efthymios Tzinis @ETzinis

almost 4 years ago

I am 😃 that we will present AudioScopeV2 at #ECCV2022! If you want to learn about improved audio-visual attention models and calibration for on-screen sound separation check our paper w. @ScottTWisdom! project-page: https://t.co/56xex144Qx new dataset: https://t.co/26F3UgkJ4P

1

21

3

1

0

ScottTWisdom retweeted

AK

@_akhaliq

almost 4 years ago

Distance-Based Sound Separation abs: https://t.co/FMb1QKgibn project page: https://t.co/a29MkM7qpO With a single nearby speaker and four distant speakers, the model improves scale-invariant signal to noise ratio by 4.4 dB for near sounds and 6.8 dB for far sounds

_akhaliq's tweet photo. Distance-Based Sound Separation
abs: https://t.co/FMb1QKgibn
project page: https://t.co/a29MkM7qpO

With a single nearby speaker and four distant speakers, the model improves scale-invariant signal to noise ratio by 4.4 dB for near sounds and 6.8 dB for far sounds https://t.co/nvtx2tbV02

0

104

24

20

0

ScottTWisdom retweeted

Jonathan Le Roux @JonathanLeRoux

about 4 years ago

SANE is back! Thursday, Oct. 6 in Kendall Square, Cambridge, MA. Confirmed speakers: A. Cherian @anoopcherian, C. Gan @gan_chuang, W.-N. Hsu @mhnt1580, T. Sainath @tnsainath, S. Watanabe @shinjiw_at_cmu, S. Wisdom @ScottTWisdom. More details: https://t.co/gKea2KEc05

1

26

4

0

ScottTWisdom retweeted

Aswin Sivaraman @actuallyaswin

over 4 years ago

Happy to see my summer work with @ScottTWisdom, Hakan Erdogan, and John Hershey was accepted for presentation at @ieeeICASSP 2022 😊 My first ICASSP paper in the books! Immensely thankful for their mentorship. Our first version can be found on arXiv at: https://t.co/xZ0BL7znaN

actuallyaswin's tweet photo. Happy to see my summer work with @ScottTWisdom, Hakan Erdogan, and John Hershey was accepted for presentation at @ieeeICASSP 2022 😊 My first ICASSP paper in the books! Immensely thankful for their mentorship. Our first version can be found on arXiv at: https://t.co/xZ0BL7znaN https://t.co/Jiiz5wqvDv

2

36

1

0

ScottTWisdom retweeted

Sundar Pichai

@sundarpichai

over 4 years ago

We can learn a lot about our environment just by listening to the birds. New #GoogleAI approaches can help isolate and identify birdsongs, helping ecologists better understand food systems and forest health. 🐦 https://t.co/Va9kjPTHRj

101

2K

152

17

0

ScottTWisdom retweeted

Eduardo Fonseca @edfonseca_

over 4 years ago

Our paper received a #WASPAA2021 special award for *Best Audio Representation Learning Paper*: "Self-Supervised Learning from Automatically Separated Sound Scenes". 🎉🚀 paper: https://t.co/NvEhyI8BzE talk: https://t.co/TD2x6Gs9b8 slides: https://t.co/U0LcbcgjfC 👇

9

88

16

6

0

ScottTWisdom retweeted

Yuma Koizumi @yuma_koizumi

over 4 years ago

Our DF-Conformer paper has received the “Best Speech Enhancement Paper Award” from #WASPAA2021! Yay!!

2

77

15

2

0

ScottTWisdom retweeted

Eduardo Fonseca @edfonseca_

over 4 years ago

🔊Here's the video presentation of our WASPAA21 paper: "Self-Supervised Learning from Automatically Separated Sound Scenes". Work done during an internship at Google Research. paper: https://t.co/NvEhyI8BzE video: https://t.co/TD2x6Gs9b8 slides: https://t.co/U0LcbcgjfC

edfonseca_'s tweet photo. 🔊Here's the video presentation of our WASPAA21 paper: "Self-Supervised Learning from Automatically Separated Sound Scenes". Work done during an internship at Google Research.
paper: https://t.co/NvEhyI8BzE
video: https://t.co/TD2x6Gs9b8
slides: https://t.co/U0LcbcgjfC https://t.co/s31B3O5nf4

1

68

11

3

0

ScottTWisdom retweeted

Eduardo Fonseca @edfonseca_

over 5 years ago

🔊Happy to announce FSD50K: the new open dataset of human-labeled sound events! Over 51k Freesound audio clips, totalling over 100h of audio manually labeled using 200 classes drawn from the AudioSet Ontology. Paper: https://t.co/fn5NSsdkgy Dataset: https://t.co/DmeCDQj6yW

edfonseca_'s tweet photo. 🔊Happy to announce FSD50K: the new open dataset of human-labeled sound events! Over 51k Freesound audio clips, totalling over 100h of audio manually labeled using 200 classes drawn from the AudioSet Ontology.

Paper: https://t.co/fn5NSsdkgy
Dataset: https://t.co/DmeCDQj6yW https://t.co/oKzW55LGWp

4

239

79

36

0

ScottTWisdom retweeted

Efthymios Tzinis @ETzinis

over 5 years ago

I am thrilled to announce that our paper "Unsupervised Sound Separation using Mixtures of Mixtures" got accepted to #NeurIPS2020 as a #Spotlight paper!! 📢📢 All kudos to @ScottTWisdom and the rest of the Google guys! https://t.co/2nbGkjNABI

3

69

11

5

0

ScottTWisdom retweeted

Mirco Ravanelli @mirco_ravanelli

almost 6 years ago

We are very happy to announce that all the videos of our recent #ICML2020 workshop on self-supervised learning are now publicly available at https://t.co/rn07I6IVuJ Thanks #ICML2020 and @SlidesLive for that! @MILAMontreal #DeepLearning #AI #Speech #MachineLearning

mirco_ravanelli's tweet photo. We are very happy to announce that all the videos of our recent #ICML2020 workshop on self-supervised learning are now publicly available at

https://t.co/rn07I6IVuJ

Thanks #ICML2020 and @SlidesLive for that!

@MILAMontreal #DeepLearning #AI #Speech #MachineLearning https://t.co/tkOJDQbBNz

2

126

56

18

0

Scott Wisdom @ScottTWisdom

almost 6 years ago

Glad you like it, thanks for the nice summary!

Joe Antognini @joe_antognini

almost 6 years ago

I'm a bit late posting this, but a very cool paper from Scott Wisdom and collaborators (including @ETzinis) out of Google introducing "MixIT": https://t.co/9FVhFwntL6 They tackle the problem of *unsupervised* source separation! 1/10

1

3

0

1

5

0

Scott Wisdom

@ScottTWisdom

Who to follow

Last Seen Users on Sotwe

Trends for you

Most Popular Users