We have a pretty awesome line up at the conversational AI track at GTC!
https://t.co/CgBIo22ttc
Tomorrow I'll be moderating talks from Facebook AI Research on dialogue agents in Minecraft as well as a talk on GPT-3 from OpenAI!
We’ve had a dream for many years of a TTS engine with enough emotional range to do voice acting. We’re incredibly proud to unveil Flowtron, the TTS narrating the 2020 I AM AI video. Take a listen: https://t.co/J6jDe1TdC7
We’ve had a dream for many years of a TTS engine with enough emotional range to do voice acting. We’re incredibly proud to unveil Flowtron, the TTS narrating the 2020 I AM AI video. Take a listen: https://t.co/J6jDe1TdC7
Very often I'd turn around from my desk to be delighted with blowing examples from Deep Learning Super Sampling 2.0! Definitely a game changer! DLSS 2.0 is "a model to rule them all", just like WaveGlow! Take a look and compare DLSS with other methods!
We just released the paper and code for Mellotron: a multispeaker voice synthesis model that can make a voice emote and sing without emotive or singing training data.
https://t.co/XyzIUXcFXa
Celebrate WaveGlow's anniversary with us! Listen to our new samples and download our new model weights with higher audio quality and higher throughput. WaveGlow latest implementation produces audio samples at a rate of 4850 kHz! 😱 https://t.co/3LSj8khZ22
The stable release of @PyTorch 1.0 is here! A big thank you to the community that's formed around #PyTorch and to those contributing with code, feedback, or new projects. Learn how to get started here: https://t.co/GX8poSLL6o
Image padding causes small but measurable artifacts for CNNs. Partial convolution padding improves ResNet-50 Top1 by 0.478% on average, and improves semantic segmentation results on image borders. Paper by @GuilinL: https://t.co/PGUQYkkKpW Code: https://t.co/XmEb0cm626
We have published our Tacotron 2 and WaveGlow trained model! Visit our repository https://t.co/TWZVhS9s8c to check out our text-to-speech synthesis demo and build your own high quality TTS models with faster than real-time inference!
WaveGlow inference: now faster! We improved mel-spectrogram inversion speed from 500 kHz to 1200 kHz on a single V100 GPU. The model is the same, just implemented better. More to come! https://t.co/JICEKsd00V
We just open-sourced a suite of ODE solvers in PyTorch:
https://t.co/y3INuQUs9C
Everything happens on the GPU and is differentiable. Now you can use ODEs in your deep learning models! Credit to @rtqichen.