WaveNet by Google DeepMind | Two Minute Papers #93

1.55M subscribers

129,052 views

About
Share

Published On Sep 12, 2016

Let's talk about Google DeepMind's Wavenet! This piece of work is about generating audio waveforms for Text To Speech and more. Text To Speech basically means that we have a voice reading whatever we have written down. The difference in this work, is, however that it can synthesize these samples in someone's voice provided that we have training samples of this person speaking.

__________________________

The paper "WaveNet: A Generative Model for Raw Audio" is available here:
https://arxiv.org/abs/1609.03499

The blog post about this with the sound samples is available here:
https://deepmind.com/blog/wavenet-gen...

The machine learning reddit thread about this paper is available here:
https://www.reddit.com/r/MachineLearn...

Recommended for you:
Every Two Minute Papers episode on deep learning:    • AI and Deep Learning - Two Minute Papers

WE WOULD LIKE TO THANK OUR GENEROUS PATREON SUPPORTERS WHO MAKE TWO MINUTE PAPERS POSSIBLE:
Sunil Kim, Julian Josephs, Daniel John Benton, Dave Rushton-Smith, Benjamin Kang.
  / twominutepapers

We also thank Experiment for sponsoring our series. - https://experiment.com/

Thanks so much to JulioC EA for the Spanish captions! :)

Subscribe if you would like to see more of these! - http://www.youtube.com/subscription_c...

Music: Dat Groove by Audionautix is licensed under a Creative Commons Attribution license (https://creativecommons.org/licenses/...)
Artist: http://audionautix.com/

The thumbnail background image was found on Pixabay - https://pixabay.com/hu/spektrum-hangs...
Splash screen/thumbnail design: Felícia Fehér - http://felicia.hu

Károly Zsolnai-Fehér's links:
Facebook →   / twominutepapers
Twitter →   / karoly_zsolnai
Web → https://cg.tuwien.ac.at/~zsolnai/

Published On Sep 12, 2016

Share/Embed

Video Link