Published on Sep 12, 2016
Let's talk about Google DeepMind's WaveNet! This piece of work is about generating audio waveforms for Text To Speech and more. Text To Speech basically means that we have a voice reading whatever we have written down. The difference in this work, however, is that it can synthesize these samples in someone's voice, provided that we have training samples of this person speaking.
__________________________
The paper "WaveNet: A Generative Model for Raw Audio" is available here:
https://arxiv.org/abs/1609.03499
The blog post about this with the sound samples is available here:
https://deepmind.com/blog/wavenet-gen...
The machine learning reddit thread about this paper is available here:
https://www.reddit.com/r/MachineLearn...
Recommended for you:
Every Two Minute Papers episode on deep learning: • AI and Deep Learning - Two Minute Papers
WE WOULD LIKE TO THANK OUR GENEROUS PATREON SUPPORTERS WHO MAKE TWO MINUTE PAPERS POSSIBLE:
Sunil Kim, Julian Josephs, Daniel John Benton, Dave Rushton-Smith, Benjamin Kang.
/ twominutepapers
We also thank Experiment for sponsoring our series. - https://experiment.com/
Thanks so much to JulioC EA for the Spanish captions! :)
Subscribe if you would like to see more of these! - http://www.youtube.com/subscription_c...
Music: Dat Groove by Audionautix is licensed under a Creative Commons Attribution license (https://creativecommons.org/licenses/...)
Artist: http://audionautix.com/
The thumbnail background image was found on Pixabay - https://pixabay.com/hu/spektrum-hangs...
Splash screen/thumbnail design: Felícia Fehér - http://felicia.hu
Károly Zsolnai-Fehér's links:
Facebook → / twominutepapers
Twitter → / karoly_zsolnai
Web → https://cg.tuwien.ac.at/~zsolnai/