Actor Critic Algorithms
Siraj Raval
767K subscribers
90,680 views

Published on Dec 16, 2017

Reinforcement learning is hot right now! Policy gradients and deep Q-learning can only get us so far, but what if we used two networks to help train an AI instead of one? That's the idea behind actor-critic algorithms. I'll explain how they work in this video using the "Doom" shooting game as an example.
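
To make the two-learner idea concrete, here is a minimal sketch of one-step actor-critic. It is not the code from the video: instead of Doom and neural networks it assumes a tiny made-up corridor (5 states, move left or right, reward only at the right end) and uses lookup tables in place of the two networks. The function name train_actor_critic and every hyperparameter are illustrative choices. The actor picks actions, the critic estimates how good each state is, and the critic's TD error tells the actor whether the action it just took turned out better or worse than expected.

```python
import numpy as np


def softmax(logits):
    """Turn action preferences into probabilities (numerically stable)."""
    z = logits - logits.max()
    e = np.exp(z)
    return e / e.sum()


def train_actor_critic(episodes=3000, n_states=5, gamma=0.99,
                       actor_lr=0.2, critic_lr=0.2, seed=0):
    """One-step actor-critic on a toy corridor (assumed environment).

    States 0 and n_states-1 are terminal; only the right end pays reward 1.
    Action 0 moves left, action 1 moves right.
    """
    rng = np.random.default_rng(seed)
    actor_logits = np.zeros((n_states, 2))  # actor: action preferences per state
    critic_values = np.zeros(n_states)      # critic: value estimate per state

    for _ in range(episodes):
        state = n_states // 2               # start in the middle
        for _ in range(100):                # step cap, just a safety net
            probs = softmax(actor_logits[state])
            action = rng.choice(2, p=probs)
            next_state = state + (1 if action == 1 else -1)
            done = next_state in (0, n_states - 1)
            reward = 1.0 if next_state == n_states - 1 else 0.0

            # Critic: TD error = (reward + discounted next value) - current value
            target = reward + (0.0 if done else gamma * critic_values[next_state])
            td_error = target - critic_values[state]
            critic_values[state] += critic_lr * td_error

            # Actor: gradient of log softmax is one_hot(action) - probs,
            # scaled by the critic's TD error
            grad_log_pi = -probs
            grad_log_pi[action] += 1.0
            actor_logits[state] += actor_lr * td_error * grad_log_pi

            if done:
                break
            state = next_state

    return actor_logits, critic_values


if __name__ == "__main__":
    logits, values = train_actor_critic()
    print("P(right) in the middle state:", softmax(logits[2])[1])
    print("Critic values:", values)
```

In a deep version, the two tables would be replaced by two networks (or one network with two heads), but the update rule keeps the same shape: the critic learns from the TD error, and the actor is nudged in the direction that error points.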

Code for this video:
https://github.com/llSourcell/actor_c...

i-Nickk's winning code:
https://github.com/I-NicKK/Tic-Tac-Toe

Vignesh's runner up code:
https://github.com/tj27-vkr/Q-learnin...

Taryn's Twitter:
  / tarynsouthern  

More learning resources:
https://papers.nips.cc/paper/1786-act...
http://rll.berkeley.edu/deeprlcourse/...
http://web.mit.edu/jnt/www/Papers/J09...
http://mlg.eng.cam.ac.uk/rowan/files/...
http://mi.eng.cam.ac.uk/~mg436/Lectur...

Please Subscribe! And like. And comment. That's what keeps me going.

Want more inspiration & education? Connect with me:
Twitter:   / sirajraval  
Facebook:   / sirajology  


Join us in the Wizards Slack channel:
http://wizards.herokuapp.com/

And please support me on Patreon:
https://www.patreon.com/user?u=3191693

Instagram:   / sirajraval  

Sign up for my newsletter for exciting updates in the field of AI:
https://goo.gl/FZzJ5w
Hit the Join button above to sign up to become a member of my channel for access to exclusive content!

Join my AI community:
http://chatgptschool.io/

Sign up for my AI Sports betting Bot, WagerGPT! (500 spots available):
https://www.wagergpt.co
