Premiered Sep 7, 2022
New course announcement ✨
We're teaching an in-person LLM bootcamp in the SF Bay Area on November 14, 2023. Join us if you want to see our most up-to-date materials on building LLM-powered products and learn in a hands-on environment.
https://www.scale.bythebay.io/llm-wor...
Hope to see some of you there!
---------------------------------------------------------------------------------------------
In this video, we cover the seventh lab in the course, on deploying ML-powered applications using Gradio and a model-as-a-service architecture.
Try out the lab on Colab here:
https://fsdl.me/lab07-colab
Find all the labs on GitHub here:
https://github.com/full-stack-deep-le...
Subscribe to our channel or sign up at https://fullstackdeeplearning.com/cou... to follow along with the 2022 course!
00:00 Overview
01:06 Compiling the model to TorchScript
06:00 Why not deploy on GPUs?
08:58 Building a GUI with Gradio
15:34 Spinning up a model service
21:11 Creating a public URL with ngrok
24:52 Writing a Dockerfile for our server
30:06 Recap