Published On Premiered Aug 29, 2022
New course announcement ✨
We're teaching an in-person LLM bootcamp in the SF Bay Area on November 14, 2023. Come join us if you want to see the most up-to-date materials building LLM-powered products and learn in a hands-on environment.
https://www.scale.bythebay.io/llm-wor...
Hope to see some of you there!
--------------------------------------------------------------------------------------------- In this video, we cover the data stack from how data is stored and versioned to how it is processed and annotated.
00:00 Key points
01:18 Sources of data: filesystems, latency numbers, object stores, databases, data warehouses
10:48 Exploring data
12:08 Processing data
15:50 Feature stores
17:17 Summary of best practices and some sample datasets
20:31 Self-supervised learning and data labeling
29:52 Data versioning
Detailed notes and slides: https://fullstackdeeplearning.com/cou...
Subscribe to our channel and sign up at https://fullstackdeeplearning.com/cou... to follow along with the 2022 course!