Lecture 04: Data Management (FSDL 2022)
YouTube Viewers YouTube Viewers
31.2K subscribers
4,917 views
0

 Published On Premiered Aug 29, 2022

New course announcement ✨

We're teaching an in-person LLM bootcamp in the SF Bay Area on November 14, 2023. Come join us if you want to see the most up-to-date materials building LLM-powered products and learn in a hands-on environment.

https://www.scale.bythebay.io/llm-wor...

Hope to see some of you there!

--------------------------------------------------------------------------------------------- In this video, we cover the data stack from how data is stored and versioned to how it is processed and annotated.

00:00 Key points
01:18 Sources of data: filesystems, latency numbers, object stores, databases, data warehouses
10:48 Exploring data
12:08 Processing data
15:50 Feature stores
17:17 Summary of best practices and some sample datasets
20:31 Self-supervised learning and data labeling
29:52 Data versioning

Detailed notes and slides: https://fullstackdeeplearning.com/cou...

Subscribe to our channel and sign up at https://fullstackdeeplearning.com/cou... to follow along with the 2022 course!

show more

Share/Embed