Soumil Shah
40.5K subscribers
8:04
Hudi with Kyuubi, a distributed & multi-tenant gateway, to provide serverless SQL on lakehouses
Soumil Shah
70 views • 3 days ago
5:18
Learn How to read Data from Kafka and insert New Data into Postgres Using Trino
Soumil Shah
58 views • 5 days ago
4:33
Learn How to Query Kafka Topics RealTime with Trino
Soumil Shah
69 views • 5 days ago
2:03
Gratitude Overflowing: A Huge Thank You to All 40,000 Subscribers for Your Unwavering Love & Support
Soumil Shah
64 views • 8 days ago
17:16
Bringing Data From MySQL to Kafka Using Debezium, Joining Kafka Topics with Flink and ingest data
Soumil Shah
243 views • 10 days ago
7:55
Build Universal Data lake with MySQL + Debezium+Kafka+DeltaStreamer + Minio+HiveMetastore+Trino
Soumil Shah
132 views • 2 weeks ago
27:58
Build Universal Data lake with Postgres + Debezium+Kafka+DeltaStreamer + Minio+HiveMetastore+Trino
Soumil Shah
795 views • 2 weeks ago
4:05
Upcoming Next Video: Real World Data Engineering Project|with Kafka + Debezium+ deltaStreamer+ Trino
Soumil Shah
189 views • 2 weeks ago
8:55
Reading Data from Hudi INC & Joining with Delta Tables using HudiStreamer & SQL-Based Transformer
Soumil Shah
69 views • 3 weeks ago
10:38
Building a Universal Data Lakehouse with Apache XTable, MinIO, and Trino (hudi | Iceberg| Delta)
Soumil Shah
154 views • 3 weeks ago
23:10
Building DataLakeHouse: XTable, MinIO, StarRocks, DeltaStreamer - Interoperating Hudi, IceBerg,Delta
Soumil Shah
120 views • 3 weeks ago
2:00
Universal DataLakehouse: Unlocking Data at Scale for the World's Most Sophisticated Organization|4/9
Soumil Shah
54 views • 4 weeks ago
4:19
How to Get Email Alerts When AWS DMS Task Fails in an Event-Driven Fashion
Soumil Shah
72 views • 4 weeks ago
3:48
A Simple Config-Driven Python Template for Rapid DMS to S3 Task | Single Task per Table Strategy
Soumil Shah
63 views • 4 weeks ago
11:49
Dynamically Build and Schedule DeltaStreamer Jobs to EMR Serverless and Airflow Dag Creation
Soumil Shah
98 views • 1 month ago
4:26
From Uber to Open Source: Apache Hudi's Story
Soumil Shah
142 views • 1 month ago
13:02
Schedule Your DeltaStreamer Job Using Airflow & Run Them on EMR Serverless hands on Labs
Soumil Shah
297 views • 1 month ago
6:53
How to perform Backfilling jobs with Hudi DeltaStreamer and Spark SQL using SqlSource Class
Soumil Shah
71 views • 1 month ago
9:08
Mastering Incremental ETL with DeltaStreamer and SQL-Based Transformer
Soumil Shah
109 views • 1 month ago
9:39
DeltaStreamer & XTable: Hudi & Iceberg Interoperability | EMR Serverless Labs
Soumil Shah
68 views • 1 month ago
4:45
Managing Updates & Deletes in Glue Hudi Spark Jobs with CDC Data: Using _hoodie_is_deleted Flag
Soumil Shah
67 views • 1 month ago
7:46
Learn how to Run DeltaStreamer with onetable on AWS Glue Hands on Labs
Soumil Shah
94 views • 1 month ago
11:31
Learn How to use Glue 4.0 interactive session with Glue Dbt Adapter to join Hudi Tables
Soumil Shah
115 views • 1 month ago
10:07
Multi-Modal Indexing: RLI, ColumnStats, DeltaStreamer, OneTable | Interop Hudi, Iceberg & Delta
Soumil Shah
60 views • 1 month ago
7:14
Learn How to use Onetable with your DeltaStreamer and Interoperate between Hudi Iceberg&Delta Lakes
Soumil Shah
56 views • 1 month ago
8:49
Interoperate Between Hudi Delta and Iceberg Hands-on Labs on AWS | Hands on labs
Soumil Shah
96 views • 1 month ago
4:35
How to Query Apache Hudi tables from Glue Interactive Notebook for AdHoc Analysis
Soumil Shah
87 views • 1 month ago
7:30
Learn How you can run DeltaStreamer Running on AWS Glue with Hudi 0.14 Step by Step Guide
Soumil Shah
115 views • 1 month ago
8:49
Getting Started with Open Data lineage | Marquez Project | Apache Hudi Spark jobs
Soumil Shah
314 views • 2 months ago
14:55
Build Incremental ETL pipeline with Hudi and Airflow and MinIO
Soumil Shah
306 views • 2 months ago