Hands-On Guide: Reading Data from Hudi Tables Incrementally, Joining with Delta Tables using HudiStreamer and SQL-Based TransformerApril 3, 2024 bySoumil Shahblogapache hudideltastreamerhudi streamerdeltasql transformerlinkedin
Record Level Indexing in Apache Hudi Delivers 70% Faster Point LookupsMarch 30, 2024 bySoumil Shahblogapache hudirecord level indexperformancelinkedin
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocksFebruary 6, 2024 bySoumil Shahblogapache hudilinkedinbeginnerapache sparkapache hivehive metastoreminiostarrocksdockerpythonpostgrespostgresql
Learn How to Move Data From MongoDB to Apache Hudi Using PySparkJanuary 20, 2024 bySoumil Shahblogapache hudilinkedinbeginnermongodbapache sparkpyspark
Deleting Items from Apache Hudi using Delta Streamer in UPSERT Mode with Kafka Avro MessagesJanuary 18, 2024 bySoumil Shahblogapache hudilinkedinbeginnerhudi streamerdeltastreamerapache kafkaapache avroupsertdelete
Small Talk about Apache HudiJanuary 5, 2024 byAshok Kumar Kunkalablogapache hudilinkedinbeginnerinsertsupsertscowmor
From Data lake to Microservices: Unleashing the Power of Apache Hudi's Record Level Index with FastAPI and Spark ConnectJanuary 1, 2024 bySoumil Shahblogapache hudilinkedinbeginnerapache sparkrecord level indexpysparkupsertsFastAPI
Mastering Data Lakes: A Deep Dive into MINIO, Hudi, and Delta StreamerNovember 30, 2023 bySoumil Shahapache hudiminohow-todeltastreamerlinkedin
Real-Time Data Processing with Postgres, Debezium, Kafka, Schema Registry, and Delta Streamer Guide for BegineersNovember 26, 2023 bySoumil Shahapache hudipostgreshow-todebeziumapache kafkadeltastreamerlinkedin
Hudi Streamer (Delta Streamer) Hands-On Guide: Local Ingestion from Parquet SourceNovember 19, 2023 bySoumil Shahapache hudihudi streamerhow-toapache parquetlinkedin
UPSERT Performance Evaluation of Hudi 0.14 and Spark 3.4.1: Record Level Index vs. Global Bloom & Global Simple IndexesOctober 29, 2023 bySoumil Shahlinkedinapache hudiqueryingindexingperformance
Hudi Best Practices: Handling Failed Inserts/Upserts with Error TablesJuly 2, 2023 bySoumil Shahbloglinkedinapache hudiinsertsupserts