From Swamp to Stream: How Apache Hudi Transforms the Modern Data LakeApril 6, 2025 by Everton Gomededata lakehouseincremental processingdml
How a POC became a production-ready Hudi data lakehouse through close team collaborationFebruary 12, 2024 by Xiaoxiao Rey and Hussein Awalaleboncoinbeginnergdprdml
Deleting Items from Apache Hudi using Delta Streamer in UPSERT Mode with Kafka Avro MessagesJanuary 18, 2024 by Soumil Shahbeginnerhudi streamerapache kafkaapache avrodml
From Data lake to Microservices: Unleashing the Power of Apache Hudi's Record Level Index with FastAPI and Spark ConnectJanuary 1, 2024 by Soumil Shahbeginnerapache sparkindexingdmlfastapi
Get started with Apache Hudi using AWS Glue by implementing key design concepts – Part 1October 17, 2023 by Srinivas Kandi and Ravi Ithadmlawsindexing
Create an Apache Hudi-based-near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSightAugust 3, 2023 by Raj Ramasubbu, Sundeep Kumar and Rahul Sonawanecdcdmlaws
How Zoom implemented streaming log ingestion and efficient GDPR deletes using Apache Hudi on Amazon EMRMay 16, 2023 by Sekar Srinivasan, Amit Kumar Agrawal, Chandra Dhandapani and Viral Shahstreaminggdprdmlaws
Get started with Apache Hudi using AWS Glue by implementing key design concepts – Part 1October 17, 2022 by Amit Maindola, Srinivas Kandi and Mitesh Pateldmlaws