Ingest streaming data to Apache Hudi tables using AWS Glue and Apache Hudi DeltaStreamer
October 6, 2022 by Vishal Pathak, Anand Prakash and Noritaka Sekiyama
Tags: how-to, streaming ingestion, deltastreamer, amazon

Data processing with Spark: time traveling
September 28, 2022 by Petrica Leuca
Tags: how-to, time travel query, devgenius

Building Streaming Data Lakes with Hudi and MinIO
September 20, 2022 by Matt Sarrel
Tags: how-to, datalake, datalake platform, streaming ingestion, minio

Data Lake / Lakehouse Guide: Powered by Data Lake Table Formats (Delta Lake, Iceberg, Hudi)
August 25, 2022 by Simon Späti
Tags: blog, datalake, lakehouse, comparison, airbyte

Implementation of SCD-2 (Slowly Changing Dimension) with Apache Hudi & Spark
August 24, 2022 by Jayasheel Kalgal, Esha Dhing and Prashant Mishra
Tags: use-case, scd2, walmartglobaltech

Use Flink Hudi to Build a Streaming Data Lake Platform
August 12, 2022 by Chen Yuzhao and Liu Dalong
Tags: blog, apache flink, alibabacloud, streaming ingestion

How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platform
August 9, 2022 by Kevin Chun and Dylan Qu
Tags: use-case, near real-time analytics, incremental processing, amazon

Build Open Lakehouse using Apache Hudi & dbt
July 11, 2022 by Vinoth Govindarajan
Tags: how-to, deltastreamer, incremental processing, apache hudi

Apache Hudi vs Delta Lake - Transparent TPC-DS Lakehouse Performance Benchmarks
June 29, 2022 by Alexey Kudinkin
Tags: performance, datalake, comparison, onehouse

Hudi’s Column Stats Index and Data Skipping feature help speed up queries by an orders of magnitude!
June 9, 2022 by Alexey Kudinkin
Tags: design, indexing, data skipping, onehouse

Asynchronous Indexing using Hudi
June 4, 2022 by Sagar Sumit
Tags: design, multi modal indexing, onehouse, async indexing

The story of building a data lake that can be deleted on a record-by-record basis using Apache Hudi
May 25, 2022 by Shota Ejima
Tags: use-case, gdpr deletion, yahoo