Get started with Apache Hudi using AWS Glue by implementing key design concepts – Part 1October 17, 2022 byAmit Maindola,Srinivas KandiandMitesh Patelhow-tobulk-insertamazon
What, Why and How : Apache Hudi’s Bloom IndexOctober 8, 2022 bySivabalan Narayananhow-todesignbloomindexingmedium
Ingest streaming data to Apache Hudi tables using AWS Glue and Apache Hudi DeltaStreamerOctober 6, 2022 byVishal Pathak,Anand PrakashandNoritaka Sekiyamahow-tostreaming ingestiondeltastreameramazon
Data processing with Spark: time travelingSeptember 28, 2022 byPetrica Leucahow-totime travel querydevgenius
Building Streaming Data Lakes with Hudi and MinIOSeptember 20, 2022 byMatt Sarrelhow-todatalakedatalake platformstreaming ingestionminio
Data Lake / Lakehouse Guide: Powered by Data Lake Table Formats (Delta Lake, Iceberg, Hudi)August 25, 2022 bySimon Spätiblogdatalakelakehousecomparisonairbyte
Implementation of SCD-2 (Slowly Changing Dimension) with Apache Hudi & SparkAugust 24, 2022 byJayasheel Kalgal,Esha DhingandPrashant Mishrause-casescd2walmartglobaltech
Use Flink Hudi to Build a Streaming Data Lake PlatformAugust 12, 2022 byChen YuzhaoandLiu Dalongblogapache flinkalibabacloudstreaming ingestion
How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platformAugust 9, 2022 byKevin ChunandDylan Quuse-casenear real-time analyticsincremental processingamazon
Build Open Lakehouse using Apache Hudi & dbtJuly 11, 2022 byVinoth Govindarajanhow-todeltastreamerincremental processingapache hudi
Apache Hudi vs Delta Lake - Transparent TPC-DS Lakehouse Performance BenchmarksJune 29, 2022 byAlexey Kudinkinperformancedatalakecomparisononehouse
Hudi’s Column Stats Index and Data Skipping feature help speed up queries by an orders of magnitude!June 9, 2022 byAlexey Kudinkindesignindexingdata skippingonehouse