Storing 200 Billion Entities: Notion’s Data Lake Project | November 12, 2024 | by ByteByteGo | tags: blog, apache hudi, use-case, bytebytego
Uber’s Big Data Revolution: From MySQL to Hadoop and Beyond | September 14, 2024 | by Vu Trinh | tags: blog, apache hudi, use-case, substack
Navigating the Future: The Evolutionary Journey of Upstox’s Data Platform | March 10, 2024 | by Manish Gaurav | tags: use-case, apache hudi, upstox-engineering
Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for success | February 27, 2024 | by Toney Thomas, Ben Vengerovsky and Rada Stanic | tags: blog, apache hudi, use-case, data mesh, amazon
How a POC became a production-ready Hudi data lakehouse through close team collaboration | February 12, 2024 | by Xiaoxiao Rey and Hussein Awala | tags: use-case, apache hudi, leboncoin-tech-blog, beginner, delete, gdpr deletion, upsert
How Zoom implemented streaming log ingestion and efficient GDPR deletes using Apache Hudi on Amazon EMR | May 16, 2023 | by Sekar Srinivasan, Amit Kumar Agrawal, Chandra Dhandapani and Viral Shah | tags: use-case, streaming ingestion, gdpr deletion, deletes, amazon
Lakehouse at Fortune 1 Scale | May 3, 2023 | by Samuel Guleff | tags: use-case, comparison, performance, walmartglobaltech
Build Your First Hudi Lakehouse with AWS S3 and AWS Glue | December 19, 2022 | by Nadine Farah | tags: how-to, use-case, apache hudi, aws s3, aws glue
How Hudl built a cost-optimized AWS Glue pipeline with Apache Hudi datasets | November 10, 2022 | by Indira Balakrishnan, Ramzi Yassine and Swagat Kulkarni | tags: use-case, cost efficiency, incremental processing, near real-time analytics, amazon
Implementation of SCD-2 (Slowly Changing Dimension) with Apache Hudi & Spark | August 24, 2022 | by Jayasheel Kalgal, Esha Dhing and Prashant Mishra | tags: use-case, scd2, walmartglobaltech
How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platform | August 9, 2022 | by Kevin Chun and Dylan Qu | tags: use-case, near real-time analytics, incremental processing, amazon
The story of building a data lake that can be deleted on a record-by-record basis using Apache Hudi | May 25, 2022 | by Shota Ejima | tags: use-case, gdpr deletion, yahoo