Load data incrementally from transactional data lakes to data warehouses
October 19, 2023 by Noritaka Sekiyama
Tags: incremental updates, amazon, how to, querying, aws, redshift, apache hudi
Get started with Apache Hudi using AWS Glue by implementing key design concepts – Part 1
October 17, 2023 by Srinivas Kandi and Ravi Itha
Tags: aws glue, apache hudi, how-to, amazon, design, upserts, bulk insert, indexing
Simplify operational data processing in data lakes using AWS Glue and Apache Hudi
September 13, 2023 by Srinivas Kandi and Ravi Itha
Tags: aws glue, amazon, how-to, data processing, apache hudi, aws
Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight
August 3, 2023 by Raj Ramasubbu, Sundeep Kumar and Rahul Sonawane
Tags: how-to, cdc, upserts, amazon
How Zoom implemented streaming log ingestion and efficient GDPR deletes using Apache Hudi on Amazon EMR
May 16, 2023 by Sekar Srinivasan, Amit Kumar Agrawal, Chandra Dhandapani and Viral Shah
Tags: use-case, streaming ingestion, gdpr deletion, delete, amazon
Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 2: AWS Glue Studio Visual Editor
March 20, 2023 by Noritaka Sekiyama, Scott Long and Sean Ma
Tags: aws glue, glue studio, blog, amazon
Automate schema evolution at scale with Apache Hudi in AWS Glue
February 7, 2023 by Subhro Bose, Eva Fang and Ketan Karalkar
Tags: how-to, schema evolution, amazon
Introducing native support for Apache Hudi, Delta Lake, and Apache Iceberg on AWS Glue for Apache Spark, Part 1: Getting Started
January 27, 2023 by Akira Ajisaka, Noritaka Sekiyama and Savio Dsouza
Tags: blog, amazon
Build your Apache Hudi data lake on AWS using Amazon EMR – Part 1
November 22, 2022 by Suthan Phillips and Dylan Qu
Tags: how-to, best-practices, amazon
How Hudl built a cost-optimized AWS Glue pipeline with Apache Hudi datasets
November 10, 2022 by Indira Balakrishnan, Ramzi Yassine and Swagat Kulkarni
Tags: use-case, cost-efficiency, incremental-processing, near real-time analytics, amazon
Get started with Apache Hudi using AWS Glue by implementing key design concepts – Part 1
October 17, 2022 by Amit Maindola, Srinivas Kandi and Mitesh Patel
Tags: how-to, bulk-insert, amazon
Ingest streaming data to Apache Hudi tables using AWS Glue and Apache Hudi DeltaStreamer
October 6, 2022 by Vishal Pathak, Anand Prakash and Noritaka Sekiyama
Tags: how-to, streaming ingestion, deltastreamer, amazon
How NerdWallet uses AWS and Apache Hudi to build a serverless, real-time analytics platform
August 9, 2022 by Kevin Chun and Dylan Qu
Tags: use-case, near real-time analytics, incremental-processing, amazon
New features from Apache Hudi 0.9.0 on Amazon EMR
April 4, 2022 by Kunal Gautam, Gabriele Cacciola and Udit Mehrotra
Tags: blog, amazon
Zendesk - Insights for CTOs: Part 3 – Growing your business with modern data capabilities
March 24, 2022 by Syed Jaffry and Johnathan Hwang
Tags: blog, modern data-architecture, near real-time analytics, gdpr deletion, streaming ingestion, amazon
Build a serverless pipeline to analyze streaming data using AWS Glue, Apache Hudi, and Amazon S3
March 9, 2022 by Nikhil Khokhar and Dipta Bhattacharya
Tags: how-to, streaming ingestion, amazon
Create a low-latency source-to-data lake pipeline using Amazon MSK Connect, Apache Flink, and Apache Hudi
March 1, 2022 by Ali Alemi
Tags: how-to, streaming ingestion, apache flink, apache kafka, amazon
New features from Apache Hudi 0.7.0 and 0.8.0 available on Amazon EMR
December 20, 2021 by Udit Mehrotra and Gagan Brahmi
Tags: blog, amazon
How GE Aviation built cloud-native data pipelines at enterprise scale using the AWS platform
November 16, 2021 by Alcuin Weidus and Suresh Patnam
Tags: use-case, analytics at-scale, amazon
How Amazon Transportation Service enabled near-real-time event analytics at petabyte scale using AWS Glue with Apache Hudi
October 14, 2021 by Madhavan Sriram, Diego Menin, Gabriele Cacciola and Kunal Gautam
Tags: use-case, near real-time analytics, analytics at-scale, amazon
Part 1: Query Apache Hudi dataset in an Amazon S3 data lake with Amazon Athena: Read optimized queries
July 16, 2021 by Dhiraj Thakur, Sameer Goel and Imtiaz Sayed
Tags: how-to, read-optimized-queries, amazon
Build Slowly Changing Dimensions Type 2 (SCD2) with Apache Spark and Apache Hudi on Amazon EMR
April 12, 2021 by David Greenshtein
Tags: how-to, scd2, amazon
Build a data lake using Amazon Kinesis Data Streams for Amazon DynamoDB and Apache Hudi
March 4, 2021 by Dhiraj Thakur, Dylan Qu and Saurabh Shrivastava
Tags: how-to, streaming ingestion, amazon
New – Insert, Update, Delete Data on S3 with Amazon EMR and Apache Hudi
November 15, 2019 by Danilo Poccia
Tags: blog, amazon