Building Data Lakes on AWS with Kafka Connect, Debezium, Apicurio Registry, and Apache HudiFebruary 27, 2024 byGary A. Staffordblogapache hudiitnextbeginnerapache kafkakafka connectdebeziumapicurio registryawsapache sparkdeltastreamerhudi streameramazon rdsamazon mksamazon eksaws glueamazon emr
Use Amazon Athena with Spark SQL for your open-source transactional table formatsJanuary 24, 2024 byPathik Shah, Raj Devnathblogapache hudiawsbeginneraws glueaws athenatime travel queryclusteringcompactionaws s3apache icebergdelta lake
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake FormationJanuary 17, 2024 byRaymond Lai, Aditya Shah, Bin Wang, and Melody Yangblogapache hudiawsintermediateamazon emraws lake formationaws glueaws s3amazon sagemakeraws cloud9amazon athenaaccess control
Load data incrementally from transactional data lakes to data warehousesOctober 19, 2023 byNoritaka Sekiyamaincremental updatesamazonhow toqueryingawsamazon redshiftapache hudi
Run Apache Hudi at scale on AWSDecember 1, 2022 byImtiaz Sayed,,Shana Schipers,Dylan Qu,Carlos Rodrigues,Arun A KandFrancisco Morilloawsguideapache hudi