Empowering data-driven excellence: How the Bluestone Data Platform embraced data mesh for successFebruary 27, 2024 by Toney Thomas, Ben Vengerovsky and Rada Stanicdata meshaws
Enabling near real-time data analytics on the data lakeFebruary 23, 2024 by Shi Kai Ng and Shuguang Xiangbimorgrab
How a POC became a production-ready Hudi data lakehouse through close team collaborationFebruary 12, 2024 by Xiaoxiao Rey and Hussein Awalaleboncoinbeginnergdprdml
Building an Open Source Data Lake House with Hudi, Postgres Hive Metastore, Minio, and StarRocksFebruary 6, 2024 by Soumil Shahbeginnerapache sparkapache hiveminiostarrocksdockerpythonpostgres
Combine Transactional Integrity and Data Lake Operations with YugabyteDB and Apache HudiFebruary 6, 2024 by Balachandar Seetharamanaciddata lakehousecdcetlyugabyte
Apache Hudi: Managing Partition on a petabyte-scale tableFebruary 4, 2024 by Krishna Prasadawsapache spark
Leverage Partition Paths of your data lake tables to Optimize Data Retrieval Costs on the cloudJanuary 30, 2024 by Krishna Prasadawsperformanceapache spark
Use Amazon Athena with Spark SQL for your open-source transactional table formatsJanuary 24, 2024 by Pathik Shah, Raj Devnathbeginnerqueryingclusteringcompactionapache icebergawsdelta lake
Data Engineering: Bootstrapping Data lake with Apache HudiJanuary 20, 2024 by Krishna Prasadbeginneretlawsapache spark
Learn How to Move Data From MongoDB to Apache Hudi Using PySparkJanuary 20, 2024 by Soumil Shahbeginnermongodbapache spark
Deleting Items from Apache Hudi using Delta Streamer in UPSERT Mode with Kafka Avro MessagesJanuary 18, 2024 by Soumil Shahbeginnerhudi streamerapache kafkaapache avrodml
Enforce fine-grained access control on Open Table Formats via Amazon EMR integrated with AWS Lake FormationJanuary 17, 2024 by Raymond Lai, Aditya Shah, Bin Wang, and Melody Yangawsaccess control