Apache Hudi

Apache Hudi is a free and open-source tool for ingesting and managing storage of large analytical datasets over distributed file systems (DFS). With Hudi you can stream and process big data an order of magnitude faster than with traditional batch processing. Apache Hudi works with Apache Spark and integrates well with third-party services like Amazon EMR, AWS Glue Catalog, AWS Lambda, Hadoop Distributed File System (HDFS), Google Bigtable, WebHDFS and Google Cloud Tasks.

Apache Hudi Specifications

Category: Development & DevOps

Platform: Self-Hosted

Pricing Model: Free

Apache Hudi Recommendations