- Overview
- Alternatives
- Pros & Cons
- Compare
Apache Hudi is a free and open source tool for ingesting and managing storage of large analytical datasets over DFS. With Hudi you can stream and process bid data by an order of magnitude faster than traditional batch processing. Apache Hudi work with Apache Spark and integrates well with third-party services like Amazon EMR, AWS Glue Catalog, AWS Lambda, Hadoop Distributed File System (HDFS), Google Bigtable, WebHDFS and Google Cloud Tasks.