- Overview
- Alternatives
- Pros & Cons
- Compare
Apache Hadoop is a free and open source software library that allows for the distributed processing of large data sets across clusters of computers using simple programming models. Apache Hadoop is designed to scale well, from single servers to thousands of machines offering local computation and storage. With Apache Hadoop you can store any kind of data and utilize the enormous processing power with the ability to handle virtually limitless concurrent tasks or jobs.