Hadoop is an open source framework for writing and running distributed applications that process large amounts of data.
The Hadoop platform was designed to solve problems where you have a lot of data.
Hadoop is designed to run on a large number of machines that don’t share any memory or disks. That means you can buy a whole bunch of commodity servers, slap them in a rack, and run the Hadoop software on each one.