In this tutorial, we’ll install Hadoop in stand-alone mode and run one of the example example MapReduce programs it includes to verify the installation. Hadoop clusters are relatively complex to set up, so the project includes a stand-alone mode which is suitable for learning about Hadoop, performing simple operations, and debugging. Many other processing models are available for the 2.x version of Hadoop. It distributes work within the cluster or map, then organizes and reduces the results from the nodes into a response to a query. Hadoop 2.7 is comprised of four main layers: It was the first major open source project in the big data playing field and is sponsored by the Apache Software Foundation.
Hadoop is a Java-based programming framework that supports the processing and storage of extremely large datasets on a cluster of inexpensive machines.