. . "We all know that Apache Hadoop is an open source framework that allows distributed processing of large sets of data set across different clusters using simple programming." . .