Hadoop is mainly use to handle the big data so simple example is word count program is here. In this program using MapReduce concept we can count the words and reduce the repeated words and reduce the file size and save it into HDFS . MapReduce using key and value pair concept. So next time when we search anything from
Hadoop is an Apache open source software library written completely in Java, designed to deliver a distributed file system (HDFS) and a method for distributed computation called MapReduce. It can scale up from single servers to thousands of machines. It provides a framework that allows distributed processing of large sets of data across clusters of computers using simple programming models.
Recent Comments