Hadoop
System).
In Dec 2004, Google released a paper on MapReduce.
In 2006, Yahoo created Hadoop, based on GFS and MapReduce, with
Doug Cutting and his team.
In 2007, Yahoo started using Hadoop on a 1,000-node cluster.
In Jan 2008, Yahoo released Hadoop as an open-source project to
the Apache Software Foundation.
Doug Cutting commented on Google’s contribution to the development of
the Hadoop framework:
▪ “Google is living a few years in the future and sending the rest of us
messages.”
Let's understand the problems associated with Big Data and
how Hadoop solved them.
The first problem is storing the colossal amount of data.
The second problem is storing heterogeneous data.
The third problem is processing speed.