Module - 3 - Session - 1 The History of Hadoop
Module - 3 - Session - 1 The History of Hadoop
being used by many other companies besides Yahoo!, such as Facebook, and the
New York Times.
In April 2008, Hadoop broke a world record to become the fastest system
to sort an entire terabyte of data. Running on a 910-node cluster, Hadoop sorted
1 terabyte in 209 seconds (just under 3.5 minutes), beating the previous year’s
winner of 297 seconds. In November of the same year, Google reported that its
MapReduce implementation sorted 1 terabyte in 68 seconds. Then, in April 2009,
it was announced that a team at Yahoo! Had used Hadoop to sort 1 terabyte in 62
seconds.
Today, Hadoop is widely used in mainstream enterprises. Hadoop’s role as a
general purpose storage and analysis platform for big data has been recognized by
the industry, and this fact is reflected in the number of products that use or
incorporate Hadoop in some way. Commercial Hadoop support is available from
large, established enterprise vendors, including EMC, IBM, Microsoft, and
Oracle, as well as from specialist Hadoop companies such as Cloudera,
Hortonworks, etc.