Presented by Theerthana.H Pradeepa.A
Presented by Theerthana.H Pradeepa.A
THEERTHANA.H
PRADEEPA.A
Can you think of?
Can you think of running a query on 20,980,000 GB file.
What if we get a new data set like this, every day?
What if we need to execute complex queries on this data set everyday ?
Does anybody really deal with this type of data set?
Is it possible to store and analyze this data?
Yes Google deals with more than 20 PB data everyday
What is Bigdata?
Collection of data sets so large and complexthat it becomes difficult to
process using on-hand database management tools or traditional data
processing applications
“Big Data” is the data whose scale, diversity, and complexity require new
architecture, techniques, algorithms, and analytics to manage it and extract
value and hidden knowledge from it͙
‘Big Data’ is similar to ‘small data’, but bigger in size
Big Data generates value from the storage and processing of very large
quantities of digital information that cannot be analyzed with traditional
computing techniques.
Characteristics of Bigdata:
1st Character of Bigdata Volume
A typical PC might have had 10 gigabytes of storage in 2000.
Today, Face book ingests 500 terabytes of new data every day.
The smart phones, the data they create and consume; sensors
embedded into everyday objects will soon result in billions of
new, constantly-updated data feeds containing environmental,
location, and other information, including video.
2nd Character of Bigdata Velocity
Click streams and ad impressions capture user behavior at millions of
events per second
Big Data isn't just numbers, dates, and strings. Big Data is also geospatial
data, 3D data, audio and video, and unstructured text, including log files and
social media.
Competitive advantage
Big Data is already an important part of the $64 billion database and data
analytics market
And the Internet boom of the 1990s, and the social media explosion of
today.