0% found this document useful (0 votes)
56 views14 pages

Presented by Theerthana.H Pradeepa.A

The document discusses big data, including its volume, velocity, and variety characteristics. It notes that big data is data that is too large to process with traditional database tools. Examples are provided of the massive amounts of data generated every day by companies like Facebook and from sensors. Big data analytics can provide competitive advantages and better business decisions. Open source tools like Hadoop, Spark, and Cassandra are commonly used for big data. Challenges include needing specialized computing power and integrating diverse data sources, while benefits include targeting customer outcomes and building a better information ecosystem.

Uploaded by

meena
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
56 views14 pages

Presented by Theerthana.H Pradeepa.A

The document discusses big data, including its volume, velocity, and variety characteristics. It notes that big data is data that is too large to process with traditional database tools. Examples are provided of the massive amounts of data generated every day by companies like Facebook and from sensors. Big data analytics can provide competitive advantages and better business decisions. Open source tools like Hadoop, Spark, and Cassandra are commonly used for big data. Challenges include needing specialized computing power and integrating diverse data sources, while benefits include targeting customer outcomes and building a better information ecosystem.

Uploaded by

meena
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 14

PRESENTED BY

THEERTHANA.H
PRADEEPA.A
Can you think of?
 Can you think of running a query on 20,980,000 GB file.
 What if we get a new data set like this, every day?
 What if we need to execute complex queries on this data set everyday ?
 Does anybody really deal with this type of data set?
 Is it possible to store and analyze this data?
 Yes Google deals with more than 20 PB data everyday
What is Bigdata?
 Collection of data sets so large and complexthat it becomes difficult to
process using on-hand database management tools or traditional data
processing applications

 “Big Data” is the data whose scale, diversity, and complexity require new
architecture, techniques, algorithms, and analytics to manage it and extract
value and hidden knowledge from it͙
 ‘Big Data’ is similar to ‘small data’, but bigger in size

 An aim to solve new problems or old problems in a better way

 Big Data generates value from the storage and processing of very large
quantities of digital information that cannot be analyzed with traditional
computing techniques.
Characteristics of Bigdata:
1st Character of Bigdata Volume
 A typical PC might have had 10 gigabytes of storage in 2000.

 Today, Face book ingests 500 terabytes of new data every day.

 Boeing 737 will generate 240 terabytes of flight data during a


single flight across the US.

 The smart phones, the data they create and consume; sensors
embedded into everyday objects will soon result in billions of
new, constantly-updated data feeds containing environmental,
location, and other information, including video.
2nd Character of Bigdata Velocity
 Click streams and ad impressions capture user behavior at millions of
events per second

 high-frequency stock trading algorithms reflect market changes within


microseconds

 machine to machine processes exchange data between billions of


devices

 infrastructure and sensors generate massive log data in real-time

 on-line gaming systems support millions of concurrent users, each


producing multiple inputs per second.
3rd Character of Bigdata Variety

 Big Data isn't just numbers, dates, and strings. Big Data is also geospatial
data, 3D data, audio and video, and unstructured text, including log files and
social media.

 Traditional database systems were designed to address smaller volumes of


structured data, fewer updates or a predictable, consistent data structure.

 Big Data analysis includes different types of data


Bigdata Analytics
 Examining large amount of data
 Appropriate information

 Identification of hidden patterns

 Competitive advantage

 Better business decisions: strategic and operational

 Effective marketing, customer satisfaction, increased revenue


Bigdata Tools
 8 Open Source Big Data Tools to use in 2018

 Apache Hadoop. The long-standing champion in the field of Big Data


processing, well-known for its capabilities for huge-scale data processing.
...

 Apache Spark. ...


 Apache Storm. ...
 Apache Cassandra. ...
 MongoDB. ...
 R Programming Environment. ...
 Neo4j. ...
 Apache SAMOA.
Bigdata Applications
Challenges of Bigdata
 It requires special computer power.
 Using real time insights requires a different way of working within your
organization.
 Dealing with data growth.
 Validating data.
 Integrating disparate data sources
 Organizational resistance.
Risks in Bigdata
 Here are the five biggest risks that big data presents for digital
enterprises.
 Unorganized data.
 Data storage and retention.
 Cost management.
 Incompetent analytics.
 Data privacy.
Benefits of Bigdata
 Our newest research finds that organizations are using big data to target
customer-centric outcomes, tap into internal data and build a better
information ecosystem.

 Big Data is already an important part of the $64 billion database and data
analytics market

 It offers commercial opportunities of a comparable

 scale to enterprise software in the late 1980s

 And the Internet boom of the 1990s, and the social media explosion of
today.

You might also like