The document provides an overview of big data including what big data is, the three V's of big data which are variety, velocity, and volume, and opportunities with big data such as making better decisions, discovering insights, and automating processes. Big data refers to extremely large datasets that are difficult to manage and analyze using traditional tools and comes in diverse structured, semi-structured, and unstructured formats generated at high speeds in large volumes.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0 ratings0% found this document useful (0 votes)
36 views8 pages
Lesson 1 Overview of Big Data
The document provides an overview of big data including what big data is, the three V's of big data which are variety, velocity, and volume, and opportunities with big data such as making better decisions, discovering insights, and automating processes. Big data refers to extremely large datasets that are difficult to manage and analyze using traditional tools and comes in diverse structured, semi-structured, and unstructured formats generated at high speeds in large volumes.
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8
Lesson 1:
Overview of Big Data
What is Big Data? • Big data refers to extremely large and complex datasets that cannot be easily managed, processed, or analyzed using traditional data processing tools. • 100,000 tweets • 204 million emails • 6 million views and 277,000 Facebook sent Logins • 61,000 hours of music • 2+ million Google searches listened to on Spotify • 3 million uploads on Flickr • 20 million photo views Three V’s of Big Data • 1. Variety: Big data comes in diverse formats. It includes structured data (e.g., relational databases), semi-structured data (e.g., JSON, XML), and unstructured data (e.g., text, images, videos). Managing and analyzing such diverse data types is a key challenge in big data processing. Sample Relational Databases Three V’s of Big Data 2. Velocity: Big data is generated at a high speed. The data is produced rapidly and continuously, often in real-time. Three V’s of Big Data • 3. Volume: Big data involves a vast amount of data. This could be terabytes, petabytes, or even exabytes of information, far beyond what traditional databases can handle. Big Data Opportunities
Making better informed decisions
e.g. strategies, recommendations
Discovering hidden insights
e.g. anomalies, patterns, trends
Automating business processes
e.g. complex events, translation, subscribing to SAAS. Q&A and Discussion